Hi @Ioana_Gligan, Iâm back again. I saw the latest IntelligentOCR release and itâs really cool!
I have a couple of questions / suggestions.
1 - Why do we need to extract a value first in order to be able to select it?
For example, I would like to be able to just choose the value of âYesâ or âNoâ for a boolean value without extracting any data or matching it to a value on the document. Is this possible?
Another example for that would be for the person validating the document to be able to type in a note for the document without having to be extracted or linked to anything on the document.
And another example would be selecting an item from a set without needing the value to be extracted or linked to the document.
As a kind of workaround, Iâve been using a RegEx extractor for these values of â(a)â without the quotations. This will match the letter a, then the person can select a value. But this is kind of messyâŚ
2 - How exactly should we use âkeyword setsâ in the Keyword Based Classifier?
Iâve played around with it a lot. I have over 40 different document types I am classifying. It seems that itâs better for me to keep all of my keywords in a single âkeyword setâ for a single Document Type as opposed to having multiple âkeyword setsâ for each document type.
I notice that when I use multiple sets that the accuracy of the classification is reduced, but when I keep all of the key words in one set, accuracy of classification is better.
KEYWORDS IN A SINGLE SET:
KEYWORDS IN MULTIPLE SETS:
When should we use more than 1 keyword set for a document?
What are your suggestions for properly classifying documents in this way?
3 - If the document classification is incorrect and we manually update the document type, is it possible to run the extractor again with the new document type?
For this case, I just donât want to manually input all details if the classification is wrong. Ideally, UiPath would be able to go through the extraction again, and save time on choosing all of the values.