Hi @Ioana_Gligan, I’m back again. I saw the latest IntelligentOCR release and it’s really cool!
I have a couple of questions / suggestions.
1 - Why do we need to extract a value first in order to be able to select it?
For example, I would like to be able to just choose the value of “Yes” or “No” for a boolean value without extracting any data or matching it to a value on the document. Is this possible?
Another example would be letting the person validating the document type in a note, without that note having to be extracted from or linked to anything on the document.
And another example would be selecting an item from a set without needing the value to be extracted or linked to the document.
As a workaround, I’ve been using a RegEx extractor with the pattern “(a)” (without the quotation marks) for these fields. It matches the letter a somewhere on the document, and then the person validating can select the real value. But this is kind of messy…
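To make the workaround concrete, here is a minimal sketch of what the pattern does. It uses Python's `re` module purely for illustration (the RegEx extractor in UiPath runs on .NET, so this is not how the activity is implemented), and `find_placeholder` and the sample text are hypothetical names made up for the example.

```python
import re

def find_placeholder(document_text):
    """Return the first match of the workaround pattern "(a)", or None.

    Hypothetical helper: it only shows what the pattern captures,
    not how the RegEx extractor itself is built.
    """
    return re.search(r"(a)", document_text)

# Hypothetical OCR text standing in for a real document.
match = find_placeholder("Purchase agreement")
if match:
    # The captured "a" is only an anchor on the page; the validator
    # replaces it with the real value (Yes/No, a note, a set item).
    print(match.group(1), match.start())
```

One reason this feels messy: the field only exists if the text happens to contain a lowercase “a”, so a document without one would yield no match at all.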
2 - How exactly should we use ‘keyword sets’ in the Keyword Based Classifier?
I’ve played around with it a lot. I have over 40 different document types I am classifying. It seems to work better for me to keep all of the keywords for a Document Type in a single ‘keyword set’, as opposed to splitting them across multiple ‘keyword sets’ for that document type.
I notice that classification accuracy drops when I use multiple sets, and improves when I keep all of the keywords in one set.
KEYWORDS IN A SINGLE SET: [screenshot]
KEYWORDS IN MULTIPLE SETS: [screenshot]
When should we use more than 1 keyword set for a document?
What are your suggestions for properly classifying documents in this way?
3 - If the document classification is incorrect and we manually update the document type, is it possible to run the extractor again with the new document type?
In this case, I’d rather not manually input all of the details just because the classification was wrong. Ideally, UiPath would run the extraction again with the corrected document type and save the time spent choosing all of the values by hand.