How to use the IntelligentOCR Package

Ioana_Gligan · September 12, 2019, 10:41am

The IntelligentOCR package allows users to perform document processing in their workflows, with some out-of-the-box functionality available for usage, as well as the framework required for building your own document classification and data extraction components.

To run the workflows, you must add your own ApiKey for Document Understanding from https://platform.uipath.com.

EDIT1: Action Center Integration Sample
Here is a sample workflow that uses an end to end document understanding processing workflow, and uses Action Center Integration for Human Validation:
SampleDUActionCenterIntegration-New.zip (620.8 KB)

ORIGINAL POSTING: Sample Document Understanding Basic Usage
Here is a sample workflow that performs:

digitization, using the OmniPage OCR engine available in UiPath,
document classification, using the Keyword Based Classifier,
data extraction, using both the Regex Based Extractor as well as the Machine Learning Extractor available for processing Invoices and Receipts
data validation, using the Present Validation Station attended activity, and
classifier training, for the Keyword Based Classifier.

Please note that the Taxonomy (list of document types and associated fields) is editable using the Taxonomy Manager wizard (wizard ribbon after the IntelligentOCR package is installed).
DocumentProcessing_IntelligentOCR300.zip (956.5 KB)

USEFUL RESOURCES:

https://docs.uipath.com/activities/docs/about-the-intelligentocr-activities-pack (documentation on each IntelligentOCR activity)
Receipt and Invoice AI - Now available in Public Preview! (documentation on the Machine Learning Extractor)
GitHub - UiPath/Document-Processing-Code-Samples: Code samples for document processing activities. (how to build your own classifier and extractor sample project)
https://docs.uipath.com/activities/docs/about-the-uipathdocumentprocessingcontracts (document processing contracts documentation)

Looking forward to hearing your feedback!

Ioana

Sriram07 · September 13, 2019, 7:55am

@Ioana_Gligan

Can we process pdf with more than 2 pages? because i found that some documents contains more than 2 pages. can we process all the pages ? using this extractor

Ioana_Gligan · September 13, 2019, 8:53am

Not using the community edition. The community edition is limited to documents of maximum two pages.

Luis_Martirena · September 30, 2019, 9:14pm

Thank you @Ioana_Gligan Ioana_Gligan
Do you have one of this with the “Extract semi-structured document” activity? I can’t make it works…

Roboz · October 1, 2019, 9:45am

Can we make the model learning using the classifiers and learning activities mentioned in your workflow?
Like if trains one time and next time for the same file it should be giving actual outputs(Trained + Original)
@Ioana_Gligan

Ioana_Gligan · October 7, 2019, 10:38am

Hello @Roboz,

Yes, model training / retraining can be enabled using the Train Classifiers Scope and Train Extractors Scope. Make sure to check out the documentation on how to write your own training activity for an extractor or classifier.

THanks,

Ioana

vmariejeanne · October 8, 2019, 12:12pm

Hello @Ioana_Gligan,
Where can we have the documentation on how to train and retrain?

Thanks,
Vincent

Lahiru.Fernando · October 8, 2019, 1:52pm

Hey @Ioana_Gligan

For training extractors, do we need to write our own extractor activities? Why I asked is because I don’t see a trainable extractor activity.
Machine learning extractor is already trained for a set of fields. Regex we got to code. What are the activities that we can train under train extractor scope?

Abhishek14 · October 9, 2019, 4:47am

Hi,

I m trying to use execute the workflow which u have attached.
But it is showing me unresolved Activity after installing Intelligent OCR activities.
i.e after Load Taxonomy…
kindly suggest which other package needs to be installed along with Intelligent OCR activity.

loginerror · October 9, 2019, 6:25am

Hi @Abhishek14

Could you provide a screenshot?
Normally you can right click on the broken dependency and select “Repair” for it to be corrected.

You should also remember to open the project from the project.json file, which will read all the dependencies required to run it and download them automatically.

Abhishek14 · October 9, 2019, 6:53am

Screenshot%20(107)

Gone through all the errors,Installed all the packages required
But I am unable to find the Machine Learning Extractor Package in Manage Activities?

Lahiru.Fernando · October 9, 2019, 7:09am

Hey @Abhishek14

Get the beta feed added to your package manager and search for that in that feed. Also enable pre release option and you’ll find it there

Ioana_Gligan · October 11, 2019, 3:15am

Hello @Lahiru.Fernando and @vmariejeanne,

In order to train extractors, you currently have to build your own

The machine learning extractor is pre-trained and does not expose the re-training capability at this moment.

If you have an in-house algorithm capable of learning, it is very easy to enable the feedback loop, but you do have to write your own training activity.

We will keep you posted when any out of the box options appear.

Thanks,

ioana

Foertsch · October 11, 2019, 8:00am

Hello, @Ioana_Gligan!
Api Key taken from orchestrator? If I only have Attended license of robot, without orchestrator?

loginerror · October 11, 2019, 8:09am

Hi @Foertsch

To get the API key, please navigate to the Licenses tab of your Cloud Account (not your Orchestrator instance):

Foertsch · October 11, 2019, 9:15am

Hello, @loginerror
Thanks for help!
Can i ask, Taxonomy editor and Keyword Based Classifier support cyrillic?
Let me make it simple: Intelligent OCR activities support cyrillic?
Thanks.

Ioana_Gligan · October 11, 2019, 1:03pm

Hello @Foertsch,

IntelligentOCR is language agnostic. You can define documen ttypes in cyrillic using the txonomy manager, they should be properly displayed in all wizards and in the validation station, keyword based classifier is language and alphabet agnostic… as long as it’s representable in UTF-8.

I have to warn you though that DIgitize Document is optimized for left to right top to bottom writing, and works best for latin languages… We will be optimizing this for other languages / alphabets as well.

Ioana

Pankit · October 15, 2019, 6:52am

I tried using the IntelligentOCR Package… Trained 5 invoices… is there a way to remove Present validation Step after training for a few times? Can I re-use the learning file without using Validation step for new invoices with same format?

@loginerror
@loana_Gligan
@alexcabuz

Foertsch · October 22, 2019, 12:03pm

Hello!
How to create file .json for activity Keyword Based Classifier? Is it created from UiPath itenface, like a taxonomy file from taxonomy editor?
Thanks.

Foertsch · October 22, 2019, 12:19pm

I’m sorry, i found solution for my question:
" The activity does not automatically create a file at the specified location. A best practice is to create an empty .JSON file at that location."

Topic		Replies	Views
Document Understanding: Document Splitting and Other Wonderful Stories :) Document Understanding	65	11484	January 15, 2022
Document Understanding: New Human-Robot Levels Available :) Product News news , document_understanding	51	6657	March 1, 2022
Receipt and Invoice AI - Now available in Public Preview! Document Understanding preview	196	52632	May 5, 2023
Document Processing 20.4 Beta: Human-Robot Interaction using Action Center Product News news	86	13779	November 16, 2021
Any demo video/tutorial available for Extract Semi-Structured Document Activity? Help studio	18	3723	April 20, 2020

How to use the IntelligentOCR Package

Related topics