OCR Engines performance

Velthurai · April 29, 2020, 4:18pm

Hi,

As of now, the following are the OCR engines available in UI Path.

1.Microsoft
2.Omnipage
3.Tesseract
4.Abby
5.Abby Cloud
6.Microsoft Project Oxford Online
7.Microsoft Azure Computer Vision
8.Google Cloud Vision

The question is which engine is best suitable to read Scanned (image based) PDFs as of now?

I have found this article and tested few samples of scanned documents with all above mentioned OCRs except Abby.

Can you en-light based on your past work on OCRs ,which will help me in choosing the right engine (Free on-premises or Paid on-premises or Cloud)?

Ioana_Gligan · April 30, 2020, 11:04am

Hello @Velthurai,

There is one more, the OmniPage OCR activity found in the UiPath.OmniPage.Activities package.

Unfortunately I cannot provide a comparison between the engines, as each of them has their advantages and disadvantages, and the best approach for choosing one is to actually gather a relevant sample set for your specific use case, and test them all out. This is the safest and sanest approach to go with, as in some cases one engine might perform better than others.

Ioana

Topic		Replies	Views
OCR Engines Academy Feedback activities	4	1174	September 4, 2020
OCR comparison Help	7	14628	July 18, 2019
Different type of ocr Academy Feedback activities	2	793	September 15, 2020
Query on OCR engines Activities pdf	1	838	January 16, 2022
Which OCR engines are free? Studio studio , question , activities_panel	4	4858	February 14, 2024

OCR Engines performance

Related topics