Unable to get data from multiple pages in single pdf- Abbyy OCR

Pkotla · November 5, 2019, 2:33pm

Hello all,

I am trying to scrape information from a 4-page PDF document into a txt file. I decided to try Abbyy Cloud OCR (trial version) with UiPath 2018.4. Every time I run the bot, only one page gets scraped, even though the Range is “All”, which is the default. I also tried changing the range to “1-4”, but it still doesn’t work as expected. Can someone help me with this?

Palaniyappan · November 5, 2019, 2:40pm

Hi,
Welcome to the UiPath community.
This should actually work when “All” is set in the range.
Did you try using the normal Read PDF With OCR activity?
Cheers @Pkotla

Pkotla · November 5, 2019, 3:10pm

Yes, I did! It’s a pretty straightforward bot. I don’t know why it is acting so weird.

I can see a similar issue in the post below as well.

Palaniyappan · November 5, 2019, 3:22pm

Fine.
Kindly try once with another OCR engine such as Google or Microsoft, and write the string output to a text file.

Cheers @Pkotla

Pkotla · November 5, 2019, 3:47pm

I tried using Google and Microsoft OCR. The results were not comparable to Abbyy OCR.

Here are a few observations from working with Google and Microsoft OCR:

Neither OCR could scrape all the pages.
The accuracy and alignment of the text were nowhere close to Abbyy's.
It took much longer than usual.
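
To narrow down which pages are actually being dropped, one option is to OCR each page on its own and write the results to a single text file with a marker per page. Below is a minimal, language-neutral sketch in Python, not UiPath or Abbyy code. The range syntax (`"All"`, `"1-4"`, `"1,3-4"`) is an assumption based on the values mentioned in this thread, and `ocr_page` is a hypothetical callable standing in for whichever OCR engine is used.

```python
from pathlib import Path
from typing import Callable, List


def expand_range(spec: str, page_count: int) -> List[int]:
    """Turn a range string such as "All", "1-4" or "1,3-4" into page numbers.

    The accepted syntax is an assumption for illustration only.
    """
    spec = spec.strip()
    if spec.lower() == "all":
        return list(range(1, page_count + 1))
    pages: List[int] = []
    for part in spec.split(","):
        part = part.strip()
        if "-" in part:
            start, end = (int(p) for p in part.split("-", 1))
            pages.extend(range(start, end + 1))
        else:
            pages.append(int(part))
    # Drop anything that falls outside the document.
    return [p for p in pages if 1 <= p <= page_count]


def ocr_pages_to_text(
    ocr_page: Callable[[int], str],  # hypothetical: returns the text of one page
    spec: str,
    page_count: int,
    out_path: str,
) -> List[int]:
    """OCR each requested page separately and write one combined text file.

    Returns the page numbers whose OCR result came back empty.
    """
    missing: List[int] = []
    chunks: List[str] = []
    for page in expand_range(spec, page_count):
        text = ocr_page(page).strip()
        if not text:
            missing.append(page)
        # A marker per page makes gaps easy to spot in the output file.
        chunks.append(f"===== Page {page} =====\n{text}\n")
    Path(out_path).write_text("\n".join(chunks), encoding="utf-8")
    return missing
```

If the returned list is non-empty, or the output file shows markers with no text under them, that points at the specific pages the OCR engine is skipping rather than at the range setting.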
