PDF file with 2000 pages


#1

Hi Team,

I have some 2000 pages pdf and need to identify few values from the PDF. How actually should I do the automation in this scenario without affecting the bot performance?

Thanks,
Ulaga


#2

It’s hard to say without knowing all the details of your scenario.

I would focus on making sure I can consistently get the values I need consistently first, then work on improving the performance.

The simplest way to keep performance in mind from the beginning is to try and see if you can use the Read PDF Text rather than the Read PDF With OCR.


#3

Can’t say if it helps in your scenario, but refer this post:


#4

Thank you @vvaidya