How to extract data from multiple pdf

how to extract same field value in multiple pdfs?For example: invoice number :101234 in first.pdf, invoice number: 1012345 in second.pdf and so on…

Question: In for each i m reading each pdf file. then i want to extract each field and value present in the pdf into a excel sheet. Please help me

Hi Susanna

My understanding is that you need to do following:

1- Put all files in a folder.
2- Apply a loop
3- Open each file
4- Read required field using OCR/screen scrapping technique.
5- Assign that invoice number to an array.
6- Write that data in to an excel file.

Hope this makes sense.


1 Like

There should be an option to scrape from multiple pages, Then you just specify what information you want to scrape from each page.

Hi I’m relatively new to UiPath. I’m stuck on 3, how do you sequentially open pdf files in an application. Can you for example specify to open adobe acrobat each time with another file and then do screen scraping?

Hello Diabolique619,

I am attempting to do this and was able to achieve steps 1-3. Which screen scrapping/OCR technique would I use in order to apply it to all PDFs within the file?

hi Heathersmithx have you found this solution that mentioned by Diabolique619 ?

@umair_hanif - here is a possible solution: How to use the IntelligentOCR Package