PDF data extraction Automation

Hello Team

I need a template/flowchart for PDF automation whereas I can extract the data from PDF and create an excel file. When I am trying to extract data from PDF invoice the selectors are not working properly and entire document is getting selected instead of specific element although the PDF is in text format.

Thank you in advance

Regards
Aditya

Hi.
can you provide de PDF file ?

Thank you for looking in to it. Here is the attached sample PDF.

Regards
Aditya
Invoice 23216883-XYZ-Limited.pdf (43.0 KB)

i will do my best, what data do you want to retrieve ?

I want to extract the below data
Bill To
Invoice number
Invoice date
Description
Total

Regards
Aditya

BlankProcess15.7z (399.5 KB)
For this case i believe you should get the text by ocr and then use regex to get the data.

See the attachment (invoice number and invoice date → done)

Thank you. I am unable to open the attachment. Is there any way I can open the file

Regards
Aditya

use 7-Zip

new upload in .zip in case you can’t download the program.
BlankProcess15.zip (456.0 KB)

1 Like