PDF Invoice extraction - Multiline data extraction to excel

Hello everyone,

I have spent countless hours looking on the forum and on youtube for a solution to this problem. Any help would be greatly appreciated.

Scope: I received hundreds of PDF’s containing 1-3 line items in a Purchase order

Challenge: I need to extract the following data for each purchase order
supplier
PO NO
Delivery date
and 1-3 line items based on customer purchases

Below is an example of a PO I found online along with the desired output in excel. I have tried form extraction, machine learning and regex. I have not been successful.

Any support would be greatly appreciated. I would really appreciate a sample workflow if possible

Hi @MF.RPA

You can Assign Activity

System.Text.RegularExpressions.Regex.Matches(Input_String,“Regex_Expression_Link”).Tostring

This Regex for a Date

This Regex for PO No

For Line Item You can try with Data Scraping

Regards
Gokul