Dear all,
Good day,
I have a multiple pdf invoices(different Vendor) placed in the folder. example: Here, BOT must read the invoice number, Date , amount , address(BillTo) from the pdf’s and enter it into the Excel.
attached different invoices screenshot FYI
Please help me out
Please any one guide me how to design the flow or please share the flow
Kindly assist.
Thank you
wordpress.pdf (42.6 KB)
Invoice 1.pdf (28.8 KB)
invoice_200136.pdf (15.2 KB)
646412-Mellicent-Ivoshin.pdf (46.6 KB)
aliaga
(Ali Aga Mustofa)
March 24, 2023, 4:08am
2
Hi @Rohith_Shastri ,
Could you please inform us what method will you use for this case?
Extract text from PDF, OCR, etc.
Regards,
Text from Pdf like Invoice number, date, address, total amount only these fields
Extract Text From Pdf
Extracted Data like Invoice number, Date, Address, Total Amount after extract store it into the excel
aliaga
(Ali Aga Mustofa)
March 24, 2023, 4:20am
5
Hi @Rohith_Shastri ,
For this case you can use this step :
Foreach all pdf file
Use activity Text from PDF
Get first line of the output no 2
Using value no 3 you can use switch activity to determine whic template invoice will be process
Based on no 4, create several case for all template.
In specific case you can Regular Expression to get the value from specific field.
Please take a look from this post :
Hi,
Can you try the following? This removes lines which starts with “https://” or “(number) of (number) (date)”
System.Text.RegularExpressions.Regex.Replace(yourString,"(?<=^|\n)(https?://|\d+\s+of\s+\d+\s+\d+/\d+/\d+).*\n","")
Regards,
Regards,
Could you please send the complete flow by using above pdf