Different format PDF

Hello.

Is it possible to get the invoice number, amount, and other details from PDFs in different formats?

I need to put the invoice data into an Excel sheet, but each PDF has a different format.

thanks

Hi @khaled_emad ,

The right approach depends on the details of your case:

  1. If the documents are digital (text-based) PDFs and come in a fixed set of templates, you could use string manipulation or Regex to extract the data, then check that the extraction works properly for each template.
  2. If the documents may not be digital (that is, some may be scanned documents or images), or if you may receive an arbitrary number of document formats, you would have to opt for Document Understanding capabilities.
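
As a rough illustration of case 1, here is a minimal Python sketch of template-based extraction (in a UiPath workflow the same idea would use string/Regex activities). The template names, marker strings, and patterns are all hypothetical and would have to be written from your real invoices.

```python
import re

# Hypothetical templates: a marker string identifies the layout,
# and each layout has its own patterns for the fields we want.
TEMPLATES = {
    "vendor_a": {
        "marker": "ACME Supplies",
        "invoice_number": r"Invoice\s+No:\s*(\S+)",
        "amount": r"Total:\s*([\d.,]+)",
    },
    "vendor_b": {
        "marker": "Globex GmbH",
        "invoice_number": r"Rechnung\s+Nr\.\s*(\S+)",
        "amount": r"Gesamtbetrag\s*([\d.,]+)",
    },
}

def extract_by_template(text):
    """Return the fields for the first template whose marker appears in text."""
    for name, template in TEMPLATES.items():
        if template["marker"] in text:
            fields = {"template": name}
            for key in ("invoice_number", "amount"):
                match = re.search(template[key], text)
                fields[key] = match.group(1) if match else None
            return fields
    return None  # unknown layout: a candidate for case 2

sample = "Globex GmbH\nRechnung Nr. 7781\nGesamtbetrag 99,00 EUR"
result = extract_by_template(sample)
```

A `None` result means no known template matched, which is a signal that the fixed-template assumption no longer holds for that document.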

Let us know which case fits your use case. If Document Understanding (DU) turns out to be the best option, also check the forum posts on Document Understanding and the courses on the Academy.

Yeah, it's case 2: scanned invoices.

You can use UiPath Document Understanding to get data from PDFs in different formats. You can find Document Understanding courses on UiPath Academy.

Or, if the documents have a pattern or keywords before the data to extract, you can try the Read PDF With OCR activity to read the PDF text, then use Regex or string manipulation to extract the required data points from the text.
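
To show what the keyword-based Regex step might look like once the OCR text is available, here is a small Python sketch. The sample text, field names, and patterns are assumptions for illustration only; real OCR output is noisier and the patterns would need tuning against your invoices.

```python
import re

# Hypothetical OCR output; in practice this comes from the OCR step.
ocr_text = """
ACME Supplies Ltd.
Invoice No: INV-2023-0042
Date: 12/03/2023
Total Amount: $1,234.50
"""

# Hypothetical patterns: each anchors on a keyword that precedes the value
# and tolerates variable spacing and an optional colon or dash.
PATTERNS = {
    "invoice_number": re.compile(
        r"Invoice\s*(?:Number|No|#)\.?\s*[:\-]?\s*([A-Z0-9\-]+)", re.IGNORECASE
    ),
    "total": re.compile(
        r"Total(?:\s+Amount)?\s*[:\-]?\s*[$€£]?\s*([\d,]+\.\d{2})", re.IGNORECASE
    ),
}

def extract_fields(text):
    """Return one value per field, or None when the keyword is not found."""
    result = {}
    for name, pattern in PATTERNS.items():
        match = pattern.search(text)
        result[name] = match.group(1) if match else None
    return result

fields = extract_fields(ocr_text)
```

Fields that come back as `None` are worth flagging for manual review before the row goes into the Excel sheet, since a missing keyword often means the OCR misread it.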
