How can I get specific data from a PDF (different templates) and input them in a just one excel file, without using OCR engineer?
The scenario is, I have different templates of PDFs (similar to an invoice), and I have to get some specific information such as date, invoice number, item number (each item has one), client, and so on… and input all this information in just one excel file.
The items don’t have title, in any template, and can be different quantity of lines between each PDF.
The excel file needs to be created with all the items of the invoice, one line for each item, and merged with all the data of different templates (different clients).
Example, of the Excel:
Column1: Item number
Column2: Quantity of the item
Column3: Client name (each client has a different template of the PDF)
Column4: Version of the Item
Column5: Invoice Number
Column6: Invoice Date
Item1 10 CLIENT1 1.0 14901 05/03/2018 Address1
Item2 3 CLIENT1 1.1 14901 05/03/2018 Address1
Item1 5 CLIENT2 1.1 489760 07/03/2018 Address2
Item1 1 CLIENT3 1.0 11133 08/03/2018 Address3