As you mentioned, the structure of the table is always the same, so you can split the output of Read PDF with OCR on commas or spaces, whichever works reliably for your data.
Then use the Write Range activity to write the data into an Excel file.
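For anyone who wants to see the split-then-write idea outside of UiPath, here is a rough Python sketch. It is only an illustration under assumptions: the sample text, the `split_rows` and `rows_to_csv` helpers, and the CSV output (standing in for Write Range to Excel) are all made up for this example and are not part of any UiPath activity.

```python
import csv
import io
import re


def split_rows(text, delimiter_pattern=r"[,\s]+"):
    """Split each non-empty line of extracted text into cells on commas or whitespace."""
    rows = []
    for line in text.splitlines():
        line = line.strip()
        if line:
            rows.append(re.split(delimiter_pattern, line))
    return rows


def rows_to_csv(rows):
    """Serialize rows as CSV text (a stand-in for writing a range to Excel)."""
    buffer = io.StringIO()
    csv.writer(buffer, lineterminator="\n").writerows(rows)
    return buffer.getvalue()


# Hypothetical extracted text with a fixed three-column layout.
sample_text = "Item Qty Price\nBolt 10 0.25\nNut, 20, 0.10\n"
table = split_rows(sample_text)
csv_text = rows_to_csv(table)
```

Keep in mind that splitting on spaces breaks any cell whose value itself contains a space, so this only holds up when the column values are single tokens.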
I don’t want to read it with OCR, as it is a structured document. Starting from an Excel file, is there no way to read an XML with specific tags?
Even after reading an XML with specific tags, you still have to process the values with string manipulation before you can arrange the data in tabular format.
So rather than converting the PDF to Excel, you can read it with OCR and shape the data into a table directly.
Yes, but imagine that the Excel table always has the same format and I always have to read the same column. Basically, I didn’t say I can’t use OCR. Is XML with tags still the best approach?
Yes, but if you start from a PDF, you need OCR to convert it to other formats.
Otherwise, if the Excel file already exists, then XML and tags work best.
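If the data is already available as XML, reading a fixed set of tags into rows could look roughly like the Python sketch below. Everything in it is an assumption for illustration: the `read_tags` helper, the tag names (`line`, `item`, `qty`, `price`), and the sample document are invented, since the actual XML layout was not posted here.

```python
import xml.etree.ElementTree as ET


def read_tags(xml_text, row_tag, column_tags):
    """Return one row per <row_tag> element, holding the text of each requested child tag."""
    root = ET.fromstring(xml_text)
    rows = []
    for element in root.iter(row_tag):
        # findtext returns None when the child tag is missing; use "" in that case.
        rows.append([(element.findtext(tag) or "").strip() for tag in column_tags])
    return rows


# Hypothetical XML; real tag names depend on the actual file.
sample_xml = """<invoice>
  <line><item>Bolt</item><qty>10</qty><price>0.25</price></line>
  <line><item>Nut</item><qty>20</qty></line>
</invoice>"""

rows = read_tags(sample_xml, "line", ["item", "qty"])
```

Because the tags are looked up by name, the same call keeps working when the column order in the file changes, which is the main advantage over splitting OCR text by position.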