HI,
I have three pdf files in that I need to extract data below the field called Description or PRODUCT NAME or Description & Specification of Goods from those files using regular expressions.
One pdf file is scanned image in three of the files, I used ABBYY ocr to read the pdf but the output is not efficient since it has some misspelling of words…
Is anyone know how to solve both the problems?
these are all the pdf filesInvoice 5.pdf (51.2 KB)
Invoice 8.pdf (159.6 KB)
Invoice 7.pdf (47.4 KB)