I am unable to read and extract data from a PDF file


#1

Sample_Credit_Card statement.pdf (838.2 KB)

Hi Vikas,

I have been trying to extract data from the attached PDF file. I have been at this for almost 24 hours now with no success.

I would like you to guide me through the process.

Thank you


#2

#3

Hi @realone,
This is a scanned document, so you need to use an OCR engine to extract data from it. You can use either the Microsoft (MODI) or the Google Tesseract OCR engine.

I used the Screen Scraping Wizard and it gave 100% results.

After extraction, you need to perform string manipulation if you want to get the data into tables.
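
To illustrate the string-manipulation step, here is a minimal sketch in Python (not a UiPath workflow). It assumes the OCR text contains one transaction per line in the form date, description, amount. That layout, the sample text, and the names `SAMPLE_OCR_TEXT`, `ROW_PATTERN`, and `parse_rows` are all hypothetical, so the pattern would need adjusting to whatever the OCR engine actually returns for the statement.

```python
import re

# Hypothetical OCR output; the real statement's layout may differ.
SAMPLE_OCR_TEXT = """\
Date Description Amount
01/02/2019 GROCERY STORE 45.10
01/05/2019 ONLINE PAYMENT THANK YOU -200.00
Page 1 of 2
01/09/2019 FUEL STATION 1,030.75
"""

# Assumed row shape: a date, a free-text description, then a signed amount.
ROW_PATTERN = re.compile(
    r"^(?P<date>\d{2}/\d{2}/\d{4})\s+"
    r"(?P<description>.+?)\s+"
    r"(?P<amount>-?[\d,]+\.\d{2})$"
)


def parse_rows(ocr_text):
    """Return one dict per line that looks like a transaction row."""
    rows = []
    for line in ocr_text.splitlines():
        match = ROW_PATTERN.match(line.strip())
        if match is None:
            # Skip headers, page footers, and other non-transaction lines.
            continue
        rows.append({
            "date": match.group("date"),
            "description": match.group("description"),
            # Drop thousands separators before converting to a number.
            "amount": float(match.group("amount").replace(",", "")),
        })
    return rows


if __name__ == "__main__":
    for row in parse_rows(SAMPLE_OCR_TEXT):
        print(row)
```

Lines that do not match the pattern are skipped silently, so OCR misreads (for example, a letter O read in place of a zero) will drop rows. It is worth comparing the number of parsed rows against the statement.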

Try it and let me know how it works for you.

Regards,
V