Reading scanned pdf data from tax statement


#1

Hello

I am trying to create a POC for which I need to read PDF data which can be normal pdf or scanned image.
In cases when this is scanned image, pdf quality is not great and sometimes crooked.

What is the recommended approach towards extracting this data ?
The pdf in question is tax statement so is a structured format, however I am having trouble reading it with google OCR and always getting error on Microsoft OCR.

Regards
Prashant