How to process a folder that contains both a PDF and a scanned image

Hi All,

I have a folder that contains both tagged PDFs and untagged (scanned) PDFs. Could anyone suggest an approach or technique that handles both kinds of file? I have tried OCR, but the extracted data does not come out in the proper format. Please refer to the images below, which show a scanned file and the results I got from OCR. I would also like help finding a specific element within the full extracted text, as in this example.

Many thanks in advance.


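Try different OCR engines, and rescale the scanned image by different factors before running each one, to get better results. Low-resolution scans in particular often read better after upscaling.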
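
As a rough illustration of the scaling idea, here is a minimal sketch. It assumes Tesseract is the engine and is called through the pytesseract and Pillow packages, which the original post does not name. The helper names (ocr_best_scale, mean_confidence, join_words, find_field), the list of scale factors, and the "label: value" layout used for pulling out a single element are all hypothetical, so adjust them to your documents.

```python
import re
from statistics import mean


def mean_confidence(data):
    """Average word confidence from a pytesseract image_to_data dict.

    Tesseract reports -1 for boxes that are not words, so those are skipped.
    """
    scores = [float(c) for c in data["conf"] if float(c) >= 0]
    return mean(scores) if scores else 0.0


def join_words(data):
    """Rebuild plain text from the recognised words, dropping empty boxes."""
    return " ".join(w for w in data["text"] if w.strip())


def ocr_best_scale(image_path, scales=(1.0, 1.5, 2.0, 3.0)):
    """OCR one image at several scale factors and keep the most confident pass.

    Returns (scale, text, confidence). The third-party imports live inside
    this function so the pure helpers can be used without Tesseract installed.
    """
    from PIL import Image
    import pytesseract
    from pytesseract import Output

    image = Image.open(image_path).convert("L")  # greyscale
    best = None
    for scale in scales:
        size = (int(image.width * scale), int(image.height * scale))
        resized = image.resize(size, Image.LANCZOS)
        data = pytesseract.image_to_data(resized, output_type=Output.DICT)
        candidate = (scale, join_words(data), mean_confidence(data))
        if best is None or candidate[2] > best[2]:
            best = candidate
    return best


def find_field(text, label):
    """Return the token following 'label:' or 'label -' in OCR text, or None.

    Assumes a simple 'label: value' layout; real scans may need a looser pattern.
    """
    pattern = re.escape(label) + r"\s*[:\-]\s*(\S+)"
    match = re.search(pattern, text, flags=re.IGNORECASE)
    return match.group(1) if match else None
```

You would call ocr_best_scale("scan.png") on each scanned page (the file name is a placeholder) and then pass the returned text to find_field with whatever label you are looking for. Mean word confidence is only a rough proxy for quality, so it is still worth checking a few pages by eye before settling on a scale.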