I need to extract data from the scan copies of Health care form HCFA 1500. The images are not consistent with the quality and shape and size. But these are HCFA1500 forms. I need to scrape the data on the screen itself, I cant download nor I can have the soft copy of the images. Please advise
Please refer this post. @Bharat_Kumar
here they are extracting the scanned data from pdf.
same as you do your process
The problem is that the OCR data is not consistent to do string manipulation. for example in the below image I need to extract field 21, all the data with in field 24, 25, 26 etc.