I’m struggling with using Read PDF with OCR technology.
Company I work for takes PDF forms which are filled in (by handwriting, inside boxes to be filled) and these are scanned and received to us by e-mail. I need to be able to read these PDF files and the handwriting as best as possible.
How can I do this?
I have tried Read PDF with OCR (Tesseract OCR engine) but the result is looking like:
’ l
. as - I . .
- i . . ’ I
- ’“fim flswiaggvereesnt 7 - . Direct Order Form ‘
‘ “1.9 ~ Detpanfiemfvetegfisfifiai‘m‘ . RAP Mobility & Functional Support Products I
77477 77.7.7777717777777777: 7777777777, 7777777777777 777, 777 *7 7(7’7 ’7 77?, 77 1 .7777 ’ “i” 4:7” 77 |
Provider Hotline Number:
fioflderygails777 7777 777 7’77 77 7 .7. ’ 7: 777-7 74-
7 _ 7.0T [Mi 1:] PT 1:] [MO Domer (Specify Profession) ‘7 777 7.7 7’77 7 77 7 7 7‘
1“ masses,eesgrrebrr‘ - Name LEMLLXEYKE§77 77 7 7 7 . J
{ Providernumber 477777777» 77 7 g .
’ r - (RegideredNurseuse fisoeozw 7 7 J .
.1 - 7 .AHPRAnmnber) 77777177777" 7? 7 i 7 777’- .7 7 77 ——
g . ’ _ . ’ 'Employer
ETC.
…with lots of 77777’s and none of the actual handwritten components of the form seem to be listed anywhere in the above messy looking string. The PDF file is visibly quite clear to read so I don’t understand why it is coming out so jumbled up.