Hello guyz, newbie here.
I tried to extract data from PDF using OCR however, a single numeric character cannot properly read by OCR. Does anyone have same experience and normally what did you do to overcome this kind of challenging automation task?
So far activities used to read the data in PDF:
1. Native getting of text (Get Visible Text) → partially work on selected PDF
2. Read Text PDF → Cannot read. Returns an empty string of data.
3. Read Text PDF with OCR - some characters are translated into different characters.
> Microsoft OCR - Cannot translate correctly.
> Tesseract OCR - Cannot translate correctly.
4. Native Citrix → experience the same scenario.
Your input is highly appreciate.