READ PDF TEXT SHOWING GIBBERISH RESULT

@hacky i shared u

1 Like

Hi @hacky

The extraction doesn’t work because the pdf file is protected where page extraction is not allowed on this document.
Becasue of these restrictions bot is failing to extract data from PDF. (Protected is ON).

As an alternate you have to use OCR.

image

Regards,
Karthik Byggari

4 Likes

Hi
We can use READ PDF WITH OCR which would be fine to go for scanned documents or a image documents
Based on different scale we can improve the accuracy of the data
But once if the scale is set to a pdf and if data is obtained accurately then it would repeat the same kind of extraction with same scale always making it reliable

But you might face some changes in the data extraction only if there is any change in package version
I faced it once
But apart from that ocr data extraction for pdf is more reliable

Or

We can try with ABBY FLEXI CAPTURE if we hold a license for it

Cheers @hacky

3 Likes

This topic was automatically closed 3 days after the last reply. New replies are no longer allowed.