How to OCR specified field in PDF files while looping trough

pdf
ocr
scraping

#1

Hi, I got stuck :/, I know how to get list of files in directory, loop and open the files while looping, but how to OCR specified field (part of pdf file) in each PDF (they are scanned docs), something like “Screen scraping -> Get OCR text”. Preferably without opening the files, maybe using coordinates ? thanks M.


#2

Hi,
If your PDF are in the same format then you could make use of Read PDF with OCR and then make use of string manipulation like indexing and substring to get specified text.
Please refer this.