Scanned pdf (OCR)

Hi all,

I have some scanned pdfs that I can read correctly with OCR Omnipage/UiPath, so no need for the UiPath DU, also considering that it is a single template.

now, I have to check whether the header (which is basically a logo) is present and this of course via get pdf with ocr I cannot read.

Do I necessarily need a Document Understanding licence to be able to do this type of verification? Or are there other ways?

Thanks in advance

Hi @andreus91

I believe you could use the UiAutomation to detect an image. For that, you would need to loop over all documents, open them one by one in a PDF Viewer of your choice, and use the Find Image Matches or the Check App State activity:

Maybe even opening those PDFs in Chrome might help a bit here.

1 Like

HI @andreus91

Do you tried with read PDF text and used regex and see?

If you have a sample pdf can you send it here?

Regards
Sudharsan