Reading a scanned PDF file (instead of reading a regular PDF)

Hi, my current workflow is designed to read a PDF, but the PDF is scanned, so it is not working. What should I use instead of the "Read PDF" activity shown in the image below?

image

A first step could be using the Read PDF With OCR activity.

But keep in mind: depending on the scan quality and some other factors, the OCR extraction result can be of high or low quality.
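For anyone who wants to see the decision logic outside of UiPath, here is a minimal Python sketch of "try the normal read first, fall back to OCR if almost no text comes back". It is not UiPath code: `read_native` and `read_ocr` are hypothetical callables standing in for the Read PDF Text and Read PDF With OCR activities, and the `min_chars` threshold is an assumed value you would tune.

```python
from typing import Callable, Tuple


def read_pdf_text(
    path: str,
    read_native: Callable[[str], str],  # hypothetical: plain text-layer read
    read_ocr: Callable[[str], str],     # hypothetical: OCR-based read
    min_chars: int = 20,                # assumed threshold, tune per document
) -> Tuple[str, str]:
    """Return (text, source), where source is "native" or "ocr"."""
    text = read_native(path)
    # A scanned PDF has no text layer, so the native read returns
    # (almost) nothing. Treat that as the signal to switch to OCR.
    if len(text.strip()) >= min_chars:
        return text, "native"
    return read_ocr(path), "ocr"
```

In a workflow, the same idea would be an If activity that checks the length of the Read PDF Text output before running Read PDF With OCR.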

Thanks Peter, how exactly do I add the OCR activity to the workflow?

@amenoufy

Use the "Read PDF With OCR" activity together with an OCR engine, e.g. Microsoft OCR or Tesseract OCR.

Drag the activity to your workflow:
image

Filter for the OCR engines:
image

Drag in the engine of your choice and experiment with its settings.

Also have a look at some relevant courses from the UiPath Academy.

This is so helpful. Do I have to change anything here?

We do some sample runs and play with the parameters to get the best results. Just explore which settings work best for your case. Setting the Language is definitely recommended.
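The "sample runs" approach can be sketched as a small Python harness: run OCR once per combination of settings on a page whose text you have typed out by hand, and keep the combination whose output is closest to it. This is only an illustration under assumptions: `run_ocr` is a hypothetical callable that wraps whatever engine you use, and the setting names in the grid (for example `language` and `scale`) are placeholders, not a confirmed list of engine properties.

```python
import difflib
from itertools import product
from typing import Any, Callable, Dict, List, Tuple


def similarity(expected: str, actual: str) -> float:
    """Score from 0.0 to 1.0, ignoring case and whitespace differences."""
    def norm(s: str) -> str:
        return " ".join(s.split()).lower()
    return difflib.SequenceMatcher(None, norm(expected), norm(actual)).ratio()


def best_settings(
    run_ocr: Callable[[Dict[str, Any]], str],  # hypothetical OCR wrapper
    grid: Dict[str, List[Any]],                # setting name -> values to try
    expected: str,                             # hand-typed reference text
) -> Tuple[Dict[str, Any], float]:
    """Try every combination in the grid and return (settings, score)."""
    keys = list(grid)
    best_combo: Dict[str, Any] = {}
    best_score = -1.0
    for values in product(*(grid[k] for k in keys)):
        settings = dict(zip(keys, values))
        score = similarity(expected, run_ocr(settings))
        if score > best_score:
            best_combo, best_score = settings, score
    return best_combo, best_score
```

A call might look like `best_settings(my_ocr, {"language": ["eng", "deu"], "scale": [1, 2]}, reference_text)`, where `my_ocr` and `reference_text` are yours to supply. One reference page is a small sample, so treat the winner as a starting point rather than a final answer.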

Do I need to download something in UiPath for the Microsoft OCR? It does not work
image
