Check if document is a scanned pdf or not

Hi All,

I am working with PDF documents, 90% of the time they are machine-generated documents so I do not need the OCR to extract text from it, and sometimes they are scanned documents and I have set up the intelligent OCR to extract required text and that works fine.

Now I want my robot to first check if the document is scanned and then direct it to the intelligent OCR workflow for data extraction otherwise just use the normal read PDF workflow. How can I achieve this?

@SenzoD If the Read PDF Text Activity returns empty for the Scanned Documents, you can Start the Intelligent OCR workflow. Does it return Empty Value?

1 Like

@supermanPunch, thanks man, that sounds like a good idea, i have not checked if it returns an empty string but i will test it and let you know.

1 Like