Retry Scope - Digitize: Word contains invalid character: any waterborne

I am having trouble digitizing a PDF document.

The file name is “BL_97859 1.pdf”, which does not contain special characters.
The error happens only with this specific PDF. Other documents are read properly.

Digitizing this specific document returns this error:
Retry Scope - Digitize: Word contains invalid character: any waterborne

I think the document might have invalid characters inside its content, but I have never heard of digitization failing because of that.

Thanks in advance.

@Samuel_Simao

Maybe there are some watermarks. Try using the exclude keywords or include keywords properties to control what gets extracted; that might help.

Also, try opening the internal error/error details from the Locals pane and check them; they might have more info.

Cheers

Hi @Samuel_Simao

To address this issue, you may want to try using a different OCR engine or digitization tool to see if it can successfully process the PDF file. Alternatively, you can try converting the PDF file to a different format, such as a Word document or a plain text file, and then using a different tool to digitize the content.

Hi, @Anil_G and @Nitya1,

Thank you guys for your help.

So this is an exception caused by the OCR engine that I am using.
I am currently using UiPath OCR.

I tested with Tesseract OCR. Although it took longer to digitize, it read the document properly.

Thank you all again.
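
For anyone who hits the same error, the idea of trying a second OCR engine only when the first one throws can be sketched roughly like this. This is a generic Python illustration, not UiPath code: `digitize_with_fallback`, `primary_engine`, and `slower_engine` are hypothetical names, and the two engines are stand-ins that only simulate the behavior described above.

```python
from typing import Callable, Sequence, Tuple

# Hypothetical type: an "engine" is any function that takes a file path
# and returns the recognized text. These are NOT real UiPath or Tesseract APIs.
OcrEngine = Callable[[str], str]


def digitize_with_fallback(
    path: str, engines: Sequence[Tuple[str, OcrEngine]]
) -> Tuple[str, str]:
    """Try each engine in order; return (engine name, text) from the first that succeeds."""
    errors = []
    for name, engine in engines:
        try:
            return name, engine(path)
        except Exception as exc:  # keep the message so it can be reported later
            errors.append(f"{name}: {exc}")
    raise RuntimeError("All OCR engines failed: " + "; ".join(errors))


# Stand-in engines that only simulate the situation in this thread.
def primary_engine(path: str) -> str:
    raise ValueError("Word contains invalid character")


def slower_engine(path: str) -> str:
    return f"text of {path}"


engine_used, text = digitize_with_fallback(
    "BL_97859 1.pdf",
    [("primary", primary_engine), ("fallback", slower_engine)],
)
print(engine_used, "->", text)
```

Listing the faster engine first means the slower one only runs for the documents that actually fail.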
