The Read PDF Text activity returns different results for different versions of PDF files

I have a flow that reads PDF quality certificates and uploads them to an ERP system. For some files this works fine; for the rest, the UiPath flow hangs on a Matches activity.

The good and the bad files follow the same flow: 1. Read PDF Text, followed by 2. a “remove special characters” sequence that removes chr(16) to chr(31).
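For readers who want to reproduce the cleanup step outside UiPath, here is a minimal Python sketch of what a “remove chr(16) to chr(31)” sequence could look like. The function name `remove_control_chars` and the sample string are made up for illustration; the actual sequence in the workflow may be implemented differently.

```python
def remove_control_chars(text: str, first: int = 16, last: int = 31) -> str:
    """Drop every character whose code point lies in [first, last]."""
    # Mapping a code point to None tells str.translate to delete it.
    table = {code: None for code in range(first, last + 1)}
    return text.translate(table)


# Hypothetical sample: chr(16) and chr(31) are inside the range, chr(7) is not.
sample = "Cert\x10ificate\x1f 2406\x07"
cleaned = remove_control_chars(sample)
```

Note that a range of 16 to 31 leaves chr(0) to chr(15) and chr(127) untouched, as well as any non-ASCII invisible characters, so such characters would still reach the Matches activity.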

The “good” PDF files have the property producer = vdd2pdf 165.9; the “bad” PDF files have the property producer = vdd2pdf 243.8.2.

Has anyone experienced similar difficulties with the Read PDF Text activity?

@dries.verstraete

Welcome to the community

As per the description, it looks like the issue might be with the data that is read. Can you check from the Locals panel what the difference in the data is…

If it were a PDF issue alone, then it should hang on Read PDF Text.

Cheers

Hello,
Thanks for your reply. The text output from the Read PDF Text activity is indeed different for a “good” file compared to a “bad” file. Please find a “good” and a “bad” example attached, together with the output from Read PDF Text that I captured from the Locals panel.

Example “Good”
Attached file_ 24060749_Nederlands.pdf (52,8 KB)
Arcelor_After read PDF text_Good_Ndl_01.txt (2,9 KB)

Example “Bad”:
24060139.pdf (57,7 KB)
Arcelor_After read PDF text Bad.txt (4,3 KB)

Removing the “|” characters from the “bad” file with a separate activity did not solve the issue, so there must be other control characters in the “bad” PDF.
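One way to check that suspicion is to list every control or other invisible character that is still present in the captured text. The following is only a sketch in Python, not something taken from the workflow: `find_suspect_chars` is a made-up helper, and the `good` and `bad` strings are invented samples standing in for the two attached .txt outputs.

```python
import unicodedata
from collections import Counter


def find_suspect_chars(text: str, allowed: str = "\n\r\t") -> dict:
    """Count characters in a Unicode 'C' category (control, format, ...),
    ignoring the usual line breaks and tabs."""
    counts = Counter(
        ch for ch in text
        if ch not in allowed and unicodedata.category(ch).startswith("C")
    )
    # Report code points as U+XXXX so invisible characters become readable.
    return {f"U+{ord(ch):04X}": n for ch, n in counts.items()}


# Invented samples; replace them with the text captured from the Locals panel.
good = "Heat 24060749\nC 0.12"
bad = "Heat\x0c 24060139\u200b\nC\x0c 0.12"

good_report = find_suspect_chars(good)
bad_report = find_suspect_chars(bad)
```

Comparing the two reports shows which code points occur only in the “bad” output; those are the candidates to add to the removal step before the Matches activity.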