Read PDF with OCR - Two same element

pdf
studio

#1

Hi guys, I’m working on an automation to read the PDF if it contains the word “DTP”. However, in the PDF we have the word “DTP A/C” also. How can i instruct the robot to ignore the word “DTP A/C” and only take another word which is “DTP” only.

Any ideas?


#2

Hello @Serran_Neru,

How about changing the condition. If it has “DTP A/C” don’t read it. But maybe i didn’t understand.
Another way would be to use regex, and compare the matches.

Since you are using OCR, you are getting also the text position


maybe you can use this in your favor somehow

Hope it helps