Hello, I am a UiPath newbie and I have a question.How can I use OCR to automatically read the data in the PDF (e.g. name, date, student number, and then transfer the data to the specified excel?
Because the location of PDF image may be different, so I hope I can read the content after the “keyword” instead of the location.
Finally, since I am a newbie and not familiar with the system, can you please list the steps or system pictures for my reference? Thanks
You can use “Get OCR Text” activity. And inside, you can use Tesseract OCR Engine.
This way, Tesseract OCR Engine has the output properties called “Result”, and “Text”.
It will output the full text in your PDF.
After that, you can use regex to extract the specific string after your “keyword”.
The key point here is to search within the properties of the Tesseract OCR activity, not within the properties of the Get OCR Text activity. Because the focus is usually on the properties of Get OCR Text, the full text output provided by Tesseract OCR can sometimes be overlooked.