Can the UiPath community confirm whether I am on the right path for this project? I want to extract a policy number from different scanned PDFs, and the PDFs are not tagged. Each PDF has a different font, color, and text size, and the policy number changes location.
I tried relative scraping, but it is image-based, so it works for the first PDF and not the others: because the font changes, it cannot find an exact image match for the anchor.
I think the only two solutions to this are:
- Create a robot for each type of PDF, where every PDF of that type has the same font, color, and text size, and the policy number is always in the same place.
- Extract the text from the PDF with OCR, store it in a data table, Word document, or other format, then extract the policy number from that text using string manipulation.
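To make the second option concrete, here is a rough sketch of the string-manipulation step. It is written in Python purely for illustration and is not a UiPath workflow. It assumes the OCR output is already available as a plain string, that the policy number follows a label such as "Policy Number", "Policy No." or "Policy #", and that the number is 6 to 15 letters, digits, or hyphens with at least one digit. The label wording, the length limits, and the names `POLICY_PATTERN` and `extract_policy_number` are all my assumptions, not something taken from the real documents.

```python
import re

# Assumed format: a label ("Policy Number", "Policy No.", "Policy #"),
# an optional colon or hyphen, then 6 to 15 letters/digits/hyphens
# containing at least one digit. Adjust to the real documents.
POLICY_PATTERN = re.compile(
    r"policy\s*(?:number|no\.?|#)\s*[:\-]?\s*"
    r"((?=[A-Z0-9\-]*\d)[A-Z0-9][A-Z0-9\-]{5,14})",
    re.IGNORECASE,
)


def extract_policy_number(ocr_text):
    """Return the first policy number found in the OCR text, or None."""
    # Collapse all whitespace so a line break between the label and the
    # value (common in OCR output) does not prevent a match.
    flattened = " ".join(ocr_text.split())
    match = POLICY_PATTERN.search(flattened)
    return match.group(1).upper() if match else None


if __name__ == "__main__":
    sample = "ACME Insurance\nPolicy Number:\nAB-1234567\nInsured: J. Doe"
    print(extract_policy_number(sample))
```

Because this works on the recognized text rather than on an image of the anchor, the font, color, and position of the label should not matter as long as the OCR reads the label correctly.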
Is this correct, or is there a different method?