I am trying to extract data from a PDF (18 pages) into Excel. I need to extract all of the data from the PDF.
Could anyone help me with how to extract the data using RegEx or any other solution?
@Sri_Harsha As it's a scanned copy, the image quality is very low, so it would be difficult to extract all the information with the UiPath default OCR engine or the Computer Vision AI module. The default OCR engine does return a result, but it's not accurate. You will need a third-party OCR engine such as ABBYY to do the job.
Hi Harsh,
I tried using Microsoft OCR and Tesseract OCR, but neither gives the expected result on the scanned copy.
Please try with ABBYY. If all the data is captured, then we can process it using RegEx.
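To illustrate the RegEx step once the OCR text is captured, here is a minimal Python sketch. The sample text, the field labels (Invoice No, Date, Total Amount) and the function name `extract_fields` are all assumptions made for illustration; the real patterns depend on what the scanned PDF actually contains and how cleanly the OCR engine reads it.

```python
import re

# Hypothetical OCR output; the real text depends on the PDF and the OCR engine.
SAMPLE_OCR_TEXT = """\
Invoice No : INV-00123
Date: 12/03/2021
Total  Amount : 1,250.50
"""

# Hypothetical field labels; replace them with the labels in the actual document.
# \s* and \s+ tolerate the irregular spacing that OCR output often contains.
FIELD_PATTERNS = {
    "invoice_no": re.compile(r"Invoice\s*No\s*:\s*(\S+)", re.IGNORECASE),
    "date": re.compile(r"Date\s*:\s*(\d{2}/\d{2}/\d{4})", re.IGNORECASE),
    "total": re.compile(r"Total\s+Amount\s*:\s*([\d,]+\.\d{2})", re.IGNORECASE),
}


def extract_fields(text):
    """Return one value per field, or None where the pattern finds no match."""
    result = {}
    for name, pattern in FIELD_PATTERNS.items():
        match = pattern.search(text)
        result[name] = match.group(1) if match else None
    return result


if __name__ == "__main__":
    print(extract_fields(SAMPLE_OCR_TEXT))
```

Fields that come back as None are a quick way to spot pages where the OCR missed something. Each resulting dictionary could then be written out as one row, for example with Python's csv module, and opened in Excel.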