OCR and image automation

Henry · July 6, 2018, 6:39am

Hi,
I was trying out the example of using OCR and image automation. https://studio.uipath.com/docs/example-of-using-ocr-and-image-automation

I need to scrape the $500 value from the image in the PDF.
I have attached my xaml file and the PDF. Could someone please point out where I went wrong, as I am not getting the required value? Your help will be much appreciated.
ocr xaml.zip (242.3 KB)

Thanks in advance

amithvs · July 6, 2018, 7:25am

I am also looking for the same. I hope someone helps. I am also looking for a way to scrape data from an image. Which OCR engine gives the most accurate results? Also, how do I install the ABBYY and Tesseract OCR engines? @ddpadil @ovi, hoping you can help.

evangemert · July 6, 2018, 10:37am

You should use ‘Read PDF with OCR’. This returns the full text of the PDF file. In this case your desired output is on the last line. I used some simple string manipulation to extract it. If the PDF files are always like this, then this will do. If they are unstructured, you’re gonna need different manipulation techniques to do the job.
Main.xaml (8.2 KB)
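
The attached workflow is not reproduced here, but the idea of taking the value from the last line can be sketched outside UiPath. Below is a minimal Python sketch, assuming the OCR output is available as a plain string and the amount is written with a leading `$`. The function name `last_line_amount` and the sample text are made up for illustration; they are not taken from the xaml.

```python
import re
from typing import Optional


def last_line_amount(ocr_text: str) -> Optional[str]:
    """Return the dollar amount on the last non-empty line, or None."""
    # OCR output often ends with blank lines, so drop empty lines first.
    lines = [ln.strip() for ln in ocr_text.splitlines() if ln.strip()]
    if not lines:
        return None
    # Assumed format: "$" followed by digits, optional commas and decimals.
    match = re.search(r"\$\s?\d[\d,]*(?:\.\d+)?", lines[-1])
    if match is None:
        return None
    # Remove a stray space after "$" and any trailing comma picked up by the pattern.
    return match.group(0).replace(" ", "").rstrip(",")


# Hypothetical OCR output, only for demonstration.
sample = "Invoice\nItem A\n\nTotal due: $500\n"
print(last_line_amount(sample))
```

This only works while the value really is on the last line. If the layout changes, the function returns None instead of a wrong value, which is easier to catch in a workflow.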

Henry · July 6, 2018, 11:28am

Thanks for the reply. I tried using “Read PDF with OCR”, but it's not accurate; every time I get different values.

Henry · July 6, 2018, 11:39am

If the required data had been somewhere in the middle of the text, it would have been difficult. So is there any other way to scrape the required data besides string manipulation? I mean something like using “set clipping region” (the one shown in the example).

Thanks in advance

evangemert · July 6, 2018, 11:48am

You could try setting the zoom to a certain percentage to get the $500 inside the screen (instead of using page down) and then set a clipping region, but that’s gonna be pretty unstable I reckon (as the text will always need to be exactly inside your clipping region). I still think that using Read PDF with OCR will get you the most stable results. Did you try both the Microsoft and Google OCR engines? And have you tried different scales? For me the Microsoft OCR engine with a scale of 1 already did the trick.
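
On the concern about the value sitting in the middle of the text: string manipulation does not have to rely on position. A rough Python sketch, assuming the amount follows some known label in the OCR output. The label "Total", the function name `amount_after_label` and the sample text are assumptions for illustration, since the actual PDF contents are not shown in this thread.

```python
import re
from typing import Optional

# Assumed amount format: "$" followed by digits, optional commas and decimals.
AMOUNT = r"\$\s?\d[\d,]*(?:\.\d+)?"


def amount_after_label(ocr_text: str, label: str) -> Optional[str]:
    """Return the first dollar amount that follows `label`, or None."""
    # \b keeps "Total" from matching inside "Subtotal".
    # \D{0,40}? allows a short run of non-digit characters (colon, spaces,
    # line break) between the label and the amount.
    pattern = re.compile(
        r"\b" + re.escape(label) + r"\b\D{0,40}?(" + AMOUNT + r")",
        re.IGNORECASE,
    )
    match = pattern.search(ocr_text)
    if match is None:
        return None
    return match.group(1).replace(" ", "").rstrip(",")


# Hypothetical OCR output, only for demonstration.
text = "Invoice 42\nSubtotal: $300\nTotal: $500\nThank you"
print(amount_after_label(text, "Total"))
```

This still depends on the OCR engine reading the label and the `$` correctly, so it does not fix the inconsistent results by itself. It only removes the dependency on where the value sits in the text.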

niranjanaa_sha · June 9, 2020, 6:03am

Hi All,

I have an image automation scenario in which a new service needs to be added by removing another service from a desktop application. Once the service has been entered, the application expands the parent service and highlights the service we entered. The bot needs to identify the highlighted service (shown in orange) and click it in the grid. The other services (text) in the grid are either grey or white.

Once the highlighted area has been clicked, the bot should click the Remove button on the screen below. The service name can appear anywhere in the grid. This is purely image-based automation; apart from that, only keyboard shortcuts can be used.

Kindly help me find a solution for this issue. I have attached an image of the sample workflow, showing how the grid looks when a text is highlighted.
