Get text form jpg file

ykuzin · August 26, 2019, 3:38pm

Hi,
I am after extracting text from jpg file. Not sure if standard screen scraping methods should help as the process I am automating is repetetive - there should be dozens of images and script should iterate over those images in a loop. Data Scraping mechanism allows to extract text once? or am i wrong?
As for now I have to export jpg to pdf and i really want to avoid that, either way OCR with read pdf works really well.

Palaniyappan · August 26, 2019, 3:51pm

yah thats right and that is what it is actually intended to
–well we can convert that read text from image to a pdf by writing it to a word document with word application scope using append text activity
–then we can use EXPORT TO PDF to convert that as pdf
–to use word application scope download the packge UiPath.word.activities from manage packages and once installed we can use the above mentioned activities to get this done

hope this would help you
Cheers @ykuzin

aksh1yadav · August 26, 2019, 5:03pm

Hey @ykuzin

The easiest solution is to read all images from its respective directory and and then use Load Image Activity which will return you an Image Variable output then use this returned Image Variable output with, based on best results returned by Available free OCR Engines (Paid also you can use if you wanna opt for it ) to process your image and Perform String manipulation on returned Text by OCR Engines.

but you have to keep in mind that this will also be the slowest solution if image size is bug and text existence also because OCR time is proportional with image size.

Regards…!!
Aksh

ykuzin · August 27, 2019, 7:47am

Hi @Palaniyappan @aksh1yadav,
Thanks for all the datails. So i tried to OCR pdf files it returned me error “Read PDF with OCR: Error performing OCR: Unable to initialize Microsoft engine MicrosoftErrorCreateEngine” . Surfing on forum led me to the topic (Microsoft OCR in Microsoft office 2016 - #3 by rachelfonseca) saying this error can be trouble shoot by locating Optical Character Reconginiton somewhere in settings which I happen to not find running on Windows 7.
Any ideas i can check required version against pre installed one?

aksh1yadav · August 27, 2019, 8:26am

Hey @ykuzin

Which Office Version You are using? May i know?

Regards…!!
Aksh

ykuzin · August 27, 2019, 10:19am

It’s Office 2016, pal .

aksh1yadav · August 27, 2019, 11:20am

Hey @ykuzin

Follow below steps to turn on the OCR feature in Microsoft Office.

Go to the ‘Add or Uninstall Program’ in Microsoft Windows, select ‘Microsoft Office’ from the list.
Click on the ‘Change’ button

image931×106 10.9 KB
Check the ‘Add or Remove Features’ button
Click on ‘Continue’

image624×563 11.4 KB
In the following dialog box, select ‘Office Tool’ > ‘Microsoft Office Document Imaging’ > ‘ Scanning, OCR and Indexing Service Filter’ and under the drop down list choose ‘run from computer’.

image665×561 18.5 KB
Proceed with the installation by clicking the ‘Continue’ button

Wait for the installation and when the installation is successful

Regards…!!
Aksh

PeteGrabec · August 29, 2019, 11:24am

I’ve just tested it, and it worked on the first hit. I used Microsoft OCR Engine, an Excel Application scope, and wrote the String result into an Excel cell, simple and easy.

This will help me automate a mundane task at work: I need to extract a serial number from the product packages. I usually take a photo of the package on site, and deal with the rest later on in my office. Now this little sequence completely changes the way how we will be handling the product serial numbers from now on. What an elegant way of extracting text from a .jpg file.

That’s absolutely awesome @ykuzin, @aksh1yadav Thank You!

kuzinyd · August 29, 2019, 12:51pm

@PeteGrabec Genuinely glad to share your delight!

VGtha · September 18, 2019, 5:17pm

Hi,

I am having the same issue, i am using Office 365 pro plus, could you please help me how to enable OCR in Office 365 ? for change option i am getting only two options those are quick repair and online repair.

Thanks a lot.

Aakanksha_Pathak · March 13, 2020, 11:37am

i tried scraping data from a jpg file but it is not retrieving the correct data Capture

Topic		Replies	Views
Image Text Extraction ( UiPath Version 2022) Studio studio , question , activities_panel	2	511	January 27, 2023
How could I loop though JPG file using microsft OCR Activities ocr , activities	1	709	January 22, 2021
Read image from OCR Help	8	4369	August 22, 2019
Extracting data from images Using OCR Studio uiautomation	12	1879	November 30, 2021
Load image and microsoft OCR Help studio	1	1267	October 28, 2019

Get text form jpg file

Related topics