Error during screen scraping & OCR

ocr
scraping
studio

#1

Hello everyone!

I am trying to recognize text from an invoice-like image (.png) using data scraping recording.
The Screen Scraper wizard comes up. The Native and FullText methods do not return any result. When I select the OCR method and refresh, an error message comes up. It says "app launch is not possible due to missing api-ms-win-crt-l1-1-0.dll. Try reinstalling the app."
Any thoughts would be much appreciated! Thanks!


#2

Hi, @koltsev1, this file is in lots of places on my system.
See below:
no_api_file

and some places not OS-specific:
api_other-places

If the file exists on your system but you still get the error, there’s still hope.
It looks like you’re running Russian Windows, true? That’s probably not the reason, I’m thinking.
Check out this link and this link re Universal CRT (C Runtime).
I found this C Runtime update link in the first link above, maybe that will help.
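
If you want to check quickly whether the file is present anywhere on your machine, a minimal Python sketch like the one below could do it. The function name `find_dll` and the two search folders are my own assumptions for illustration, not anything from UiPath or Microsoft; point it at whatever folders make sense on your system.

```python
import os


def find_dll(name, roots):
    """Return every path under the given root folders whose file name
    matches `name`, ignoring upper/lower case."""
    wanted = name.lower()
    matches = []
    for root in roots:
        # os.walk silently yields nothing for folders that do not exist
        # or cannot be read, so missing roots are simply skipped.
        for folder, _subfolders, files in os.walk(root):
            for file_name in files:
                if file_name.lower() == wanted:
                    matches.append(os.path.join(folder, file_name))
    return matches


if __name__ == "__main__":
    # Assumed search roots; adjust these for your own system.
    search_roots = [r"C:\Windows\System32", r"C:\Windows\SysWOW64"]
    for hit in find_dll("api-ms-win-crt-l1-1-0.dll", search_roots):
        print(hit)
```

If nothing is printed, the file was not found in those folders, which would match the error message.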

Regards,
burque505


#3

Thanks. Windows can’t find this file. I’ll try checking the links that you provided, @burque505.


#4

@koltsev1, you bet! Hope it works. Let us know.
Regards,
burque505


#5

Hi!
My problem was resolved by downloading api-ms-win-crt-l1-1-0.dll manually. It sounds like your problem is pretty much the same thing. Just make sure that you choose the correct version of the DLL for your system (32-bit or 64-bit).
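
If you are not sure which one you have, a rough Python sketch like this can tell you. The helper names are made up for illustration. Note that `python_bits` describes the Python interpreter itself, so a 32-bit Python on 64-bit Windows will report 32; the `machine` value is the better hint for the operating system.

```python
import platform
import struct


def interpreter_bits():
    """Pointer size of the running Python interpreter, in bits (32 or 64)."""
    return struct.calcsize("P") * 8


def describe_system():
    """Collect the values that hint at 32-bit versus 64-bit."""
    return {
        # Hardware/OS architecture string as reported by the platform,
        # for example "AMD64" or "x86" on Windows.
        "machine": platform.machine(),
        # Bitness of this Python process, not necessarily of the OS.
        "python_bits": interpreter_bits(),
    }


if __name__ == "__main__":
    for key, value in describe_system().items():
        print(key, "=", value)
```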


#6

@Brainytapf @burque505 Hi all

I ran the Windows update that burque505 proposed, and the Google OCR activity started working. Thanks a lot!
Funnily enough, though, the default language is “eng” while I’m trying to recognize Russian words.
Changing the language to “rus” and running the sequence throws an exception. Trying to solve this now.

Update.
OK, so I installed the Russian language data from the Tesseract GitHub. Now it all works!
Language installation guidance is here: https://activities.uipath.com/docs/google-ocr


#8

@koltsev1, glad you got it going and thanks for the link. I’m going to try to install Russian now.

I also got it working by just removing “eng” from the Language property of the Google OCR. Although you don’t need it now, just for grins here’s the workflow.

Russian.zip (73.8 KB)

Edit: (Just a few tips on installing the tesseract data for Google OCR)

Here’s my workflow.
RussianOCR.zip (113.8 KB)

The ‘tessdata’ files I need to add languages are here. I scrolled down and went to the link for rus.traineddata.

Here’s where it has to go, for me (change the path according to your UiPath Studio version - this is for CE, by the way):

tessdata

Once that file is in your ‘tessdata’ folder, restart UiPath Studio (it won’t update otherwise).
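
If you add several languages, the copy step can be scripted. Here is a minimal Python sketch, assuming you have already downloaded the `.traineddata` file and know where your ‘tessdata’ folder is. The function name `install_traineddata` is hypothetical and both paths are placeholders you must supply yourself, since the folder depends on your UiPath Studio version.

```python
import shutil
from pathlib import Path


def install_traineddata(source_file, tessdata_dir):
    """Copy one .traineddata file into an existing tessdata folder and
    return the sorted list of language codes now present there."""
    source = Path(source_file)
    target_dir = Path(tessdata_dir)
    if source.suffix != ".traineddata":
        raise ValueError(f"not a .traineddata file: {source.name}")
    if not target_dir.is_dir():
        # Refuse to create the folder: a missing folder usually means
        # the path is wrong for this Studio installation.
        raise FileNotFoundError(f"tessdata folder not found: {target_dir}")
    shutil.copy2(source, target_dir / source.name)
    # The file name without its extension is the language code,
    # for example "rus" for rus.traineddata.
    return sorted(p.stem for p in target_dir.glob("*.traineddata"))


# Example call with placeholder paths (replace both with your own):
# print(install_traineddata("path/to/rus.traineddata", "path/to/tessdata"))
```

The returned list is just a quick way to confirm that the new language code sits next to the ones that were already there. Studio still needs the restart afterwards.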

For my guinea pig I chose Pravda. Using Citrix recording, I first used a ‘Click OCR Text’ to make sure I had Russian available from Google OCR. Here’s what I picked:

obshyestvo

I then replaced that activity with a ‘Get OCR Text’ activity, indicating the element in the browser. Then I changed the OCR language in the Properties panel:

ocrprops

I added a variable to capture the text, and displayed it in a message box:

mbox

Edit: Works with Chinese also (I added chi_sim.traineddata and chi_tra.traineddata).

zh_mbox

Workflow: ChineseOCR.zip (196.8 KB)

Regards,
burque505


#9

I tried to scrape the data in full-screen mode (Photo Viewer) using the Get Text activity with the ABBYY OCR engine, but it throws an exception saying the selector was not found.

Thanks,
Tharma KS


#11

Do you have a package for Chinese? About the OCR, can you send it to 457867530@qq.com, please?