OCR Get Text-Activity not working unattended with browser

tobido · February 3, 2021, 9:56am

Hi everyone,

I am trying to read the text of an html element via OCR (website is angular based and content is loaded lazy, why alternatives like html inner text are not working here).

Attended the process is working fine, but runs into timeout when started unattended (Get OCR Text ‘DIV’: Timeout reached).

I isolated and reproduced the problem (see image) by starting the process from studio and minize my remote desktop connection. Seems that OCR is not working, when browser is kind of ‘not visible’.

We played with the WaitForReady-Property as well as Profile, Scale and Invert from OCR Engine Activity (Change of OCR engine is not an option due to cloud service policy, but does not seem to be the problem here as Tesseract is working fine in attended mode).

Am I missing something basic here?

Thanks in advance

Pablito · February 4, 2021, 3:47pm

Hi @tobido,
OCR as name says will not work unattended as it requires screen to be captured during automation. So it requires active session to work. For unattended solutions you need to work with selectors and properties which will help to get text from defined element.

tobido · February 5, 2021, 8:39am

Hi Pawel,
thanks for your answer. OCR stands for optical character recognition, meaning scraping characters instead of reading properties. Not sure what you are referring to here. Many OCR components in many applications work perfectly fine without any screen interaction.

New single page applications are often built with customized html elements, where content is set via data attributes (sth. like < div data-content=“{{ value }}”> < /div >), commonly used in connection with INTL, for instance.

Any idea how I can retrieve this value in unattended mode?

Thanks and kind regards

Pablito · February 5, 2021, 3:09pm

Hi @tobido,
Sorry for wrong shortcut explanation. I was thinking good but wrote something completely wrong… hard day
What I meant is of course screen is not needed, but to capture OCR data the OCR engine need to have access to session so it could grab let’s say proper coordinates of what needs to be captured. And this type of action can’t be 100% unattended. So it can run as background automation.

Topic		Replies	Views
OCR method failed to scrape this UI Element Help ocr , activities	8	6155	February 15, 2018
Click OCR Text! Help	3	4254	October 29, 2018
Issue occurred when using the Get OCR Text activity in the screen scraping method for an Unattended Process Activities ocr , activities , question , unattended , screen-scraping , get-ocr-text , screen-scraping-in-unattended , get-ocr-text-in-unattneded , how-to-use-screen-scraping-in-backg , timeout-reached	3	1300	September 17, 2021
Activities OCR in AWS without Orchestrator Robot ocr , activities , unattended	0	854	May 5, 2020
Get OCR Text failure Help	6	3382	February 12, 2019

OCR Get Text-Activity not working unattended with browser

Related topics