Extract Content from various news using Screen Scraping Function

annalyy · July 17, 2018, 8:57am

Hello all,

I am trying to extract the content of news articles from different sources with the screen scraping function, and then check whether the extracted text contains certain words such as “UI Path”.
However, I have run into the following problems:

1. The Get Text function runs before the website has loaded completely. Because the page icon differs from one news website to another, I tried to use the image vanish function on the loading icon that appears in the browser tab, but it doesn’t work because the icon keeps changing while the page loads.
2. Because the website differs from one article to the next, my selector is unstable and the Get Text function returns the wrong content most of the time, e.g.:
https://www.scmp.com/news/world/united-states-canada/article/2155617/ex-employee-zhang-xiaolang-denies-stealing-apples
https://www.cnbc.com/2018/07/12/stormy-daniels-arrested-in-columbus-ohio-while-performing-avenatti.html?recirc=taboolainternal

The selectors I am using for the two news links above are as follows:

Attach Browser selector:

Get full text selector:

I hope you can give me some ideas on how to move this forward.
Thanks!!
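
For reference, the keyword check described in this post can be sketched outside UiPath as follows. This is only a minimal Python illustration, not UiPath code: the function names `keyword_pattern` and `find_keywords` and the sample text are made up for the example. It assumes keywords start and end with a letter or digit, and it treats the spaces inside a keyword as optional so that “UI Path” also matches “UiPath”.

```python
import re
from typing import Iterable, List


def keyword_pattern(keyword: str) -> "re.Pattern":
    """Build a case-insensitive pattern in which spaces inside the keyword are optional."""
    parts = [re.escape(part) for part in keyword.split()]
    # \b keeps the match on word boundaries, so "Maui pathway" does not count as "UI Path".
    return re.compile(r"\b" + r"\s*".join(parts) + r"\b", re.IGNORECASE)


def find_keywords(text: str, keywords: Iterable[str]) -> List[str]:
    """Return the keywords that occur in the extracted text, in the order given."""
    found = []
    for keyword in keywords:
        if not keyword.split():
            continue  # skip empty or whitespace-only keywords
        if keyword_pattern(keyword).search(text):
            found.append(keyword)
    return found


if __name__ == "__main__":
    # Hypothetical extracted text, standing in for the output of Get Text.
    article = "Shares of UiPath rose after the announcement."
    print(find_keywords(article, ["UI Path", "Apple"]))
```

In a workflow, the same idea would most likely be a case-insensitive contains or regex check on the string returned by Get Text.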

loginerror · July 17, 2018, 10:23am

Hi @annalyy

Websites are always tricky to automate. Personally, I have had the most luck with a simple Delay activity that pauses the process for a few seconds while the website loads.

You could also try the solution of @bogdanripa from this post, but you will need to wait for him to update his selector

Turns out the info is available in the documentation of the Get Attribute activity:

Therefore, you can try a Get Attribute activity with the selector "<webctrl>" and readystate as the attribute name.
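
The difference between a fixed delay and checking the ready state repeatedly can be sketched like this. It is a minimal Python illustration of the polling logic only, not UiPath code: `wait_until_ready` is a hypothetical helper, and because this thread does not say which value the readystate attribute returns for a loaded page, the actual check is passed in as a function.

```python
import time
from typing import Callable


def wait_until_ready(is_ready: Callable[[], bool],
                     timeout: float = 30.0,
                     interval: float = 0.5) -> bool:
    """Call is_ready() repeatedly until it returns True or the timeout expires.

    Returns True if the page became ready in time, False otherwise.
    """
    deadline = time.monotonic() + timeout
    while True:
        if is_ready():
            return True
        if time.monotonic() >= deadline:
            return False
        time.sleep(interval)  # short pause between checks instead of one long delay


if __name__ == "__main__":
    # Simulated page that reports "ready" on the third check (stand-in for Get Attribute).
    checks = iter([False, False, True])
    print(wait_until_ready(lambda: next(checks), timeout=5, interval=0.01))
```

Compared with a single Delay, this kind of loop continues as soon as the page reports ready and gives up after a known timeout. In a workflow, the equivalent would be a loop or retry around the Get Attribute activity, followed by Get Text once the check succeeds.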

annalyy · July 18, 2018, 8:22am

Thanks!!
