Navigate To All Links & Scrape All Of Them


#1

Hi Everyone,

I’m quite new to UiPath.

I have a web page, and every day I have to record all the new entries from it into Excel.
For example, today there were 787 new entries, and I scraped all of them to a CSV file.

But then I have to click each of their links and scrape those pages to a CSV file as well.
Every page is the same:

  1. The links look like:
    https://www.xxx.com/link/?id=506228,
    https://www.xxx.com/link/?id=506229
    https://www.xxx.com/link/?id=506230

    Only the id part is different.

  2. Every element is the same on those links. So if I can “Navigate To” all the links, I think I can scrape all the data inside them.

I hope I explained it well :frowning:

Any ideas?

Thanks in advance


#2

Use a For Each Row loop to iterate through the DataTable, and inside the loop do a simple Open Browser + scrape.
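In plain code terms, that loop looks roughly like the sketch below (Python rather than UiPath, just to show the logic; the column name `url` and the sample rows are assumptions, and the actual page visit/scrape is left as a comment):

```python
import csv
import io

# Stand-in for the CSV produced by the first scrape; in the real workflow
# this would be read from the file itself (filename assumed).
csv_text = (
    "id,url\n"
    "506228,https://www.xxx.com/link/?id=506228\n"
    "506229,https://www.xxx.com/link/?id=506229\n"
)

rows = list(csv.DictReader(io.StringIO(csv_text)))

visited = []
for row in rows:        # the "For Each Row" part
    url = row["url"]    # the "Get Row Item" part
    # In UiPath, this is where Open Browser / Navigate To plus the second
    # scrape would go; here we only record which URL would be opened.
    visited.append(url)

print(visited)
```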


#3

Could you please explain more :slight_smile:

As I said, I’m new to UiPath.

Thank you for this!


#4

I’ll see if anyone can pick this up; if not, ping me again soon.


#5

Hi again,

Could you please help me :slight_smile:


#6

Ok, so the scraping-to-CSV part is done and working, right?

Have a look at the DataTables tutorial to see how to loop through the entire table.


(the relevant part is around 10:00 in)

And in the For Each loop, instead of the WriteLine, use an Open Browser activity with the URL from the CSV file.


#7

Hi Cosin,

I am facing the same problem. I used a For Each loop, but I am only able to go to the first link, not all the links. Could you please help me?


#8

Hi,
Could you please upload your workflow and Excel file?


#9


#10

Hi,
It would’ve been better if you could have uploaded the workflow.
Anyway, looking at the screenshot, you’re using a single DataTable, but I’m not sure how the Get Row Item activity will identify the column name/index of the URL, since it isn’t being read from Excel.
So could you please do the following:

Drag a Read Range activity after the Write Range and create one more DataTable, then use a For Each Row activity, pass the column name (url) to a Get Row Item activity, and then use a Navigate To activity.


#11

Samsung.zip (188.6 KB)


#12

Hi,

Please find the attached workflow.


#13

Hi,

I tried using the Read Range activity, but it’s not working. Could you please help?


#14

Hi,
Check now. It’s working :slight_smile:
It saves the report with the mobile name and URL, then navigates to the respective URL in the same tab.

Here we go.
Main.xaml (24.7 KB)


#15

Hi Dilip,

Thank you, it’s working.

But when scraping the URLs for sponsored links, it scrapes the URL in this format: (/gp/slredirect/picassoRedirect.html/ref=pa_sp_atf_aps_sr_pg1_1?ie=UTF8&adId=A00172302RXUJ87S0Z95A&url=https%3A%2F%2Fwww.amazon.com%2FDashboard-Ultimate-Flexible-APPS2Car-Windshiled%2Fdp%2FB071FLGFML%2Fref%3Dsr_1_1%3Fie%3DUTF8%26qid%3D1504154475%26sr%3D8-1-spons%26keywords%3Dsamsung%2Bmobiles%26psc%3D1&qualifier=1504154475&id=6765837114264527&widgetName=sp_atf), so it is not able to find the UI element and throws an error.

Without “sponsored”, it scrapes the link in this format, which works: (https://www.amazon.com/Samsung-Galaxy-J7-Prime-G610F/dp/B01MUSD2ST/ref=sr_1_3?ie=UTF8&qid=1504154475&sr=8-3&keywords=samsung+mobiles)


#16

Yep.
I think it’s a popup URL.
It looks like the sponsored URLs don’t start with the protocol over which the data is sent (https).
First, try prepending https://www.amazon.com/ to the “sponsor link” and make changes accordingly until the link opens. If it works, you can hardcode the same prefix for all the sponsored links.
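The prefixing idea can be sketched like this (a Python illustration, not UiPath; `urljoin` resolves a relative sponsored path against the site root while leaving normal absolute links untouched — the sample URLs are shortened versions of the ones quoted above):

```python
from urllib.parse import urljoin

base = "https://www.amazon.com/"

# A sponsored link as scraped (a relative path) and a normal absolute link.
sponsored = "/gp/slredirect/picassoRedirect.html/ref=pa_sp_atf_aps_sr_pg1_1?ie=UTF8"
normal = "https://www.amazon.com/Samsung-Galaxy-J7-Prime-G610F/dp/B01MUSD2ST"

# urljoin resolves the relative sponsored path against the site root,
# and leaves the already-absolute URL unchanged.
print(urljoin(base, sponsored))
print(urljoin(base, normal))
```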


#17

Yes, exactly. I don’t want to scrape popup URLs. So is there an option to search each link for “sponsored”, and if “sponsored” is present, skip it; otherwise scrape it?


#18

There is no filter while data scraping.
So one way is to filter the DataTable afterwards:
filter out the rows whose Url column doesn’t start with https (search Google or the forum :slight_smile: ),
then copy the result to another cell, read from that cell, and use Navigate To for further processing.
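The filtering step can be sketched like this (a Python illustration of the same DataTable filter; the column names and sample rows are assumptions):

```python
import csv
import io

# Sample scraped table; the names and rows are made up for illustration.
csv_text = (
    "name,url\n"
    "Galaxy J7,https://www.amazon.com/Samsung-Galaxy-J7-Prime-G610F/dp/B01MUSD2ST\n"
    "Sponsored item,/gp/slredirect/picassoRedirect.html/ref=pa_sp_atf\n"
)

rows = list(csv.DictReader(io.StringIO(csv_text)))

# Keep only rows whose URL starts with "https" — sponsored redirect links
# were scraped as relative paths, so this drops them.
kept = [row for row in rows if row["url"].startswith("https")]

print([row["name"] for row in kept])
```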


#19

Do selectors help to filter while data scraping?


#20

You can filter the table for other purposes, but I’m not sure how you would identify a sponsored link unless you scrape it first, right?
:slight_smile: