So i have a Web page that i have to record all new entry from that web site to excel everyday.
For example today i have 787 new entry and i get all of them with scraping to csv file.
But, i have to click all links of them and again i have to scraping to csv file.
Every page is the same;
Hi,
It would’ve been better if you could have uploaded workflow.
Anyways by looking into screenshot your using single datatable but i’m not sure how it will identifies the column name/index of url in get row item activity as its not picking from excel.
So could you please do the following.
Drag read range activity after write range and create one more data table and then use For each row activity and pass the column name(url) in Get row item activity and then navigate to activity.
But when scraping the url for “sponsored links it is scraping url in (/gp/slredirect/picassoRedirect.html/ref=pa_sp_atf_aps_sr_pg1_1?ie=UTF8&adId=A00172302RXUJ87S0Z95A&url=https%3A%2F%2Fwww.amazon.com%2FDashboard-Ultimate-Flexible-APPS2Car-Windshiled%2Fdp%2FB071FLGFML%2Fref%3Dsr_1_1%3Fie%3DUTF8%26qid%3D1504154475%26sr%3D8-1-spons%26keywords%3Dsamsung%2Bmobiles%26psc%3D1&qualifier=1504154475&id=6765837114264527&widgetName=sp_atf
)” in this format, so it is not able to find the ui element. throwing error.
Yep .
I think its popup url.
looks like sponsors url doesn’t seems to have protocol over which data is sent (https)
First try to add (https://www.amazon.com/“sponsor link” and make some changes accordingly untill that link opens up. If it works then you can hardcode same for all the sponsors links.
Yes exactly. I don’t want to scrap pop URL’s. So is there any option like i can search for "sponsored " on each link and if “sponsored” is present don’t scrap else scrap it.
There is no filter while data scraping .
So one way is to by filtering the DataTable.
Filter the Url column which doesn’t have https in the start line. (search google or forum )
then copy to the another cell and read from that cell and use navigate to for further processing.