[Help] Data Scraping doesn't work well

Dam_Son · October 7, 2021, 3:36pm

Here what is my bot do:
Reading 10 URL in excel (ReadRange)-> For Each URL → Open URL Website (Use Application/browser) → Data Scraping → WriteRange → Close Browser.
I set MaxNumberOfResult is 100. Each page has 10 results, so the bot need to scrap 10 pages.
However, there are 2 problems:

In the first URL, bot didn’t scrap page 1. Bot open url, then automatically click Next button without scraping page 1.
After scraping first URL, in the second URL. MaxNumberOfResult doesn’t work, bot scrap data unlimit.
How can I solve those problems?
Thanks for reading.
My version is 2021.10.0-beta.5978

rahulsharma · October 7, 2021, 5:53pm

Have you tried data scraping on thise links yourself?

Make sure selector is dynamic and works with all, shouldn’t have static part in it.

Also if you can show the page ans selector that would help is to help you better.

ManiPrajwal_K · October 8, 2021, 6:11am

Are you getting data in th preview while scraping with DataScraping activity?

manjula_rajendran · October 8, 2021, 6:27am

Hi @Dam_Son ,

Did you check the preview or output data which is write to excel? Confirm scraping element pattern is same.

If you are getting the data by clicking on next button it will go on click all the pages but only 100 datas only saved in your scraped data. check it. If you want to stop immediately after 100 records or some number of pages then handle with initializing and incrementing Count variable.

Thanks,
Manjula

Dam_Son · October 8, 2021, 4:23pm

@rahulsharma @manjula_rajendran @ManiPrajwal_K Thank you guys. I found out there were 2 problems in my process.

I forget to clear data table after each Url.
Like @rahulsharma said, selector wasn’t dynamic for all pages. That’s why bot scrap data nonstop as in fact, bot didn’t get any data. The number of result is 0 so bot just click Next button and I feel that bot scrap data nonstop.

However, it still has some contradictions. If I didn’t clear data table, so after First URL, the number of result in data table is 100 so definitely on the second URL, data scrapping activity doesn’t need to work as it reaches MaxNumberOfResult (100).

Anyway, I’ve solved the problem thank you guys for your help.

rahulsharma · October 8, 2021, 5:15pm

the size of datatable → you can increase the number of rows to be extracted in the ‘Extract Data Table’ activity to desired number or keep 0 for ‘all’ value to be extracted

If you are using any datatable, make sure if you want to have max rows extracted, make the limit value as -1 in Build datatable ans make it 0 in extract data table.

Happy Automating @Dam_Son

system · October 11, 2021, 5:15pm

This topic was automatically closed 3 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Data scraping is not working as explained Activities uiautomation , activities , question	12	1818	February 2, 2021
Data scrapper problems Studio datatable , excel , selector , uiautomation , robot , activities , studio , data_scraping , question , workflow_analyzer	12	1101	May 30, 2022
Data Scraping multiple page issue Help selector , uiautomation , activities , data_scraping , question	5	960	March 23, 2020
Data Scraping - maxnumberofresults not working Help	12	2825	July 14, 2020
DATA SCRAPING NOT WORKING BECAUSE NEXT PAGE BUTTON NOT VISIBLE Something Else feedback	18	3175	April 20, 2022

Most Active Users - Yesterday
ashokkarale
prashant1603765
sharazkm32
V_Roboto_V
sonaliaggarwal47
Ranveer_S_Thakur
Aki1111
arivu96
chaitanyaKumar
manasrlenka25
More details...

[Help] Data Scraping doesn't work well

Related topics