I’m working on a project where I have to use the Data Scraping activity on a website in Google Chrome and then store the result in an Excel file. The process is straightforward. When I run the workflow, it sometimes extracts the data as expected and writes the result to Excel. But when I re-run the same workflow, with no changes at all, it sometimes extracts nothing and returns an empty result.
Note: No matter how many times I re-run the project, any given attempt may work fine or may fail.
This can be caused by an unreliable selector for the target table.
Sometimes it works, sometimes it does not. If you are scraping a public website, this could even be built-in protection against scraping.
Search for “Selector” in the properties of Extract Structured Data. Open that selector in UiExplorer, refresh the page, and check whether it still points at the correct target.
You can also play around with this property if you suspect the table is being scraped before it has finished loading. This may help:
If you can share your workflow and/or the selector of that activity with us, that will help.
Can you please confirm that the data to scrape is always present in the table?
Also, please add enough delay before and after the data scraping. If you are extracting from multiple pages, set a higher delay between pages in case there is any loading issue.
I think a missing delay is causing the issue here, and it can be resolved by adding a proper delay.
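To illustrate the delay idea outside of UiPath: a fixed delay can be too short on a slow load and wasted time on a fast one, so one common alternative is to poll until the table actually has rows. The sketch below is plain Python, not UiPath code; `wait_until` and the `condition` callback are hypothetical names used only for illustration.

```python
import time
from typing import Callable


def wait_until(condition: Callable[[], bool],
               timeout: float = 30.0,
               interval: float = 0.5) -> bool:
    """Poll `condition` until it returns True or `timeout` seconds pass.

    Returns True if the condition was met, False if the wait timed out.
    """
    deadline = time.monotonic() + timeout
    while True:
        if condition():
            return True
        if time.monotonic() >= deadline:
            return False
        # Sleep briefly between checks instead of using one long fixed delay.
        time.sleep(interval)
```

In a workflow, the condition would be something like "the target table exists and has at least one row", checked before the extraction step starts.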
Yes, the data is always there, and I did add enough delay before and after the scraping, but the issue is still the same.
Let’s say that out of 10 runs, 3 or 4 stop after scraping with an empty data table as the result. I can’t pinpoint where exactly the problem is!
I want to make sure that whenever I run the project, it works as expected on the first try. Since it is a PoC demo, I don’t want a run to fail in front of the audience.
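One way to guard a demo against an occasional empty result is to treat an empty table as a failure and retry the extraction a few times (in UiPath, the Retry Scope activity serves this purpose). The following is only a rough sketch of that logic in Python; `scrape_with_retry` and the `scrape` callback are hypothetical stand-ins for the extraction step, not part of any UiPath API.

```python
import time
from typing import Callable, List, Sequence


def scrape_with_retry(scrape: Callable[[], Sequence],
                      max_attempts: int = 3,
                      delay: float = 2.0) -> List:
    """Call `scrape` until it returns at least one row.

    Logs each empty attempt so intermittent failures are visible,
    and raises RuntimeError if every attempt comes back empty.
    """
    for attempt in range(1, max_attempts + 1):
        rows = list(scrape())
        if rows:
            return rows
        print(f"attempt {attempt}/{max_attempts}: empty result")
        if attempt < max_attempts:
            # Give the page time to finish loading before trying again.
            time.sleep(delay)
    raise RuntimeError(f"no rows extracted after {max_attempts} attempts")
```

Logging the attempt number also shows how often the first try really comes back empty, which may help narrow down whether the cause is timing or the selector.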