I have a table in a web page with let’s say 10 rows with two columns: Col1 and Col2.
How can I extract data from nth Row row to mth Row, for example I want to extract data from 3rd row to next 5 rows i.e. till 7th row using data scrapping in UiPath.
Thank you for your reply, the issue is actually the HTML table is very huge with approx 90000+ rows and it is in the single page, and the scrapping (both modern and classic) fails with error “HRESULT E_FAIL has been returned from a call to a COM component” while using the wizard. So I thought of scrapping the data with first 1000 rows and then again 1000 rows and so on and finally merge into a data table when all the rows are scrapped.
I have also tried using Find Children activity and GetText Activity after that and I am able to get the data. But it takes huge time to loop through 90000+ uiElements from the Find Children activity. So though of any fastest way by data scrapping.
We can limit number of rows to be scrapped in the data scrapping wizard, but can we manipulate metadata to provide any condition to scrape from 1001th row in the second time for next 1000 rows, 2001th row in the third time for next 1000 rows etc.
In your flow after build data table activity which activity did you use, it is appearing as missing activity in my workspace. Do I need to install any package for that? I am using studio 2020.10.2. Please elaborate on that activity.
I have already mentioned that I have tried this and it was successful, but you can imagine how much time it takes to loop through each children when the table has more than 90000+ rows. I need a faster solution to scrape the data as per business demand.
I’m sorry the data is private so can not share but if you share the detailed flow, I can try and let you know if it is working for me. The example you have given, I have similar kind of table with more rows and columns.
I have tried your way but getting an error while deserializing, Can you help me a bit.
Though I can’t give you the URL, but I have created a small table similar to my requirement and the html code is given below (As a new user, it is not allowing me to attach). Please have a look.
Sure will help you. Just one question before
When you open manually the above provided URL (was done in FireFox, feel free to adopt it to Chrome or other browser), was the starter help xaml running or not at your end?
I am running it in Chrome, though the error occured for both Chrome, Firefox and Edge browser. However the issue is resolved, just formatted the HTML string to replace " " to empty string and the deserialization is successful and the data table was created. Thank you for showing me the correct way, it is a great help to me.
No the data table has one blank row in the bigining. Rest of the table is perfect and it has the footer row as well. I have created the columns as same as the table in the Build Table activity.
ok, was just thinking about using the th elements for dynamic constructing the datatable.
But if its working then it is fine.
Just do your final testing and let us know the result by marking the solving post as solution or your further open questions. Maybe you also can us give some feeback on No of rows, processing time of the datatable extraction.Thanks
I have tested this with 14 users I have now who has huge data and it is working fine as of now. So marking it as a solution. Will catch up again if I face any issue in future.
Just need to resolve one thing, the amount column is getting converted into String, is there any fastest way to convert it back to Int32. I need those in Int32 format. I have tried ChangeDataColumn activity but it generates error cannot convert DbNull to integer. Any idea? @ppr
Can you show me actually LINQ From i in… Statement which is in use
And give some details on columnstructures and how the empty target structure is set up