When data scraping, is it possible to get data on a partial html/css match?

Gleedo · November 28, 2019, 1:31pm

Hi all,

When scraping data from the following link, it successfully scrapes the first 5 product names and then skips a number of products.

On inspection, this seems to be because the skipped products have a different class name from the one the initial match was created on. The scrape is looking for a class name of ‘s-item__title s-item__title--has-tags’, and the ones that get missed have a class name of just ‘s-item__title’.

So my question is, is there a way to edit the data definition so that any class name matching ‘s-item__title*’ is picked up (if that makes sense)?

Or if that is not possible, can you offer some other possible solution for this problem?

Thanks

Gleedo · November 28, 2019, 4:00pm

So far the only way I have found to get both sets of data from different html/css tags is to do two separate scrapes and then merge the data tables.

Even then, some of the rows from each set end up in the wrong order in Excel, even with no sort order applied.
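
For anyone who wants to see the partial-match idea outside UiPath, here is a minimal Python sketch using only the standard library. It is not the UiPath data definition: the sample HTML, the product names, and the names `TitleCollector` and `scrape_titles` are made up for illustration. It treats the class attribute as a list of tokens and keeps any element whose list contains ‘s-item__title’, which is also how a CSS selector like `.s-item__title` behaves. Because everything is collected in one pass, the rows stay in page order and no merge is needed.

```python
from html.parser import HTMLParser

# Tags that never have a closing tag; they must not affect nesting depth.
VOID_TAGS = {"br", "img", "hr", "input", "meta", "link"}


class TitleCollector(HTMLParser):
    """Collect the text of elements whose class list contains a token."""

    def __init__(self, token):
        super().__init__()
        self.token = token
        self.titles = []
        self._depth = 0    # > 0 while inside a matching element
        self._buffer = []

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        if self._depth:
            self._depth += 1  # nested tag inside a matching element
            return
        # class="a b" is a space-separated token list, so split it.
        classes = (dict(attrs).get("class") or "").split()
        if self.token in classes:
            self._depth = 1
            self._buffer = []

    def handle_endtag(self, tag):
        if tag in VOID_TAGS or not self._depth:
            return
        self._depth -= 1
        if self._depth == 0:
            self.titles.append("".join(self._buffer).strip())

    def handle_data(self, data):
        if self._depth:
            self._buffer.append(data)


def scrape_titles(html, token="s-item__title"):
    """Return matching element texts in document order."""
    parser = TitleCollector(token)
    parser.feed(html)
    parser.close()
    return parser.titles


# Hypothetical sample: two titles carry the extra modifier class, one does not.
SAMPLE_HTML = """
<h3 class="s-item__title s-item__title--has-tags">Widget A</h3>
<h3 class="s-item__title">Widget B</h3>
<h3 class="s-item__subtitle">Not a title</h3>
<h3 class="s-item__title s-item__title--has-tags">Widget C</h3>
"""

if __name__ == "__main__":
    for title in scrape_titles(SAMPLE_HTML):
        print(title)
```

Matching on the whole token rather than a text prefix is deliberate here: a plain "starts with s-item__title" test on the raw attribute would work for this page too, but token matching avoids picking up unrelated classes that merely share a prefix.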

tera · November 29, 2019, 8:23am

hi,
This metadata may solve it.

metadata.txt (521 Bytes)

Thanks,
-Tera

Gleedo · November 30, 2019, 6:15pm

Thank you @tera, that did indeed work and everything is in the correct order.

Would you happen to know if the XML schema for the scraping metadata is documented anywhere? I would really like to know more about this, as I’m sure manual edits are quite a common thing to do.

Anyways, thanks for your response

tera · December 2, 2019, 1:39am

I’m glad I could help.

However, I could not find any documentation published by UiPath.
What I know is probably fragmentary and may not be accurate enough to teach to others.
I am sorry that I cannot help you further.

The best way to get accurate information about this is to contact the support team.

I hope you get the information you need.

