Data Scraping of Images

Hi,
I want to get the details of an ecommerce website including images of all the products of all the categories. But image is not being scraped using data scraping with .png or .jpg format.
It is being scraped in .html format. So, what should be the optimized way for this process?

Thanks in advance :slight_smile:

This is a good question and I’m having a similar problem.

I am scrapping a webpage where I can scrape about 95% of the data needed. However, the one item the scraper is struggling with is an image. The information the image has is valuable and the data is in it (I confirmed it because viewed the source of the webpage and the actual number that I need is present).

Is there anyway I can scape all this information together in one document? As opposed of doing it in separate documents and then merging them?

Looking forward to your response.

Okay, I think I’m getting closer. I can see the source with Edit Data Definition. However, after I click okay, the data definition is not changing. How can I edit the change permanently? Here’s where I’m at.

Thanks!

edit data definition

Did you try modifying at activity level after the Datascrape translates into an activity?

1 Like

Hi @vvaidya, thank you, this is great.

I’m still having problems extracting the stars in a review.
I was looking at the Advanced UI Automation, Lesson 5, video and it said (and showed) the following.

213
00:15:40,890 → 00:15:49,459
each block: A title, a cover, a date,
the name of the author, the price, rating,

214
00:15:49,459 → 00:15:50,950
and many others.

However, what do I need to do to the XML code to extract the data? Right now, I’m just getting blank columns like the one below. FYI, I’m doing this on a popular restaurant review website, not Amazon. This method does work on Amazon.

I feel like I’m so close, but there’s something I’m missing.

1 Like

Alright, getting a little further. Through the UiElement finder I discovered the rating option, but when I select it, it gives me the following error. Any ideas why I’m getting this error?

You need the title? Try unchecking the css-selector and check the title.

1 Like

That worked for the title. Thanks! I’m still having problems executing the action that I want, getting the ratingValue of the review. But I think I’ll just do it in Python. Thanks for all your help.

Python worked, but I don’t understand why UiPath recognized the image and not the number that was included in the html codes.

For example, it took this prop, which didn’t work because it’s an image. Data scraping just didn’t recognize it.

<div itemprop="reviewRating" itemscope itemtype="http://schema.org/Rating">

And NOT this one. it didn’t do the hidden figure 4.0 which is imbedded in the html source code.

#<meta itemprop="ratingValue" content="**4.0**">

if anyone has any ideas of how to make this work on uipath, that would be great. I’m still puzzled on what the program can’t and can pick up.

Hi vvaidya,
I am also trying to scrape images from an e-commerce website and unable to. After some google search , I stumbled upon this thread and was trying to figure out what exactly did you ask to modify at activity level? Below is my xml

Can Any one help me in Scraping Images as well as other datas. I’m Scrapping datas from a website , But I can’t Scrap the Images. When I’m Exporting to CSV Other datas excluding Images are present. Actually I’m a new comer in this RPA Industry . Kindly guide me And Sugesst me the Steps to Automate …

Thanks everyone for the leads here. I am able to extract the image from a Web Page Table using Data Scraping.

Step 1. Run The Data Scraping Wizard and extract your required columns including the Image column.
Lets Say your Image column id is 7.
Step 2. At this time your Image Column will show Null if you have selected text.
Step 3. Click on the Edit Data Definition in Scraping Wizard.
Step 4. Change the attr to “src” from “text”

Thanks!
Atul Trikha

2 Likes

Hi Can you help me for extract image as well.
According to your solution it extracted only image url not image.
How can i extract image?

Thanks

Nice, really works. Thanks