Hi,
I want to get the details of an ecommerce website including images of all the products of all the categories. But image is not being scraped using data scraping with .png or .jpg format.
It is being scraped in .html format. So, what should be the optimized way for this process?
This is a good question and I’m having a similar problem.
I am scrapping a webpage where I can scrape about 95% of the data needed. However, the one item the scraper is struggling with is an image. The information the image has is valuable and the data is in it (I confirmed it because viewed the source of the webpage and the actual number that I need is present).
Is there anyway I can scape all this information together in one document? As opposed of doing it in separate documents and then merging them?
Okay, I think I’m getting closer. I can see the source with Edit Data Definition. However, after I click okay, the data definition is not changing. How can I edit the change permanently? Here’s where I’m at.
I’m still having problems extracting the stars in a review.
I was looking at the Advanced UI Automation, Lesson 5, video and it said (and showed) the following.
213
00:15:40,890 → 00:15:49,459
each block: A title, a cover, a date,
the name of the author, the price, rating,
However, what do I need to do to the XML code to extract the data? Right now, I’m just getting blank columns like the one below. FYI, I’m doing this on a popular restaurant review website, not Amazon. This method does work on Amazon.
Alright, getting a little further. Through the UiElement finder I discovered the rating option, but when I select it, it gives me the following error. Any ideas why I’m getting this error?
That worked for the title. Thanks! I’m still having problems executing the action that I want, getting the ratingValue of the review. But I think I’ll just do it in Python. Thanks for all your help.
Hi vvaidya,
I am also trying to scrape images from an e-commerce website and unable to. After some google search , I stumbled upon this thread and was trying to figure out what exactly did you ask to modify at activity level? Below is my xml
Can Any one help me in Scraping Images as well as other datas. I’m Scrapping datas from a website , But I can’t Scrap the Images. When I’m Exporting to CSV Other datas excluding Images are present. Actually I’m a new comer in this RPA Industry . Kindly guide me And Sugesst me the Steps to Automate …
Thanks everyone for the leads here. I am able to extract the image from a Web Page Table using Data Scraping.
Step 1. Run The Data Scraping Wizard and extract your required columns including the Image column.
Lets Say your Image column id is 7.
Step 2. At this time your Image Column will show Null if you have selected text.
Step 3. Click on the Edit Data Definition in Scraping Wizard.
Step 4. Change the attr to “src” from “text”