Image URL extraction from Amazon Web Page

Hi Everyone!

I’m trying to extract data from Amazon and the “Extract Table Data” activity works just fine. I would like to extract the URL for the image as well using StudioX.

Thanks!

@Asm_6658
Welcome to the forum

Have a look here:

And here:

As described configure one column for the src attribute of the image (tag=img) element

Thanks, Peter! Appreciate your help.

The URL is getting extracted in this format – /HP-ENVY-6055e-Wireless-Printer/dp/B08XYRVSRT/ref=zg_bs_172646_1?_encoding=UTF8&psc=1&refRID=G61JAVYNPFAST5PT3SG2

Capture|690x248

The https part is missing.

Any other suggestions!!

Hi @Asm_6658 ,

In most of the website when you extract URL it won’t give full URLs. You need to modify it. After getting the url you need to append the basic url

For example:

Your extracted url is
x_URL= HP-DeskJet-2335-Advantage-Printer/dp/B08CYRFX24/ref=sr_1_1_sspa?dchild=1&keywords=HP+Printer&qid=1631081020&sr=8-1-spons&psc=1&spLa=ZW5jcnlwdGVkUXVhbGlmaWVyPUFBUFAxNzlHS1VRTFYmZW5jcnlwdGVkSWQ9QTA5OTgxNzUxNDVDSDlPOFpIVTdFJmVuY3J5cHRlZEFkSWQ9QTA2NTE4NTYxMjdWNlFTMUZWSzBSJndpZGdldE5hbWU9c3BfYXRmJmFjdGlvbj1jbGlja1JlZGlyZWN0JmRvTm90TG9nQ2xpY2s9dHJ1ZQ==

Then your basic url is
URL = “https://www.amazon.in/

Now concatenate the urls.
URL+x_URL

1 Like

Thanks, Manjula! :slight_smile:

1 Like

This topic was automatically closed 3 days after the last reply. New replies are no longer allowed.