Data scraping in web site

this is link in that i have to do

first you are open the link than it will show you link 8 links

Capture11

in that you have to click 1 link
after this link will open the pdf file
in that file you have to download this file and also you have to extract the data

data extract will be that

12

this data will be extract into excel file

and it will be saved on new folder like
download pdf file will be in download pdf folder

and data extract file means excel file in output folder
we have to click all of the 8 link using for each

please guys help me

@Anil_G

Hi @purnima_mandan

Please follow these steps

  1. Use data scrapping and get all the urls
  2. Loop through each url and use navigate to open the url and download the file… if they are file links…then directly use download file activity with the url to save the file
  1. Read pdf and use regex to extract table data

As it is a well constructed table can try opening pdf and use data scrapping for the table as well

Cheers

i was try and see the video but it was not working

because in that he will select only 1 link in http request but we have to do all the link

anyone can do this process can you help me

Hi @purnima_mandan

You have to combine it with for loop

Please read through all the steps

cheers

Hi,

The following sample will help you.

Sample20221211-2aL.zip (6.5 KB)

Regards,

RequestURL and Filename for response attachment, as the following.

Please note that These property name might be vary in different WebApi package version.

Regards,

can you able to do like i was explain in what should i do in the process
@Yoichi

Hi,

Does the above Sample20221211-2aL.zip work for you? If there is any problem, please share it.

Regards,

yes it was work but i have to download pdf file under downloadpdf folder
and data extract of result file under output folder
and also extract pdf data in different xlsx file like download pdf separete that like xlsx file also separate

Hi,

How about the following sample?

Please modify the following 2 variable to what you need.

image

Sample20221211-2aLv2.zip (6.6 KB)

Regards,

it will not coming in download folder and Output folder
and
12

can we take exactly data scarping like these way in excel file

Hello @purnima_mandan

Have you tried with the Modern Table extraction. That will help you to easily extract the urls and tables.

Thanks

Hi,

can we take exactly data scarping like these way in excel file

Can you try the following?

Sample20221211-2aLv3.zip (13.1 KB)

it will not coming in download folder and Output folder

How did you set downloadFolder variable and outputFolder variable?
Can you share your project or screenshot?

Regards,


this is your output
but i want like this output

in that picture there will be 2 folder output and downloadfilepdf
so bot will create these 2 folder automatically and output xlsx file and pdf file come under this folder
@Yoichi

Hi,

It’s good to just modify your folder settings and create it as the following.

Sample20221211-2aLv4.zip (13.4 KB)

Regards,

thank you this was a proper Bot Process
thank you soo much @Yoichi

1 Like

This topic was automatically closed 3 days after the last reply. New replies are no longer allowed.