How to scrape multiple pages when the pagination is labelled 1, 2, 3, 4

Hello,
I want to scrape data from a website with multiple pages that has no “Next” or “>” link.
The pagination is numbered 1, 2, 3, 4, …

I am using the Data Scraping feature in UiPath.

This is the website: https://www.handelsregister.de/rp_web/result.do?Page=1

I would appreciate any help with this.

Thanks

Hello @azizakinsola. The link you sent redirects to a search result with an empty query. Do you want to scrape the data on the left side? If you can elaborate, I can help you out.

Hello Karl,
Thank you for your response.
Open the link, then click on “Advance search” at the side.

Choose any option in the Register court.

Then search.

On the results page, click on “Double search result” as many times as needed, as shown below.

The pagination will then show the different pages, as shown.

Thanks

Hi @azizakinsola ,

First, store the number of pages in a variable, e.g. TotalPage = 5.

Then run a do-while loop until Count > TotalPage.

Inside the loop, click on each page link selector, apply your filters, scrape the data, and save it into a DataTable.
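To make the loop idea concrete, here is a rough sketch in Python rather than UiPath itself. It only shows the counting logic: start at page 1, scrape, increase the counter, and stop once Count > TotalPage. The `scrape_page` function is a hypothetical stand-in for the click-and-scrape steps, and the list stands in for the DataTable.

```python
def scrape_all_pages(total_pages, scrape_page):
    """Visit pages 1..total_pages and collect the rows from each one.

    scrape_page is a hypothetical callback: given a page number, it
    returns the rows found on that page (the click + scrape steps).
    """
    rows = []   # stands in for the DataTable
    count = 1
    while True:  # do-while style: the body runs before the check
        rows.extend(scrape_page(count))
        count += 1
        if count > total_pages:
            break
    return rows


if __name__ == "__main__":
    # Fake scraper, used only to show the loop visiting each page once.
    fake_scraper = lambda page: [f"row from page {page}"]
    for row in scrape_all_pages(3, fake_scraper):
        print(row)
```

Like any do-while, the body runs at least once, so this assumes there is always at least one result page.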

I am actually new to UiPath. I am using the Data Scraping feature in UiPath, and I think it uses a do loop.
Could you please show me an example of how to do it with a do-while loop?
Thank you

Is it possible to share your webpage link? I can make a workflow for you.

Here I’m attaching a simple do-while loop.
help.xaml (6.4 KB)

Yes.

This is the website: Registerportal

Click on “Advance search”, then choose any option in the “Register Court” and search.

Click on “Double search result” as many times as possible to show the other pages in the pagination.

Thank you.

Hello @azizakinsola

Sorry for the late reply. That is actually easy, and there are two ways to do it. Let’s assume that after you click on ‘Double the limit for search this’ you get 3 pages (1, 2, 3).

Method one: You can place a Click activity to click on the blue number ‘1’ on the site. Make sure that the aaname in the selector is also 1. Then scrape any data you want from the first page. After you’re done, do the same for the second and third numbers.

Method two: You can scrape data from every page by copying each URL to the clipboard and sending a hotkey (Ctrl + V) to the address bar.

Example

https://www.handelsregister.de/rp_web/result.do?Page=1
https://www.handelsregister.de/rp_web/result.do?Page=2
https://www.handelsregister.de/rp_web/result.do?Page=3

Both will work, but the first option is faster as you don’t need to click on the address bar before pasting the URL. If you have any questions, just ask!
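For method two, the URLs in the example differ only in the Page number, so they can be generated instead of typed out. A minimal sketch in Python (not UiPath), assuming the `?Page=N` pattern holds for every result page; `page_urls` is just an illustrative helper name:

```python
# Base address taken from the example URLs in this thread.
BASE_URL = "https://www.handelsregister.de/rp_web/result.do"


def page_urls(total_pages):
    """Build one URL per result page, numbered from 1 to total_pages."""
    return [f"{BASE_URL}?Page={n}" for n in range(1, total_pages + 1)]


if __name__ == "__main__":
    for url in page_urls(3):
        print(url)
```

In a workflow, the same idea would be a loop that builds the string for each page number and passes it to the browser.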

Hi @azizakinsola ,

Find the workflow here.

RegisterPortal.xaml (24.8 KB)

Thanks,
Manjula

Thank you Karl, I am going to try it and see if it works.

Thank you so much Manjula. I really appreciate you taking the time to create the workflow for me. Looking through it, I am sure it will do exactly what I want.

But I am not sure which image should be on the “send hotkey ‘SELECT’” activity,

and on the “Get text ‘SPAN’” activity.

Thank you Manjula

I have fixed it, Manjula. Thank you so much.

Hi @azizakinsola , Glad it worked for you.
