Need to get the data from the browser

Hi all,

I need to get data from a web page similar to https://www.w3schools.com/html/default.asp

I need to capture the data in the left-hand menu of the page, where entries such as
HTML Tutorial
HTML Forms
HTML Graphics
HTML Media
and so on are headings. I need the topic names under each of these headings and the number of topics per heading.

How can I do that?

@Prashanth_Paladugu

  1. Open Browser:
  • Use the “Open Browser” activity to navigate to the specified URL (https://www.w3schools.com/html/default.asp).
  2. Data Scraping:
  • Use the “Data Scraping” wizard to extract data from the left side of the browser.
    • Indicate the first element on the left side (e.g., “HTML Tutorial”) to start the data extraction.
    • Follow the wizard to indicate and select the next elements and sub-elements that you want to capture.
  3. Configure Data Scraping:
  • Configure the data scraping wizard to extract each heading together with the topic names under it and their count.
  4. Review the Extracted Data:
  • After completing the data scraping, you will get a DataTable containing the extracted data.
  5. Process the Extracted Data:
  • Use a “For Each Row” activity to iterate through the DataTable.
  • Inside the loop, you can access the values using row("ColumnName") and process them as needed.

Thanks for the update. But in my situation we do not know in advance where the next heading starts. In the example above it is clear which topics belong to HTML Tutorial, HTML Forms, etc., but I need this to work dynamically. We have a similar structure, with one heading after another and spacing between them.

And I’m not getting the topic headings in the scraping results.

When working with the Find Children activity, we can get the headings and the topic items together in a single result.

Postprocessing the result lets us split on H2 and bundle each heading with its corresponding items.
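
To illustrate the split-on-H2 idea outside of UiPath, here is a minimal Python sketch. It is not the workflow from this thread, only the same grouping logic. The sample markup, the `FlatCollector` class and the `group_by_heading` function are made up for illustration, and it assumes the menu is a flat run of `<h2>` headings, each followed by its `<a>` topic links, as on the example page.

```python
from html.parser import HTMLParser

# Hypothetical sample markup imitating the left menu: every <h2> heading
# is followed by sibling <a> links until the next <h2> appears.
SAMPLE_HTML = """
<div id="leftmenu">
  <h2>HTML Tutorial</h2>
  <a href="#">HTML HOME</a>
  <a href="#">HTML Introduction</a>
  <a href="#">HTML Editors</a>
  <h2>HTML Forms</h2>
  <a href="#">HTML Forms</a>
  <a href="#">HTML Form Attributes</a>
</div>
"""


class FlatCollector(HTMLParser):
    """Collect (tag, text) pairs for <h2> and <a> elements in document order."""

    def __init__(self):
        super().__init__()
        self.items = []    # flat result, comparable to a list of child elements
        self._tag = None   # tag currently being captured, if any
        self._text = []    # text fragments seen inside that tag

    def handle_starttag(self, tag, attrs):
        if tag in ("h2", "a"):
            self._tag = tag
            self._text = []

    def handle_data(self, data):
        if self._tag is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == self._tag:
            self.items.append((tag, "".join(self._text).strip()))
            self._tag = None


def group_by_heading(items):
    """Start a new group at every h2 and attach the following items to it."""
    groups = {}
    current = None
    for tag, text in items:
        if tag == "h2":
            current = text
            groups[current] = []
        elif current is not None:
            groups[current].append(text)
    return groups


collector = FlatCollector()
collector.feed(SAMPLE_HTML)
sections = group_by_heading(collector.items)

for heading, topics in sections.items():
    print(f"{heading}: {len(topics)} topics -> {topics}")
```

Because a new group starts whenever an H2 shows up, the headings do not have to be known in advance, and the count per heading is just the length of each list.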
