How do I **Extract Table Data** with hyperlinks from a webpage?

@PeruT
it can be achieved while configuring the custom datascraping:

  • start Datascrapping Wizard
  • indicate first cell
  • click no for
  • configure first column (indicating firdt and second cell)
  • click on extract correlated data
    • indicate first cell Company column
    • configure:
      grafik
  • click on extract correlated data
    • click on first link, second link, configure:
      grafik
      get:
      grafik

and so on

Kindly note to enhance the retrieved link withe base url info:
retrieved: market-activity/ipos/overview?dealId=1139176-95261
base url: https://www.nasdaq.com/

For Filling up / Composing BaseURL and URL we can do it without any for each and column updates:

  • define an addtional Datacolumn: Name: BaseURL, DEFAULTVALUE: the base URL
    grafik

  • Add an additional Datacolumn: Name: “FullUrl”

  • Define an Expression (Compution Rule) for Full URL Column:
    grafik

Result:

After this with a

ExtractDataTableVar.DefaultView().toTable(False,{"Symbol", "Company Name", "FullUrl",..... All other Columns to keep})

the helper columns URL, BaseUrl can be easily removed.

The described approach is a variation of Thomas’ suggestion. Feel free to combine both of it

2 Likes