Personally, I would use the Get Visible Text activity, which will get everything on the screen, and then use the Matches activity to find only the text next to href=
You can use the FindChildren activity with the FIND_DESCENDANTS scope to get the elements you're interested in. You can tweak the filter as you like ("<webctrl tag='A|LINK' matching:tag='regex' />" for example; I don't have Studio here to check, but you get the idea).
You can then iterate over the elements and get their href attribute (GetAttribute is an option, as you mentioned).
EDIT: Another approach would be to get the page source as text, with an HTTP activity for example, and then use either an HTML parser or a regex. If you prefer this approach, I can elaborate.
Thank you, that is really helpful. I am trying to use the Weblink Extractor. In the For Each activity, I get an exclamation mark when I put 'item' in the Write Line activity. Do you know why this might be?
Use the HTTP Request activity with your URL as the endpoint and you'll have the page source code as output (a String). You can then use a regex to find the links, for example with pattern = "(?<=\bhref\="")[^""]+?(?="")" (the doubled quotes are just how VB escapes a quote inside a string literal). A better approach would be to use an HTML parser (I'm a noob in VB and I don't know of any, but in Python you can use BeautifulSoup).
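To make the regex idea concrete outside of Studio, here is a minimal Python sketch of the same pattern. It is only an illustration, not the attached workflow: the function name `extract_hrefs` and the sample markup are made up, and the string simply stands in for whatever the HTTP Request activity returns.

```python
import re

# Same idea as the VB pattern, without the doubled quotes:
# match whatever sits between href=" and the next double quote.
HREF_PATTERN = re.compile(r'(?<=\bhref=")[^"]+?(?=")')


def extract_hrefs(page_source):
    """Return every double-quoted href value found in the page source."""
    return HREF_PATTERN.findall(page_source)


# Hypothetical sample markup, standing in for the HTTP Request output.
sample_source = (
    '<p><a href="https://example.com/a">A</a> '
    '<a class="x" href="/b?id=1">B</a> '
    "<a href='/single-quoted'>C</a></p>"
)

links = extract_hrefs(sample_source)
print(links)
```

Note that the pattern only sees double-quoted values, so the single-quoted link in the sample is skipped, and it will match href on any tag, not just A. That is part of why a parser is usually the safer choice.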
In the attached workflow, you'll find one sequence with HTTP Request and Regex and another sequence with FindChildren.
For each case, you'll find the result both as a String with NewLine as the separator and as an Array of String. With FindChildren, I filter on A elements only, but you can edit the selector. Each variable's scope is kept to its own sequence.
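For comparison, here is a rough Python sketch of the parser-based route, producing the same two result shapes (a list and a newline-separated string) and keeping A elements only. It uses the standard library's `html.parser` rather than BeautifulSoup so that it runs without extra installs; the class and function names and the sample markup are hypothetical.

```python
from html.parser import HTMLParser


class AnchorHrefCollector(HTMLParser):
    """Collects the href attribute of every <a> start tag."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names for us.
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                self.hrefs.append(value)


def collect_anchor_hrefs(page_source):
    """Return the href values of all A elements, in document order."""
    collector = AnchorHrefCollector()
    collector.feed(page_source)
    collector.close()
    return collector.hrefs


# Hypothetical sample markup, standing in for the real page source.
sample_source = (
    '<a href="https://example.com/a">A</a>'
    "<a href='/b'>B</a>"
    '<link href="style.css">'
    '<a name="no-href">C</a>'
)

links_array = collect_anchor_hrefs(sample_source)  # list of links
links_text = "\n".join(links_array)                # newline-separated string
print(links_text)
```

Unlike the regex, this handles single-quoted attributes and ignores href on other tags such as LINK.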