Extract the HTML content from a webpage

How can I extract the HTML content from a webpage using UiPath activities or packages?

I plan to generate selectors to download the required files based on the extracted HTML content using a script.

Hi @Sathish_Kumar_S

Please refer to the post below.

@Sathish_Kumar_S

Indicate the element whose HTML you need, then use Get Attribute with the innerhtml attribute.

Also, if you need the selectors for the files, why not get them directly?

And if there are multiple files, For Each UI Element would help.

cheers

Below is a screenshot from the website. The structure is not well defined; it contains multiple nested trees.
To extract the row number and document number from the HTML content, using Python code would be more efficient.
Could you please suggest the best way to extract the complete HTML content of the page?

@Sathish_Kumar_S

Open UI Explorer and use the top center panel to check whether the elements are related through the hierarchy. If they are, we can build the selector accordingly.

Also, since all the files appear to be A tags, you can use that tag to get the href and download the file.
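
If you end up processing the innerhtml text in a script instead, collecting the href of every A tag could look roughly like the sketch below. It uses only Python's standard html.parser module. The sample HTML, the LinkCollector class, and the extract_hrefs helper are made up for illustration; the real page will have a different structure.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collects the href value of every <a> tag it encounters."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; tag names arrive lowercased.
        if tag == "a":
            href = dict(attrs).get("href")
            if href:  # skip anchors that have no href attribute
                self.hrefs.append(href)


def extract_hrefs(html_text):
    """Return the href values of all <a> tags in html_text, in document order."""
    parser = LinkCollector()
    parser.feed(html_text)
    parser.close()
    return parser.hrefs


# Hypothetical sample standing in for the innerhtml returned by Get Attribute.
sample_html = """
<table>
  <tr><td><a href="/docs/1001.pdf">Doc 1001</a></td></tr>
  <tr><td><a href="/docs/1002.pdf">Doc 1002</a></td></tr>
  <tr><td><a name="no-link">No link</a></td></tr>
</table>
"""

print(extract_hrefs(sample_html))
```

The hrefs may be relative, so they may need to be joined with the site's base URL before downloading.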

cheers


How do I use the tag to get the href to download the file?

I would like to get the full-page HTML content. Can you help me with the right UiPath activities or package?

Because I need to know the row number, document number, and column name attribute values to click only the required files.
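
To illustrate the kind of filtering I have in mind, here is a rough Python sketch using the standard html.parser module. The attribute names data-row, data-docnum, and data-colname, the sample HTML, and the find_links helper are all placeholders I made up; the real page uses its own attribute names.

```python
from html.parser import HTMLParser


class AnchorAttributeCollector(HTMLParser):
    """Records the attributes of every <a> tag as a dict."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self.links.append(dict(attrs))


def find_links(html_text, wanted):
    """Return the attribute dicts of <a> tags that match every pair in wanted."""
    parser = AnchorAttributeCollector()
    parser.feed(html_text)
    parser.close()
    return [
        link
        for link in parser.links
        if all(link.get(name) == value for name, value in wanted.items())
    ]


# Hypothetical nested markup; the attribute names are assumptions.
sample_html = """
<div><div>
  <a href="/files/a.pdf" data-row="1" data-docnum="D-100" data-colname="Invoice">a</a>
  <a href="/files/b.pdf" data-row="1" data-docnum="D-100" data-colname="Receipt">b</a>
  <a href="/files/c.pdf" data-row="2" data-docnum="D-200" data-colname="Invoice">c</a>
</div></div>
"""

invoices = find_links(sample_html, {"data-colname": "Invoice"})
print([link["href"] for link in invoices])
```

The matching attribute values could then be used to build the selector for each file that needs to be clicked.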

@Sathish_Kumar_S

Get Attribute will help you get the href.

In Get Attribute, you can use the A tag alone in the selector and increase the index to get all the values.

The full-page HTML content might not have the required details.

But here is how you can get it: use a Get Attribute activity, indicate the full screen, and select innerhtml in the attribute field. The output will be the full HTML.

cheers


I’m unable to select the entire page using the Get Attribute method; it only works for specific elements like tables or text.

In the UiPath Forum post, Jinal Shah mentioned that we can use the “UiPath.HTML.Activities” package to extract HTML content. However, this package does not appear to be available in the official feeds. Could you please check and suggest an alternative?

Additionally, is it possible to use the HTML Agility Pack? If so, could you guide me on how to use it?

@Sathish_Kumar_S

Just position the cursor carefully when indicating.

Or select any element in UI Explorer, go to the top row in the top center window and select it, then remove all the rows below it from the selector in the bottom center panel. That gives you the element for the full page.

cheers