How to extract open graph meta data from a webpage?

marksmayo · June 26, 2019, 11:08pm

Learning RPA with UIPath. Happily extracting onscreen data from a website, processing it, using it, etc. Navigating in browsers, clicking links and all that.

However, there’s information in the page that isn’t visible, but is in the source, eg, open graph meta tags:

<meta property="og:image" content="https://example.com/foo.jpg" />

What options are open to me to extract this with UIPath? I gather there’s an ExtractMetaData flag from ExtractData but I’ve yet to find a useful tutorial that I can follow at this stage

I found this forum piece, but the attachment someone provided to solve it errors when I open up in UIPath Studio, and well, I just feel this is enough of a common thing that surely someone has done it before with some basic steps for me?

Many thanks,

Mark

TomDiFulvio · June 27, 2019, 3:50am

Off the top of my head I can think of a manual way to do it. You would:

Right click on the page and select “View Page Source”.
Copy the text into memory.
Use the extract text snippet that uses regex to get the actual values out of the text.

Other than that, HTML is essentially XML so you could repurpose this to parse the copied text and get your meta tags out.

You could make the text copying more robust by downloading the page source using something like this.

Either of those suit your needs or do you need some other ideas?

Topic		Replies	Views
How to extract html tags using uipath? Studio uiautomation , robot , activities , studio , question , workflow_diff , append_tags , code_review , html , code , extract-tag , read-html	13	4700	June 27, 2022
Extract data from graph from a open website Activities datatable , excel , selector , uiautomation , pdf , mail , orchestrator , activities , studio , data_scraping , web , string , studiox , question , activities_panel	6	1657	April 20, 2022
Extraction of info from web page into excel file Help	8	4794	August 26, 2019
Data_Extraction from a website Help browser , activities , data_scraping , web , question	6	1155	December 5, 2019
Extraction of data from a graph in an Application Help studio	1	1012	October 13, 2020

How to extract open graph meta data from a webpage?

Related topics