Data scrapping for getting Inner Html of a tag

@Sunitha_Premakumaran
Let’s continue on the results from above. A closer look on the structure had the outcome that the content is not offered in a structured form (Question - Answer). Instead it is available in the form Question - Answer Part 1…Answer Part n). Also there are tables and other execptions present.

However to start I did for you following demo implementation with the purpose to provide you an initial entry and showing some working tools.

Read In all Article parts is done with Find children
grafik
filtering to all direct children

With a switch on the HTML Tag Name the children sequence is processed (Demo: Log Messages)

in the case of answers with an addtional find children the links from the answer block are fetched as well:

and as a first result it is logging:
grafik

Just take it as base and incorporate more structure variation handling and dumping out in a datatable / Excel by your own - Here is the XAML:
Sunitha_Premakumaran_2.xaml (11.3 KB)

For adressing your initial question (getting innerhtml) find an alternate retrieving the entire article part as innerhtml. XAML is here:
Sunitha_Premakumaran_1.xaml (6.4 KB)

1 Like