HTML is ideal for parsing.
Otherwise, we can also set up a custom extraction, e.g. by finding an element's children, etc.
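
As a rough illustration of a custom extraction based on child elements, here is a minimal sketch using only Python's standard-library html.parser. The names ChildTextExtractor and extract_children are hypothetical, chosen for this example, and the code assumes reasonably well-formed markup (every non-void tag is closed). A dedicated parsing library may well be the better fit in practice.

```python
from html.parser import HTMLParser

# Void elements never get a closing tag, so they must not change the depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class ChildTextExtractor(HTMLParser):
    """Hypothetical helper: collect the text of each direct child of the
    first <parent_tag> element found in the document."""

    def __init__(self, parent_tag):
        super().__init__()
        self.parent_tag = parent_tag
        self.depth = 0      # 0 = outside the parent, 1 = inside it, 2+ = inside a child
        self.done = False   # set once the parent element has been closed
        self.children = []  # [tag, text] pairs, one per direct child

    def handle_starttag(self, tag, attrs):
        if self.done or tag in VOID_TAGS:
            return
        if self.depth == 0:
            # Still looking for the parent element.
            if tag == self.parent_tag:
                self.depth = 1
            return
        if self.depth == 1:
            # A tag opened directly inside the parent is a direct child.
            self.children.append([tag, ""])
        self.depth += 1

    def handle_endtag(self, tag):
        if self.done or self.depth == 0 or tag in VOID_TAGS:
            return
        self.depth -= 1
        if self.depth == 0:
            self.done = True

    def handle_data(self, data):
        # Text at depth 2 or deeper belongs to the most recent direct child.
        if not self.done and self.depth >= 2:
            self.children[-1][1] += data


def extract_children(html, parent_tag):
    """Return (tag, text) tuples for the direct children of parent_tag."""
    parser = ChildTextExtractor(parent_tag)
    parser.feed(html)
    parser.close()
    return [(tag, text.strip()) for tag, text in parser.children]
```

For example, extract_children("<ul><li>one</li><li>two</li></ul>", "ul") walks the list items under the first ul element and returns their tag names and text. Text nested deeper inside a child is folded into that child's text rather than reported separately.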