We would recommend processing the file directly, since it already has a schema, e.g. that of a spreadsheet.
Here are the options to consider:
- HTML/XML parsing and processing
- XSLT (XML stylesheet transformations) to convert it into tabular data
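
As a rough illustration of the first option, here is a minimal Python sketch. It assumes the file follows the Excel 2003 "XML Spreadsheet" layout (Workbook / Worksheet / Table / Row / Cell / Data in the `urn:schemas-microsoft-com:office:spreadsheet` namespace); your file may differ. The `SAMPLE` document and the function name `rows_from_spreadsheet_xml` are made up for the example, and sparse cells marked with `ss:Index` are not handled.

```python
import xml.etree.ElementTree as ET

# Assumption: the file uses the Excel 2003 "XML Spreadsheet" namespace.
NS = {"ss": "urn:schemas-microsoft-com:office:spreadsheet"}

# Hypothetical sample input, standing in for the real file.
SAMPLE = """<Workbook xmlns="urn:schemas-microsoft-com:office:spreadsheet"
          xmlns:ss="urn:schemas-microsoft-com:office:spreadsheet">
  <Worksheet ss:Name="Sheet1">
    <Table>
      <Row>
        <Cell><Data ss:Type="String">name</Data></Cell>
        <Cell><Data ss:Type="String">qty</Data></Cell>
      </Row>
      <Row>
        <Cell><Data ss:Type="String">apple</Data></Cell>
        <Cell><Data ss:Type="Number">3</Data></Cell>
      </Row>
    </Table>
  </Worksheet>
</Workbook>"""


def rows_from_spreadsheet_xml(xml_text):
    """Return every Row in the document as a list of cell text values."""
    root = ET.fromstring(xml_text)
    rows = []
    for row in root.iterfind(".//ss:Row", NS):
        cells = []
        for cell in row.findall("ss:Cell", NS):
            data = cell.find("ss:Data", NS)
            # A Cell without a Data child is treated as empty.
            cells.append(data.text if data is not None else None)
        rows.append(cells)
    return rows


if __name__ == "__main__":
    for r in rows_from_spreadsheet_xml(SAMPLE):
        print(r)
```

The resulting list of rows can then be written out with the `csv` module or loaded into a table structure. All values come back as strings; converting them according to `ss:Type` is left out of this sketch.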
Below is an example of XML spreadsheet processing: