We try to extract automatically files from WORD or PPT.
There is a way by creating a zip and getting some data from such an archive file. It works well for media files as pictures.
For other files, there are listed in a repository named “embeddings”, as files xxx.bin.
Trying to rename those file xxx.bin with other suffix as .doc, .xls, …, does not allow us to open the file.
Is there some who could help us to succeed?
Thanks in advance.