How to extract the tabular data from pdf which contain text as well as structure data and page number is not fixed too…
Hi @devendra_bhardwaj2, welcome to the Community.
If the data in the file is not available in a fixed position or if the file is of semi-structured/unstructured format, then you could consider the option of implementing Document Understanding to extract such data.
Using DU, you can build & retrain ML models which will help you extract the data from files like this.
Hope this helps,
Best Regards.
Hello @devendra_bhardwaj2 ,
Is the pdf structure fix?
If at all its fixed you can simply use regex along with String.indexOf function to get contents from pdf.
No it is dynamic data…