How can I scrape data from an unstructured PDF where the information is divided into three columns on a single page? I need to organize this unstructured data into a structured format and then extract the required information using regex and put in separate columns
Here is the sample screenshot of it.
Either provide a sample pdf or provide the output of read pdf text with preserve format and without to analyze the format and give a suggestion
Cheers
This is the PDF file I am trying to scrape, containing information spanning from page 3 to 129, where the data is organized into three columns on a each page. How can I extract the information from it?
provider_results.pdf (364.1 KB)