Extracting data from a PDF with fixed headings

Hi everyone,

Can anyone help me with this scenario? I have a PDF file in which each paragraph has a heading.

Can I extract just those headings?

Alternatively, if I give a heading as the start index, can I extract the paragraph below that heading?

The only issue is that I don’t have a fixed end index. Only the start index, i.e. the heading, is fixed.


Can you show us the PDF?

Hi,

Please find the attached sample file for reference.

file-example_PDF_1MB.pdf (1017.7 KB)

I’m trying to extract the data below each heading.

The headings might change. My input is 5 headings, which I will store in an asset and pass one by one to pick up the data below each heading.
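
One way to handle the missing end index is to treat the next known heading as the end of the current section. The snippet below is only a rough sketch in Python, which the thread does not mention, and it assumes the PDF text has already been extracted into a plain string by whatever tool you use. The function name `extract_sections` and the sample text are made up for illustration. It also assumes each heading sits on a line of its own.

```python
import re


def extract_sections(text, headings):
    """Map each heading found in `text` to the text up to the next known heading."""
    # Locate the first occurrence of every heading that fills a whole line.
    found = []
    for heading in headings:
        pattern = r"^[ \t]*" + re.escape(heading) + r"[ \t]*$"
        match = re.search(pattern, text, flags=re.MULTILINE)
        if match:
            found.append((match.start(), match.end(), heading))
    found.sort()  # order the headings by their position in the text

    sections = {}
    for i, (_, body_start, heading) in enumerate(found):
        # The body ends where the next heading starts, or at the end of the text.
        body_end = found[i + 1][0] if i + 1 < len(found) else len(text)
        sections[heading] = text[body_start:body_end].strip()
    return sections


# Hypothetical sample standing in for text extracted from the PDF.
sample = """Introduction
This report covers the first quarter.
It has two lines.

Results
Revenue grew.

Conclusion
Nothing else to add.
"""

sections = extract_sections(sample, ["Introduction", "Results", "Conclusion", "Appendix"])
print(sections["Results"])
```

Because the headings are passed in as a list, they can change between runs without touching the logic. A heading that is not found in the text (like "Appendix" here) is simply left out of the result.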