Need to remove PDF Headers and Footers

I’m working on PDF extraction. My PDF has headers (document type, name, and other details) and footers (page numbers). When I extract the text, these headers and footers come through as well, so I need to remove them.

Any suggestions would be appreciated.


Can you please share which read method you are using to extract data from the PDF?

I’m using the Read PDF Text activity.

Can you share your PDF file? The way to remove the header and footer depends on the PDF file’s content.


Read PDF Text gives you a single string containing all the text from the PDF file. In that case, the only way to remove the header and footer is string manipulation: use String.Split, Substring, or a regex to strip them out.
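
To make the regex idea concrete, here is a minimal sketch in Python rather than a UiPath workflow. Since the actual file can't be shared, the patterns are assumptions: a header line starting with "Document Type:" and a footer that is either a bare page number or "Page N of M". The function name and both patterns are hypothetical and would need adjusting to the real text.

```python
import re

# Hypothetical patterns: adjust them to the header/footer text in the real PDF.
# Header: a line that starts with "Document Type:" (assumed wording).
HEADER_RE = re.compile(r"Document Type:.*")
# Footer: a bare page number such as "12", or "Page 12 of 300" (assumed layout).
FOOTER_RE = re.compile(r"(?:Page[ \t]+)?\d+(?:[ \t]+of[ \t]+\d+)?", re.IGNORECASE)


def strip_headers_and_footers(text):
    """Drop every line that is entirely a header or a footer."""
    kept = []
    for line in text.splitlines():
        stripped = line.strip()
        # fullmatch only removes lines that consist of nothing but the pattern,
        # so body text that merely contains a number is kept.
        if HEADER_RE.fullmatch(stripped) or FOOTER_RE.fullmatch(stripped):
            continue
        kept.append(line)
    return "\n".join(kept)
```

Note that the footer pattern also drops any body line that is nothing but a number, so check it against a few real pages first. The same patterns should carry over to .NET's Regex.Replace inside an Assign activity, although that has not been tested here.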

It is a confidential PDF file.

But with Split or Substring we would have to handle the text page by page, and that PDF file has nearly 300-400 pages.
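
One possible way around the page count, sketched in Python under some assumptions: because the extracted text is one string, the whole document can be cleaned in a single pass without splitting it into pages. The sketch below treats any line that repeats many times (after replacing digits, so "Page 1" and "Page 2" count as the same line) as a header or footer. The function names and the `min_repeats` threshold are hypothetical, and it assumes each header and footer sits on its own line.

```python
import re
from collections import Counter


def normalize(line):
    """Replace digit runs so 'Page 1' and 'Page 2' count as the same line."""
    return re.sub(r"\d+", "#", line.strip())


def remove_repeated_lines(text, min_repeats=3):
    """Drop lines whose normalized form occurs at least min_repeats times."""
    lines = text.splitlines()
    # Count non-blank lines only, so blank lines are never treated as repeats.
    counts = Counter(normalize(line) for line in lines if line.strip())
    repeated = {key for key, n in counts.items() if n >= min_repeats}
    return "\n".join(line for line in lines if normalize(line) not in repeated)
```

For a 300-400 page file, a much higher `min_repeats` (for example, close to the page count) would lower the risk of deleting body lines that happen to repeat, such as table rows that contain only numbers.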