I am using regex to take certain parts of a PDF but it also takes page no and other lines that I don’t need. I am not sure how to remove the words, since it can change from pdf to pdf.
The only thing that remains constant in all the pdf’s is the page number format.
I have attached the image below.
I want to remove everything circled red and highlighted parts are not constant.
Thanks in Advance