I have a PDF file and want to extract one specific piece of data from it.
If I open the PDF manually, then copy and paste its content into Notepad, I get the content (the address) line by line.
If I use the Read PDF activity followed by the Write Text File activity, the output contains the full text, including the table values beside the address, and I cannot get the address on its own.
Please help me extract just that specific content from the PDF.
Note: There is no keyword that marks the start or end of the content.
Please read the PDF first. If it is a normal (text-based) PDF, use the Read PDF Text activity; if it is a scanned PDF, use Read PDF With OCR.
Thank you for the info. I would like help with one more thing: I have been trying to work with Regex since Friday but have not found a solution. I want to apply a Regex to the content below. Please help with this as well. (Note: The address details are on consecutive lines, one after another.)
It is client-protected data, which I am not able to share, so I have only given a sample of what it looks like. It will be in Notepad only.
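Since the real data cannot be shared, here is a minimal sketch on made-up text, not a definitive solution. It assumes the text read from the PDF keeps the address in the left-hand column and separates it from the table values by two or more spaces (or a tab); that layout is a guess and should be checked against the actual output. The sample text and the helper name `left_column` are hypothetical. The sketch is in Python for illustration only; the split pattern itself is plain regex syntax.

```python
import re

# Hypothetical sample: made-up text standing in for the extracted PDF text.
SAMPLE = """\
John Doe            Item      Qty
12 Example Street   Widget    2
Springfield 12345   Gadget    1
                    Gizmo     3
"""

# Assumption: columns are separated by two or more whitespace characters or a tab.
COLUMN_GAP = re.compile(r"\s{2,}|\t")


def left_column(text: str) -> list[str]:
    """Return the left-most column of every line that has one."""
    result = []
    for line in text.splitlines():
        # Skip blank lines and lines whose left column is empty
        # (they start with whitespace, so they hold table values only).
        if not line.strip() or line[0].isspace():
            continue
        # Split once at the first column gap and keep the part before it.
        first = COLUMN_GAP.split(line, maxsplit=1)[0]
        result.append(first)
    return result


address_lines = left_column(SAMPLE)
print("\n".join(address_lines))
```

If the address and the table values are separated by only a single space in the real output, this approach will not work as written, and the split rule would need to be based on something else, such as a fixed character position.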