Hi @upendra_koneru,
This use case is tricky. The example provided is dependent on how well the RegEx pattern is crafted by you. Since I do not have the pdf file, you have to provide anchors and ensure the correct page boundaries are identified - not just the key words.
The idea:
- Read pdf page by page with activity Read PDF Text
- Search the text string with activity IsMatch
- If a match (Boolean) is found, add a datarow containing the search text and starting page number
- increment page number
- repeat step 2
- If the second page is read, update the previous datarow ending page number
- When last page is read, update the datarow ending page number
- Finally Extract PDF Page Range to extract the pages.
Note: activity Assign Regex Pattern is to replace a space with \s for regular expression to work correctly. You will need to change it accordingly for the text you are searching
The example contains a sample pdf which you can test to verify the workings…
PDFExtract.zip (102.5 KB)