Split PDF file into many files based on a specific text. PDF file consists of images

Hello @mahsa.mohk - have you tried the new PDF activities? You have a new Extract PDF Page Range activity, which can get arguments such as “1-4” (first four pages), or “5-END” (all pages from page 5 to the end of file)… This might help!

You can decide where to split by using the “Digitize Document” activity, and try to find the word you are searching for either in the DocumentObjectModel object, or in the Text version (and then search in the DocumentObjectModel where that index appears to identify the page). This is indeed a little bit of custom code, but it shouldn’t be too complicated…

2 Likes