How to separate or split a PDF page if text is detected?

Jelrey · May 25, 2021, 8:34am

Here, I loop through the PDF pages and have added a condition: if a text is detected on a page, do something. How do we split out and separate the page that contains the text? For example, in the screenshot below, if the result is true, then get that PDF page and save it, but only that page. Thanks

thewmak · May 25, 2021, 10:12am

I don’t think you can tell which page the text is on based on the text output from “Read PDF With OCR”.

Some workarounds that I can think of:

A. Split the PDF into pages first using the “Extract PDF Page Range” activity. Then read the PDF text page by page. Delete the pages that don’t meet your requirement.

B. Find some indicators within the text output that can indicate which page the text is on (e.g. page X of X). Then extract the PDF pages accordingly.

C. You may also try the “Document Understanding” module if you have access to it, although I don’t think it will be better than approaches A/B.
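Workaround B could be sketched roughly as follows, in Python rather than as a UiPath workflow. This is only an illustration: it assumes every page's text ends with a footer of the form "Page X of Y", and the names `PAGE_MARKER`, `pages_with_keyword` and `sample` are made up for the example. Real OCR output may use a different marker or none at all.

```python
import re

# Assumed footer format; real documents may use a different marker.
PAGE_MARKER = re.compile(r"Page\s+(\d+)\s+of\s+(\d+)", re.IGNORECASE)


def pages_with_keyword(full_text: str, keyword: str) -> list[int]:
    """Return the 1-based page numbers whose text contains keyword.

    Assumes each page's text ends with a "Page X of Y" footer, so the
    text between two consecutive footers belongs to a single page.
    """
    hits = []
    start = 0
    for marker in PAGE_MARKER.finditer(full_text):
        page_text = full_text[start:marker.start()]
        if keyword.lower() in page_text.lower():
            hits.append(int(marker.group(1)))
        start = marker.end()
    return hits


# Hypothetical text output for a three-page document.
sample = (
    "Invoice summary\nPage 1 of 3\n"
    "Total due: 120.00\nPage 2 of 3\n"
    "Terms and conditions\nPage 3 of 3\n"
)
print(pages_with_keyword(sample, "total due"))  # prints [2]
```

The returned page numbers can then be passed to whatever performs the page extraction.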

kadiravan_kalidoss · May 25, 2021, 10:18am

Can you provide the sample PDF, @Jelrey?

prasath17 · May 25, 2021, 10:22am

@Jelrey - As I mentioned in your previous post, the PDF splitter will only split the page when it is a match …(index+1).tostring. Here the Index variable comes from the For Each loop output…

Please check the previously shared code; I have tested both cases and everything is working fine.
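For readers outside UiPath, the same idea (loop over the pages, test each page's text, and turn the zero-based loop index into a page number with index + 1) might look like this in Python. It is not the code shared in the earlier thread. It assumes the third-party `pypdf` package, and the function names and output file naming are invented for the sketch. `extract_text()` only reads an existing text layer, so scanned pages would still need an OCR step first.

```python
from pathlib import Path


def matching_page_numbers(page_texts, keyword):
    """Return 1-based page numbers for pages whose text contains keyword."""
    needle = keyword.lower()
    return [
        index + 1  # the loop index is zero-based, page numbers start at 1
        for index, text in enumerate(page_texts)
        if needle in (text or "").lower()
    ]


def save_matching_pages(pdf_path, keyword, out_dir):
    """Write each matching page to its own single-page PDF; return the paths."""
    # Imported here so the helper above works without pypdf installed.
    from pypdf import PdfReader, PdfWriter

    reader = PdfReader(pdf_path)
    texts = [page.extract_text() for page in reader.pages]
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)

    written = []
    for page_number in matching_page_numbers(texts, keyword):
        writer = PdfWriter()
        writer.add_page(reader.pages[page_number - 1])
        target = out_dir / f"page_{page_number}.pdf"  # hypothetical naming
        with open(target, "wb") as handle:
            writer.write(handle)
        written.append(target)
    return written


print(matching_page_numbers(["cover", "Invoice No. 42", None], "invoice"))  # prints [2]
```

Only pages that match are written, so nothing has to be deleted afterwards.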

prasath17 · May 25, 2021, 10:26am

@Jelrey - I would suggest waiting until you have tried it and made sure everything is working before closing this thread.

Jelrey · May 25, 2021, 10:28am

Done bro

prasath17 · May 25, 2021, 10:28am

Meaning it is working now? I.e. the issue is solved?

system · May 28, 2021, 10:29am

This topic was automatically closed 3 days after the last reply. New replies are no longer allowed.
