Hello everybody, I am working on a project where the robot has to read PDF files downloaded from a web page, they contain various bookmarks and I only need 2. So, from a specific bookmark, it has to find the page number where the text of the bookmark is located that I need and then just save that p…

Hi @Araceli91 welcome to uipath communiity well it is possible to extract the data directly using regex or string manipulation. If you can share the details regarding the data u need to extract i can help you with the regex pattern

Hello @NIVED_NAMBIAR Thank you so much for your answer! So, there are two possibilities. The robot could find two PDF files, how? When it downloads the PDF file and opens it (I use acrobat reader) in the bookmarks, it must look for the one called: “Soci”, if there is then it should save the page…

u mean save the all text in that particular page @Araceli91

@Araceli91 - please take a look at this post [image] In pdf Get the string avilable page number in to excel Robot @vijayabhaskar1987 - here you go… FindPDFPageNo.zip (764.9 KB) In this attached example, I have searched for the word “Certificate” in my pdf which is availab…

Yes, in the sense that I should take all the text of that specific page.

Thanks, I’ll take a look at the post. :slightly_smiling_face:

Hi @Araceli91 Then u can try this idea as well I think there is an activity to get the pdf pages count, Now get the pdf pages count and store in a varaible Now loop through the numbers of pages and read the pdf page by page, after reading PDF for one page , check whether the word is coming in th…

I finally tried with your suggestion, but it doesn’t work for me. I’m trying another way, if I can make it all work I’ll be happy to share it with you. :grinning:

[image] I just realized that I can share images, then this would be my pdf file. From which I have to extract only the “Soci” bookmark for example. I tried to read page by page with the robot and then, for each page, look for if that text contains the keyword "Soci but it didn’t work. I also tri…

Hi @Araceli91 Can u show thw page where the soci word is found Also please check whether the string data read from pdf contains Soci word by writing to text file

Take a specific piece of text and the page it is on, from a PDF file

Help Activities

prasath17 (Guru) June 20, 2021, 2:22pm 5

@Araceli91 - please take a look at this post

Topic		Replies	Views
How to search particular text in pdf and extract footer number of that text page? Studio pdf , get-text , pdf-extraction , ismatch , search-for-activity , footer	7	1556	May 27, 2022
Extract a particular Page data from multi page PDF document Help studio	6	3293	April 11, 2019
Get specific page number based on keyword from PDF files Help pdf , activities , question	11	4245	January 8, 2021
Extract PDF oages contain specific text Activities pdf , activities , question	5	1937	October 26, 2022
Page number from pdf Activities pdf , question , pdf-extraction	6	1123	July 19, 2023

Take a specific piece of text and the page it is on, from a PDF file

Related topics