I am working on a project where the robot has to read PDF files downloaded from a web page, they contain various bookmarks and I only need 2. So, from a specific bookmark, it has to find the page number where the text of the bookmark is located that I need and then just save that piece of text.
Sorry but I’m new and I don’t know how to do it
Could you kindly orient me?
Thank you so much for your answer!
So, there are two possibilities.
The robot could find two PDF files, how?
When it downloads the PDF file and opens it (I use acrobat reader) in the bookmarks, it must look for the one called: “Soci”, if there is then it should save the page number in which it found it and all the text of the same.
Otherwise, if that bookmark did not exist, then it must look for the bookmarks: “titolari di cariche e qualifiche” and “informazioni sullo statuto” and do the same thing, that is, save the page number they are on and all the text.
I think there is an activity to get the pdf pages count,
Now get the pdf pages count and store in a varaible
Now loop through the numbers of pages and read the pdf page by page, after reading PDF for one page , check whether the word is coming in that string or not, if it is there u can exit out of loop and then store the data as well.
Same way u can build for other conditions as well here
I just realized that I can share images, then this would be my pdf file.
From which I have to extract only the “Soci” bookmark for example.
I tried to read page by page with the robot and then, for each page, look for if that text contains the keyword "Soci but it didn’t work.
I also tried with the “is match” activity, yet with “myText.Contains” but nothing, I’ll try yet another way.