Hi, I’m trying to search a PDF that opens in an IE tab. I’ve tried finding the keyword I’m searching for using Find Text, I can’t use Find OCR text because it’s a multiple page documents and it would be way too inefficient, and the only way I’ve found that works is saving the PDF then using Read PDF with OCR, which takes forever…
Using Ctrl + F to bring up the Find box and typing the phrase into it works great, but I can’t get the robot to type into the box at all. It works if I step through manually, but any time I try and run normally it can’t find the text box.
The Type Into activity is in an Attach Window activity with a selector of
The selector for the Type Into activity itself is
I cannot share the workflow because it contains information I can’t share, but any help or advice is much appreciated!
This looks contrast @WitzgaWJ as you say you want to use read pdf ocr to get the whole pdf as a text, but this looks like we are looking for a word in pdf
I want and need something much faster than saving the PDF from IE to the computer then using Read PDF with OCR to find a 2 word string. This is the current state and it routinely takes over 30 seconds which is not acceptable if there is any faster way.
I only care about the presence/absence of the string so I’m trying to use Find to establish that. I do not need to extract any information, just see if the string is there.
Fantastic
Then rather to get into a application, applying ctrl+F, might some other miss out while processing,
While Read pdf ocr doen’t need application either which is one of the plus points where can run this activity in many machine without application…and is the most reliable and faster way when compared to using find text…
So to be more faster, we can use API but the web application we use must have that option
so i would suggest to go either with read pdf ocr or api
Hope this would help you
Kindly try this and let know for any queries or clarification
Cheers @WitzgaWJ
I don’t know if this makes a difference but the PDF is not open in Adobe Reader, it is opened as a tab in IE. I tried to find a way to access Change Reading Options but I’m not finding it.
Fine try with API call @WitzgaWJ
Hope that web application has such option
Check whether it takes REST or SOAP api and use the accordingly
I tagged a link for this in the previous comment
Cheers @WitzgaWJ
if you go to Task Manager you will see that probably “Acrord32.exe” process is running. It’s only shown in IE because that file is on some remote server ( if you look into developer console you will probably find some href ) and if you click “esc” you will see Adobe options and interface.
So even it’s opened in IE, Reader is doing all hard work and all reading settings are loaded from Adobe reader program. If you want to change this you need to open Adobe and go to preferences and change the settings.
I had couple very similar situations and this helped me.
I did use “Esc” to see the Adobe options/interface, but nowhere in there can I find any option to access Change Reading Options. There is no “Edit” menu present as far as I can see. I’ve included a screenshot of the Adobe options and interface I am seeing.
My mistake, I didn’t explained well. Close IE and please just open Adobe Reader from Start and change reading options there. And then when you open that file again on IE it should load that settings that you made in Reader (at least that was my case).
So I’m trying your suggestion and it worked once when I got a popup on opening the PDF and I selected the right options, but since then it hasn’t been. I’ve tried to select the right options from the link you provided earlier but Adobe just keeps changing them back to the “Tagged reading order” that’s causing the issue.
Oh yes! I strugled with that for some time and at the end I finally ended making separate workflow for resetting all settings for IE everytime process starts.