I’m trying to simply extract data from an already OCRed PDF file and I’m having some troubles. At the moment I’m using read pdf text-append text-text exists. My goal is to read the pdf text contents, write it to a word file (or text file, whichever works better), than check if it contains specific content. When I run this, It seems to read the pdf forever and never actually finishes. The other day I ran it then went out for a 30 minute walk and came back and it was still running, no error messages or anything. The PDF isn’t that big as well, it’s only one page. Anyone know why this is happening?
I’m not sure how you see that? I didn’t get any message saying which activity was running. The second activity is the append text from the word activities.
Another thing to note is that I just tried the exact same sequence with another pdf with some lorem ibsum, and it works perfectly and takes 1 or 2 seconds. So maybe it’s just the pdf itself? but either way how else would I get around this…