It may be that I’m misunderstanding the purpose of Click OCR Text. As I’ve watched some videos etc it seems that it’s intended to find the UI Element that the text is in, and click that. But for me the UI Element is just the entire window, so that’s what it clicks.
I need to find the pixel coordinates of the text inside the window (Citrix, so basically just an image), and click at those coordinates.
You can try and see if CV works for your case. It will largely depend on the OCR engine (Omnipage ocr is a good option among free ones. UiPath Screen Ocr if community)
I’m not sure what your screenshot is supposed to be showing me. How would I use CV to click the text “100000”? Keep in mind, the text won’t always be 100000. That’s just an example, it’ll be a different value for each transaction.
I tried the Click OCR Text activity as well. It seems to be clicking the text itself and not the element. In this case, the snipping tool window is the element and the Red dot is where the bot clicked.
Clicking the text is what I want. For me it just clicks the middle of the window. Can you post info on how you configured the Click OCR Text? I sort of thought Citrix is why these kind of activities exist. It would be a big road block if this won’t work in Citrix.
Do you know of a way to get the x/y coordinates of the text into variables or something, and then I could use those as offsets for a click against the window itself.
OK this was driving me nuts. I tried exactly what you did and it still didn’t work.
Then I thought…hmmm maybe because I’m developing on a remote machine through the VMWare Horizon client.
So I tried it on my local desktop and it worked just fine. Which means there’s a problem trying to use Click OCR Text in a remote window (not sure if Horizon is RDP, or what).
When you hover over an element when indicating the selector, this little UiPath window should display exactly what you’re hovering over. Sometimes that’s not the case but is offset, causing displaced hardware clicks. I’ve had that issue before, to fix the resolution have to signout and sign back in.
the workflow screenshot I posted above is not static, it’s getting the region of the ocr text dynamically and clicking on it based on those coordinates.
This isn’t a displacement issue. It’s clicking exactly in the middle of the window. If I change it to top left or anything else, it follows exactly that.