AI Computer Vision for generic automation

Our team has requested to check whether we can create generic automation using AI Computer Vision. Can we develop a single automation solution for different web applications?

Suppose two screens in the attached image are from two different web applications. I need to create a single automation solution to fetch details such as Name, Age, and Address. There may be more applications.

Is this achievable using AI Computer Vision? If it is achievable, can anyone please help me with an example solution?

@dipin.puthusseril

it can work with little changes but might not go with one single..as it uses a backend ml model too which would try to identify based on position and text and relative visual representation as well

few it can ..but might not do all

its more a trail and error you need to go with

cheers

1 Like

Hi @dipin.puthusseril

I don’t think you would be able to just do with standard automation for both websites.

Positioning of the elements are different on both sites.

But you can give a try to fetch descriptors for the elements and compare.

You can have different flows for both sites in one automations itself. Like for websites1, follow sequence A, for website 2, follow sequence B

1 Like

Thanks @Anil_G and @sonaliaggarwal47 for the suggestions. It was helpful!

I have few more queries related to AI Computer Vision. Thanks in advance!

Can we make a dynamic solution for the application in below screenshot using AI Computer Vision? Is there any option to pass dynamic values for Currency and return Amount and Change using AI Computer Vision? This might be possible using usual web scraping methods, but this query is specific to AI Computer Vision.

@dipin.puthusseril

Ideally web scrapping would do but not sure why youwant to use ai only

One way you can do is using kind og anchor and add variable in text in descriptor and also need to paas coordinates too by finding the relative co ordinates of the currency you found

Cheers

1 Like

Hi @dipin.puthusseril

Yes, you can provide dynamic values for these in your descriptors.

We are also using the dynamic descriptors for our CV process.

For e.g, in below example, we are passing a variable from queue item as text. Similarly you can provide for anchor as well if you need.

Simply replace static text with “+variable+” inside single quotes ‘static text’

“Target: Text '”+in_TransactionItem.SpecificContent(“Licence Class”).ToString+“’ (421,416,240,14)

Hope this helps.

Regards
Sonali

1 Like

Thank you all for the valuable information and help!

I am new to UiPath, so my questions might be irrelevant.

Can anyone please help me understand more about Computer Vision by answering the following questions?

  1. If an application can be fully automated using both normal web scraping methods and Computer Vision, is there any benefit in using Computer Vision over normal web scraping?
  2. Can you provide a scenario where Computer Vision can be used over normal web scraping to automate an application, even if it is not in a Citrix or remote environment?
  3. If the application frequently has slight changes in the colour or position of elements, which method would be more beneficial: normal web scraping or Computer Vision?

Additionally, it would be great if you could add more points to help me understand the benefits of using Computer Vision in a native system (not in Citrix or remote environments).

@Anil_G @sonaliaggarwal47

Computer vision is more tolerant than web scraping methods. If selectors change computer vision is more stable. Therefore, I would even often recommend to use computer vision.

Further, it is easier to automate with computer vision than identifying fitting selectors. Finding the right selector can be tricky sometimes.

The field also shifts more and more to Computer vision. UiPath has introduced a self-healing agent, which is aligned with computer vision.

If the position changes than computer vision is better. If the colour changes both will work, if you do not name the colour in the web scrapping selector.

1 Like

@dipin.puthusseril

In all above scenarios normal web scrapping is bwtter

Cheers

1 Like

@dipin.puthusseril yes, it is as @Anil_G says. For dynamic use cases normal web scrapping is better.

My answer was only related to your questions related to computer vision vs. web scrapping in general.

1 Like