Generative Extraction & Classification using Document Understanding in cross-platform projects - Public Preview

Overview

Note: As with all preview features in Studio Web, this functionality is only available for Community accounts and not accessible for Enterprise - to try it out without a community account, feel free to download the DocumentUnderstanding.Activities 2.3.1-preview package in Studio.

We understand that classifying and extracting information from diverse document types can be challenging and time-consuming, especially when dealing with custom or unstructured use cases. Therefore, we are excited to introduce generative capabilities in our Document Understanding Activities package - having the Classify Document and Extract Document Data Activities now empower users to input prompts based on which to either classify the document or extract data from it, making the process more intuitive, efficient, and flexible! :dancer:t2:

In this way, when working with either activity, users have the option to select the Predefined Project and the “Generative Classifier” or “Generative Extractor” as model to work with - requiring the input of key-value pairs, where the user can provide his prompt as input as sampled below:

The prompt will then be sent to a large language model, together with data from the document, to classify or extract the required information which will then be consumed in the workflow.

Note: At this time the generative model is not retrainable and as always, all data handling adheres to our standard terms of service.

Documentation

How to Get Started

Pre-requisite: DocumentUnderstanding.Activities - min. version 2.3 package
Simply create your cross-platform workflow in your preferred Studio environment and when using either Classify Document or Extract Document Data:

  1. Select the Predefined project
  2. Select the Generative Classifier or Extractor
  3. Provide your prompt as key value pairs, where:
    • key will be the Document Type (e.g. CV) or Field Name (e.g. email)
    • value will be a description for determining either of them (e.g., CV containing candidate skills and experience or email address of the document)
      When running your workflow if using the Validation Station, one can see why the extractor has selected a particular answer for a field.

Limitations

Table processing may not always lead to the best results - we’re working on fixing this, so if you encounter issues, please shout out

Charging

Charging will happen based on AI Units - we don’t have this finalized yet, but we will update you here, once we have all details in place.

Do reach out if you give Generative Classification or Extraction capabilities a try and let us know how it’s going! What are we missing? What would you like to see further? Looking forward to your thoughts! :dancer:t2:

14 Likes

Having followed the ‘How to Get Started’ instructions the generative extractor doesn’t appear to be there for us.

We are on the latest version of Document Understanding activity available too.
image

Can you provide any guidance on how we can get access to this?

1 Like

@Joshua_Allan_0 the feature is part of the 2.3 activities package - I do not see it in your list of packages, can it be that you didn’t check the “include prereleases” checkbox? So that you can also see packages which are not GA, but preview :slight_smile:

3 Likes

Ahh problem solved. Thanks!

I wonder if this activity is also available in Studio Web?
Kind regards,
Emil

@ebeloglavec yes, absolutely! :slight_smile:

@Monica_Secelean - In the Extract Document Data activity, I don’t have the option to select from the drop down. It’s basically empty. Anything I might have forgotten?

image

can you make sure you have this installed and let us know if it’s still not working? :slight_smile: I’ll modify the forum post in the meantime :slight_smile:

Still not working :confused:

@islam.spaho what Studio version are you using?

Is there a setting that must be adjusted because we do not see Generative Extractor in Studio Web?

Thank you very much for your help.
Emil

i’m using Studio 2022.10.3

@ebeloglavec is this an older workflow of yours? ideally, for new workflows, it should be there (as they would reference our latest package version)

@islam.spaho looks can you retrieve any projects in EDD? if not, are you connected to your orchestrator instance?

Even in a new workflow the Extractor does not show up.

Hello Monica,
This mean the model (Trained by UiPath) and managed by UiPath?
Regards,
Balram.

Very cool, working on my end!

1 Like

I’m connected to Orchstrator and the dropdown is empty. No other Projects to be selected. The funny part of the story is that i can select a project in Classify Document activity and not in EDD. ^^

@ebeloglavec any chance you can export the wf and send it to me at monica.secelean@uipath.com?