Generative extraction & classification in Document Understanding Cloud APIs

Monica_Secelean · January 16, 2024, 1:46pm

We’re happy to announce that we are supporting generative extraction & classification capabilities in our Document Understanding Cloud APIs!
Do you want to process unstructured documents? Extract information based on a prompt? Process documents without setting up a dedicated specialized model?
Leverage our generative capabilities!
With our latest release, these capabilities are available as a generative extractor and a generative classifier, so you can perform prompt-based classification and extraction through simple API calls.

Generative Classifier
To use the Generative Classifier, discover it as part of the Predefined project, then consume it by providing the input prompts that identify the required Document Types (either synchronously or asynchronously).

Generative Extractor
Similarly, to use the Generative Extractor, discover it as part of the Predefined project, then consume it by providing the input prompts for the fields to be extracted (either synchronously or asynchronously).
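For the asynchronous route, a common pattern is to start the operation and then poll for its result. The sketch below shows only that polling loop in Python. The `fetch` callable, the `status` key, and the status values `"Succeeded"` and `"Failed"` are assumptions made for illustration, not the documented API contract, so check the API reference for the actual response shape.

```python
import time
from typing import Callable


def wait_for_result(
    fetch: Callable[[], dict],
    interval_s: float = 2.0,
    timeout_s: float = 60.0,
    sleep: Callable[[float], None] = time.sleep,
) -> dict:
    """Poll `fetch` until it reports a terminal status or the timeout expires.

    `fetch` is a hypothetical callable that returns the current state of an
    asynchronous operation as a dict with a "status" key (assumed shape).
    """
    deadline = time.monotonic() + timeout_s
    while True:
        result = fetch()
        status = result.get("status")
        if status == "Succeeded":  # assumed terminal success value
            return result
        if status == "Failed":  # assumed terminal failure value
            raise RuntimeError(f"operation failed: {result}")
        if time.monotonic() >= deadline:
            raise TimeoutError("operation did not finish in time")
        sleep(interval_s)
```

In practice, `fetch` would wrap whatever HTTP call retrieves the result of the asynchronous classification or extraction request.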

So, what do you think? Make sure to give our new features a try & let us know your thoughts!

nisargkadam23 · January 25, 2024, 4:06am

How do we define the taxonomy while using the Generative Extractor?

zell12 · January 25, 2024, 5:13am

The prompt/question IDs will be the taxonomy fields themselves, I believe.

Ioana_Gligan · January 25, 2024, 1:52pm

@zell12 is right, the prompts will be the fields indeed.

@nisargkadam23 if you need a specific taxonomy, you can use the Generative Extractor that comes with the IntelligentOCR and DocumentUnderstanding.ML packages: define the taxonomy, configure the Generative Extractor within the IOCR Data Extraction Scope, and define prompts for whatever fields you need to grab using generative extraction.

For the APIs, you call the endpoint with specific prompts and get back the answers to those prompts.
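As a rough illustration of "the prompts are the fields", here is a minimal Python sketch that builds a request body in which each prompt ID doubles as the field name, and then maps the answers back to those fields. The key names (`documentId`, `prompts`, `id`, `question`) and the flat `answers` response shape are assumptions for illustration only; the actual request and response schema should be taken from the API reference.

```python
from typing import Dict


def build_extraction_request(document_id: str, prompts: Dict[str, str]) -> dict:
    """Build a request body where each prompt ID acts as the field name.

    The key names used here are assumed, not confirmed by this thread.
    """
    if not prompts:
        raise ValueError("at least one prompt is required")
    return {
        "documentId": document_id,
        "prompts": [
            {"id": field_id, "question": question}
            for field_id, question in prompts.items()
        ],
    }


def answers_by_field(response: dict) -> Dict[str, str]:
    """Map a hypothetical response {"answers": [{"id": ..., "value": ...}]}
    to a plain field -> value dictionary."""
    return {item["id"]: item["value"] for item in response.get("answers", [])}
```

For example, `build_extraction_request("doc-1", {"invoice_number": "What is the invoice number?"})` produces a body with one prompt whose ID, `invoice_number`, is also the name under which the answer would be read back.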

Ajay_Wadhawan · January 25, 2024, 10:45pm

Does it also have Action Center in the workflow for human validation, if required?

Lahiru.Fernando · January 28, 2024, 1:52am

With the APIs, we can get the response and pass the data to Action Center through the Create Validation Task activities.

Monica_Secelean · January 28, 2024, 8:45pm

@nisargkadam23 simply provide the prompt as input in the request.

AI_GPT · February 8, 2024, 3:45pm

What is the metering & charging for these API calls?

Monica_Secelean · February 14, 2024, 8:49am

@AI_GPT find details here: Document Understanding - Metering & Charging Logic

Sandeep_Alexander_Goni · February 27, 2024, 1:35pm

@Monica_Secelean these are great offerings by UiPath DU, and I have a question based on the features I have tried. The scenario I am particularly interested in is asynchronous generative extraction, and I would love to know if there are more examples of how this flavor of DU extraction is best handled through this new offering.

I am interested in the configurations because the asynchronous route is a complex one, and even with well-trained models the confidence levels of asynchronous data extraction are quite low.

Has anyone else tried this route and played around with the features? Insights are much appreciated, as we are in the process of building a POV for a use case that can use this method, but only if it’s viable.

@Lahiru.Fernando, @Syed_Pasha, or @zell12, do you or any other community members have insights or pointers here? Much appreciated!

Monica_Secelean · March 5, 2024, 4:28pm

@Sandeep_Alexander_Goni I don’t have insights about the feature usage that I can share, but do reach out if you face issues.

Sandeep_Alexander_Goni · March 8, 2024, 5:34pm

Thank you for the response, @Monica_Secelean.

AI_GPT · May 6, 2024, 3:32pm

@Monica_Secelean, I see the licensing model has been updated for the Modern and Classic experiences.

Can you outline the key differences between the Classic and Modern licensing models for DU?

Is migrating to the Modern model recommended for all users? If so, what are the benefits and any potential considerations?

If migration is recommended, could you please provide information on the process for users to transition from the Classic to the Modern model?

Since this update impacts user access to DU, any additional documentation outlining these changes would be extremely helpful.
