Generative Extraction & Classification using Document Understanding in cross-platform projects - Public Preview

Monica_Secelean · August 14, 2023, 3:37pm

@balaraman.ramiya updated the note to make it clearer, hope it’s better now

Monica_Secelean · August 14, 2023, 3:39pm

@islam.spaho very strange, thanks for reporting, looks like a small bug - we will try to reproduce it/check what’s wrong - if you can think of any tips of us, do let us know

Monica_Secelean · August 15, 2023, 2:30pm

@islam.spaho we cannot reproduce the issue would you mind scheduling a call with me, to go through your issue? please fw me a meeting invite at monica.secelean@uipath.com

ababab2828 · August 17, 2023, 3:19pm

May I know if is there a way to find out the question (i.e., value) in the Classify Document Activity?
What is the name of the dictionary holding the Key-Value pair in the Classify Document Activity?

mlellison · August 21, 2023, 9:50am

Hi @Monica_Secelean ,

May I ask what is the suggested prompt for table processing?

I used below prompts and all result in empty value.

1.) Extract all data in table including header in json format
2.) Extract all data in table including header in json format (Qty, Description, Unit Price, Amount)
3.) What are those value in table

g.ward · August 22, 2023, 6:11pm

Hi,

This is an exciting package, having some real fun with it in studio. However, when trying to test it out unattended on some data, with the older DU ML activities pack also installed on the latest preview, I reliably get this error:

Could not load file or assembly ‘UiPath.DocumentUnderstanding.Persistence, Version=6.12.0.0, Culture=neutral, PublicKeyToken=null’. The system cannot find the file specified.

System.IO.FileNotFoundException: Could not load file or assembly ‘UiPath.DocumentUnderstanding.Persistence, Version=6.12.0.0, Culture=neutral, PublicKeyToken=null’. The system cannot find the file specified. at UiPath.IntelligentOCR.Activities.CreateDocumentClassificationAction…ctor()

Unless Iremove the new DU 2.3.1 package, in which case it works fine.

Is this intended?

Monica_Secelean · August 25, 2023, 11:04am

@ababab2828 not for the moment - what would be your use case/why would you need it?

Monica_Secelean · August 25, 2023, 11:05am

We don’t yet have a way for handling tables - for this particular use case, we currently recommend using specialized models.

Monica_Secelean · August 25, 2023, 11:08am

@mlellison maybe you try modifying the prompts to questions, like: Can you extract the table data in a json format? I can’t tell for sure that it will work, but worth giving a try (truthfully, the solution works for some tables, while it fails for others)

Monica_Secelean · August 25, 2023, 11:11am

@g.ward we recommend the DocumentUnderstanding package to be used on its own - not in combination with the IntelligentOCR & ML package; that said, it shouldn’t actually break the workflow if used together - it’s just that the framework of the 2 differs very much and we plan investing mostly in the DocumentUnderstanding package.
Maybe try a workflow using only the DocumentUnderstanding package? or is there anything stopping you from (missing feature, probably, as we work towards adding those!)
Looking forward to hearing from you!

ababab2828 · August 25, 2023, 2:50pm

Perhaps to find a way to document the question that generates the document classification for a specific Document Type. This will help to look back from the output panel on how to improve the question for classifying that type of document if the result does not meet the expectation when processing a stream of different types of documents. Otherwise, one has to go back to the Classify activity to check into the prompt.

mlellison · August 29, 2023, 7:55am

Thanks @Monica_Secelean

May I ask where can we find updates about the fix on table processing?

For those who are interested in table extraction, I have followed below video and able to extract table data (The prompt gives good accuracy, just sometime missing the last datarow)

g.ward · August 29, 2023, 2:44pm

Hi Monica,

Thanks for the reply.

You are correct, its to augment an existing old process. I’ll look into how easy it would be to convert the old stuff into the new package.

In terms of other features, will the classifier ever return multiple results? e.g. if it was 80% sure on one category and 60% sure on another? Alternatively, could it return page by page results for when we have multiple documents stitched together? Currently we have a nasty pdf split algorithm to deal with such things.

Thanks,

Gareth

Monica_Secelean · September 4, 2023, 2:38pm

@ababab2828 you are right in the sense that, you would need to test & modify your prompts until the proper results are achieved. If you want some reference to the prompts, would you mind saving them as variables and provide the variables as prompt input to the Classify Document Activity?
Something like:

Document Type Name: Invoice
Prompt: invoicePrompt
where invoicePrompt is a variable where the prompt is persisted

While you cannot do this yet, we can think about enabling it if it helps your use case - what do you think?

Monica_Secelean · September 4, 2023, 2:41pm

@mlellison there are no updates with regards to the activities - I was just suggesting you use different prompts and see how it works, because officially, we don’t yet provide table support for generative extraction - although we are looking into it for the moment, we recommend using specialized models for table extraction

Monica_Secelean · September 4, 2023, 2:47pm

Hi @g.ward - we are looking into how we can provide a migration path from the old to the new package - but as the new package is still catching up on feature parity, it will take us a while until we can do so

For the moment, the classifier returns one result only - but we are looking at providing splitting capabilities to it soon
I’m sorry I don’t have better news - we are working on cool stuff though and will be happy to update you once we’re ready to release ! Till then, please keep the feedback coming - it helps shape the product

Monica_Secelean · September 4, 2023, 7:27pm

@islam.spaho any idea, are you using a community or an enterprise account?

ababab2828 · September 13, 2023, 1:25pm

This sounds good and will definitely help!

Ajay_Wadhawan · September 15, 2023, 6:56am

This is good. However organizations are concerned about security aspect of using Gen AI. Is it safe for organization data.

Joshua_Allan_0 · September 18, 2023, 7:52am

Trying to use the extract and when running in debug I get the following error

Extract Document Data: Unable to cast object of type ‘UiPath.IntelligentOCR.StudioWeb.Activities.DataExtraction.DocumentData1[UiPath.IntelligentOCR.StudioWeb.Activities.SWEntities.CustomGptDocumentTypeC737D34CB07B4D25AceeB7E566F41F35.Bundle.CustomGptDocumentTypeC737D34CB07B4D25AceeB7E566F41F35]' to type 'UiPath.IntelligentOCR.StudioWeb.Activities.DataExtraction.IDocumentData1[UiPath.IntelligentOCR.StudioWeb.Activities.SWEntities.CustomGptDocumentType7D89Cc4075Fc4A929517Fdb1E3Bce64D.Bundle.CustomGptDocumentType7D89Cc4075Fc4A929517Fdb1E3Bce64D]’.

This is developed in Studio (Desktop) and not sure where I am going wrong with this?

Topic		Replies	Views
UiPath Community 2023.10 Release - Document Understanding Product News	2	1089	November 15, 2023
Process PDF Files, Classify Documents & more with new Document Understanding Activities in Studio Web Product News document_understanding , studio-web	5	1646	April 7, 2023
Not able to find UiPath.DocumentUnderstanding.Activities in manage packages Activities activities , question , document_understanding	7	382	October 9, 2023
UiPath Community 2024.4 Release - Document Understanding Activities Product News document_understanding , document_processing	2	595	May 9, 2024
Data Extraction using Document Understanding on Studio Web Studio Web document_understanding , uipath-drafts , data-extraction , studio-web	2	1323	March 24, 2022

Most Active Users - Yesterday
ashokkarale
Yoichi
vineelag
Arvind_Kumar1
asshiyuta
J0ska
Foxtrek_64
Murali_Boni
arivu96
SenzoD
More details...

Generative Extraction & Classification using Document Understanding in cross-platform projects - Public Preview

Related Topics