Document Understanding - Train Data

Ashmi_Uththama_Handunge · January 21, 2025, 6:37am

Hi,

I developed a project to extract data from Invoices. I used machine learning extractor and machine learning extractor trainer. I want to train the data based on human feedback. I built a project in AI Center and create a ML Package, ML skills using Out-of-the-box > Invoices model. And then I labelled a dataset using schema. And I create a pipeline with enabling auto retraining.

Then I need to know , when human validate some data , how it trained? Can we re -trained the pre - trained model with our new data?
What happened in data labelling ,if i add a new regular field which is not in the schema ? Can I add a new regular filed?

With our new human validated data , will it retrain the full model or will it retain only the dataset we gave?

adi.mehare · January 21, 2025, 8:17am

Follow above link for retraining Invoice model, let me know if you need any help.

Ashmi_Uththama_Handunge · January 21, 2025, 9:22am

Thank you very much. And can we download the datasets in public endpoints and train?

Anil_G · January 22, 2025, 7:18am

@Ashmi_Uththama_Handunge

so this is how the flow goes

Create a dataset in ai center
use the dataset while training and creating the skill…enable auto retrain and auto upgrade for ml skill
now in process when a new validated data comes in upload the file to dataset created in step1
now as the retraining is enabled when next retrain interval comes all the documents present in dataset will be used for training and new model ans skill is created…which gets auto consumes anyways and this cycle continues
now if you want to add new fields then you need to change the taxonomy as well and also label and train all the documents for new field as well

Hope this helps

cheers

Topic		Replies	Views
Retraining the model in Uipath Document Understanding AI Center question , document_understanding , ai_center , uipath	1	152	January 18, 2025
Retrain any uipath public end point without AI center AI Center question , ai_center	4	596	December 19, 2022
Can we manually train it and add other fields that can be extracted from an invoice Document Understanding activities , question	3	1274	June 18, 2020
Train Invoice ML Model with Validation Station data AI Center question , ai_center	2	1851	May 19, 2022
Classifier retarining Orchestrator orchestrator , question , ai_center	1	277	August 10, 2023

Document Understanding - Train Data

Related topics