How to import a dataset and retrain a Document Understanding model without using Data Manager or Orchestrator to create or import the dataset

Ajay_Kumar10 · August 21, 2023, 1:19pm

I’m working on Document Understanding to parse invoice PDFs in different formats, so I want to retrain the UiPath invoice model.

Please help me import the dataset and retrain the model without using Data Manager or Orchestrator for the dataset. I want to import it locally through UiPath Studio.
@ppr @nisargkadam23 @RAKESH_KUMAR_BEHERA @Vibhor.Shrivastava @mukeshkala @sandeep13

supermanPunch · August 21, 2023, 1:32pm

Maybe you need to auto-export the validated data back to the dataset. If so, you could check the post below:

If the above doesn’t clarify things, could you give us some more information on the actual requirement?

Ajay_Kumar10 · August 25, 2023, 10:07am

Hi @supermanPunch,
I want to retrain the ML model. I created the dataset and imported the human-validated Validation Station training data through the Machine Learning Extractor Trainer. After that, I created and ran an ML pipeline, but it is failing.

supermanPunch · August 25, 2023, 10:35am

@Ajay_Kumar10 ,

How was the pipeline configured? Which dataset was selected?

We need to run the Export feature in Document Manager (or schedule it), so that the fine-tune folder data is combined with the previous data and stored in the export folder. We then need to select the export folder as the dataset for the pipeline and set the auto-retraining parameter to true.

The second point in the topic suggested in my previous post mentions this:

muthuerd · September 5, 2023, 8:56am

@Ajay_Kumar10 -
There are two options for retraining.

Manual:
Once the documents are validated in Action Center, keep them in a local/shared drive folder; this creates three folders. Zip all three folders, import the archive into Data Manager, then export it to the dataset and start the pipeline.
Auto retraining:
You have to schedule this in Data Manager and in the pipeline.
Manual is recommended so that you can keep track of the retraining data.
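
The zipping step in the manual option could be scripted. Below is a minimal Python sketch, assuming the validated output sits as subfolders of a single local folder. The function name `zip_training_folders` is hypothetical, it simply archives every immediate subfolder without assuming their names, and the import into Data Manager is still done by hand afterwards.

```python
import zipfile
from pathlib import Path


def zip_training_folders(source_dir, archive_path):
    """Zip every immediate subfolder of source_dir into one archive.

    Returns the sorted list of top-level folder names that were added.
    Loose files directly inside source_dir are skipped, and empty
    folders produce no archive entries.
    """
    source = Path(source_dir)
    folders = sorted(p for p in source.iterdir() if p.is_dir())
    with zipfile.ZipFile(archive_path, "w", zipfile.ZIP_DEFLATED) as archive:
        for folder in folders:
            for file_path in sorted(folder.rglob("*")):
                if file_path.is_file():
                    # Store paths relative to source_dir so each folder
                    # sits at the root of the archive.
                    archive.write(
                        file_path,
                        file_path.relative_to(source).as_posix(),
                    )
    return [folder.name for folder in folders]
```

Calling `zip_training_folders` with the folder that holds the three folders and a target `.zip` path should return the three folder names, which is a quick check that nothing was missed before importing the archive.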

Let me know if you have any questions.


