How to train same custom DU model multiple times?

shrey.shah · March 22, 2022, 5:22am

I have trained a ML document understanding model using my data (invoices). Now according to this video Training UiPath Document Understanding ML Models - Data Manager - Part 2 | RPA - YouTube he says to increase the accuracy either train on a large set of documents or train the same model multiple times (as he does in the video where he trained the same model multiple times and has different version of the same model i.e. 11.0, 11.1, 11.2 and so on!). My question is:

1.) Is it advisable to train the same model multiple times?

2.) If yes then how can I do it as I only have 10 documents trained. What minor version should I choose when setting up the training pipeline? Should it be 0 or the latest version?

suraj.setty · March 22, 2022, 5:31am

Hi @shrey.shah

It is recommended to add some more documents into your Data Manager and label them as you only have 10 documents, which will not help to improve the accuracy instead add more documents to the same Data Label and train them.

Please choose always the latest version for the Minor version for the pipeline run.

Make sure Data Labelling is done properly to achieve more accurate results.

“Label documents” Label documents

Hope this helps,
Thanks.

shrey.shah · March 22, 2022, 5:39am

@suraj.setty What you suggest for the minor version and what is suggested in this video UiPath Machine Learning Model Training - Best Practices | RPA | Artificial Intelligence - YouTube are at odds! Can you kindly clear the confusion? In the video it is suggested to always choose the minor version as 0 but you are saying the opposite!

suraj.setty · March 22, 2022, 5:44am

Hi @shrey.shah

If you want to train the latest Model created you can go with the latest version or if you want to train the starting model created you can go with the 0 version.

You can use any of the version deleting on the requirement, as I use the latest Model to train each time to get more accurate results.

You can follow the video if you want to.

Thanks.

shrey.shah · March 22, 2022, 5:51am

@suraj.setty I am confused. Let’s say I trained once and upgraded from 11.0(base version which only contains the default data) to 11.1 (base version data + custom data). Now if I want to retrain the same model, if I choose the minor version as 1 would it consider only the the custom data or also the default data?

Generalizing the question, what is the difference between choosing minor version as 0 and 1?

suraj.setty · March 22, 2022, 6:10am

Hi @shrey.shah

Your understanding is Correct once you train the base version say 11.0(base version which only contains the default data) to 11.1 (base version data + custom data).

Since the updated version 11.1 contains both the data , you can train that Version to improve the Model.

Lets say if you train 11.0 again which only contains the base model , there will be again discrepancy with the data.

Hope I was able to answer some doubts.

Thanks.

shrey.shah · March 22, 2022, 6:56am

@suraj.setty Can you provide any links to articles/blogs where I can read about the minor version?

suraj.setty · March 22, 2022, 7:43am

Hi @shrey.shah

There is no specific article explaining about the Minor Version.

Please have a look

Hope this helps

UIPath - Easily Retrain AI Starter Models for More Accurate Results | RPA Forum - Powered by Robiquity

Training High Performing Models (uipath.com)

Document Understanding - Auto-retraining of ML Models - News / New Features - UiPath Community Forum

Retraining AI Starter Models for More Accurate Results Is Easy | UiPath

Thanks.

ushu · March 22, 2022, 5:01pm

@shrey.shah In one of the doc UiPath mentioned that it is always better to train on the lower version

Also, use the below link on ML training, best practises

shrey.shah · March 23, 2022, 5:33am

@ushu What is the logic behind this? As mentioned in my previous comment if I train initially, the version will change from 11.0(base version which contains only the base data) to 11.1 (upgraded version which contains base data+custom_data). Now if I want to retrain on some new docs or on the same docs, I should select 11.1 as it contains more data and would result in better results! but according to the link you provided, it says other wise. Can you point out where am I wrong in my logic?

@suraj.setty Any help/guidance would be appreciated!

suraj.setty · March 23, 2022, 5:46am

Hi @shrey.shah

I would suggest to follow the document as per the best practice , I think my method may not work every time.

You can select the Minor version as 0 as suggested in the article.

Thanks .

Topic		Replies	Views
Document undertanding Training, Evaluation , Full pipeline Document Understanding question	3	25	March 18, 2026
Which minor version should I choose during a Full pipeline run and during an Evaluation run? Studio studio , question , document_understanding , activities_panel	3	1170	March 25, 2022
DU Training Model Automation Cloud automation_cloud , question	5	116	November 11, 2024
AI Center Pipeline Version Accuracy Issues AI Center question , ai_center	16	1417	December 3, 2022
ML Skill - Pipeline Run AI Center question , ai_center , pipeline-training	1	67	April 23, 2025

How to train same custom DU model multiple times?

Related topics