Trainable document splitter model in Modern Projects

andras.palfi · October 28, 2025, 9:00am

We’re excited to announce that a new trainable document splitter and classifier model is now available in Public Preview for tenants based in Europe and the US.

This new model extends the power of document classification to multi-document packets — enabling you to split and classify documents within a single workflow.

Key details

Availability: Currently, the feature is only available in Europe and the US. Availability for new reagions will be confirmed at a later date
Pricing: The new splitter and classifier model falls under the existing Modern Projects pricing model, where each page is charged 1 AI Units regardless of numbers of operations on that page. Please note that final pricing for General Availability (GA) may be subject to change.

What it does

The new model can:

Classify entire documents (e.g., identify whether a file is an ID, invoice, or application form).
Split and classify multi-document packets — such as mortgage or loan application files containing multiple sub-documents.

For example:
A 10-page mortgage application PDF might contain:

Pages 1–2 → ID
Pages 3–6 → Application form
Pages 7–10 → Bank statement

The new model can be trained to automatically detect these boundaries and assign the correct document type to each section.

Getting Started

Create a project

Start a Modern Project and enable the toggle “Enable new splitter and classifier model.”
To perform splitting in addition to classification, also turn on “Enable splitting.”

f6c180ed-6294-4497-9dd5-e4db6878cf581024×776 113 KB

Upload your documents

Go to Classify and Split Documents and upload your document packets.
Once processed, select the uploaded files and click Split to open the annotation interface.
If your project already has a trained model, the documents will be pre-annotated automatically — saving time and showing predictions.

b0eefa83-5e51-4c84-be95-15c4c7ca74841024×507 32.4 KB

Define your classification taxonomy

Click New document type to define each document type in your taxonomy.
Choose from predefined types or create custom ones with:
- A name
- A short description (1–3 sentences)
- A few key indicators (e.g., invoice number, total amount, seller information)
  
  24092df6-e4a9-4e6c-bc68-8974160c6eb11024×467 52.6 KB

Annotate and confirm splits

Mark where each document starts and ends, and assign a document type to each range.
Click Confirm to process and generate sub-documents.
Each sub-document appears under its document type in the Build section and gets pre-annotated with the schema of that type.
You can skip non-relevant pages by labeling them as “–Unknown–.”

e8bd8c6f-c915-4094-8535-5515c16c910a640×554 63.1 KB

Train your model

Training begins automatically once you have at least five annotated sub-documents.
Training status is visible in the Classification pane.

Review metrics

Navigate to the Measure page and review model metrics

Publish the Model

Once training is complete, publish your model in the Publish section to make it available for use in your automations.
Published models can be versioned, managed, and reused across projects.
The version of the new splitter and classifier model is 25.9

Screenshot 2025-10-27 at 15.55.491580×1328 125 KB

Consume the Model in Your Workflow or via APIs

Use the published model directly in UiPath workflows to automatically classify or split incoming documents.
Currently, the new splitter and classifier model can be consumed through IntelligentOCR.Activities 6.27.0.
You can also access the model through APIs to integrate document processing into other systems.

Reviewing Predictions

After training, all project documents are updated with predictions from the model.
You can review results by:

Comparing Ground Truth (Type) vs Predicted Type in the Classification table.
Viewing sub-documents by enabling “Include sub-documents” in the View menu.
Enabling “Show Prediction” in the annotation interface to see how the model performed.

Classification-Only Option

If you only need classification (not splitting), simply disable the “Enable splitting” toggle.
The model will then classify whole documents as before.

Current Limitations (Public Preview)

Some limitations apply during the preview phase:

Dataset

Minimum document types: 1
Minimum samples:
- Single document type → at least 5 samples
- Multiple types → at least 5 total documents (1 per type minimum)
Maximum document size: 160 MB or 500 pages
Training triggered after 5 annotation changes

Annotation

Pages cannot be reordered or deleted

Features not yet available

Splitting info in the Monitor page
Retraining for splitting/classification
Splitting support in for the cross-platform activities (DU.Activities)
Migrating splitting and classification data sets across environments

These limitations will evolve as we move toward General Availability (GA).

Share Your Feedback!

We’d love to hear your thoughts as you try the new splitter and classifier model:

How well does it handle your document packets?
What improvements would help before GA?
Any issues or surprises you encountered?

Your feedback will help us make the feature even better before full release.

David_Hernandez2 · October 28, 2025, 4:17pm

Will this be available in non-modern as well?

andras.palfi · October 29, 2025, 8:21am

Hi David, no, it won’t be available in classic projects. Do you have any blockers/issues with using Modern Projects that we should be aware of?

edevries · October 29, 2025, 12:53pm

I’m not David, but I can say the main issue with Modern Projects is how expensive they are.

David_Hernandez2 · October 29, 2025, 1:47pm

From what I have seen, Modern projects cost more.

andras.palfi · November 10, 2025, 8:35am

Hi, could you clarify what you mean by saying the main issue with Modern Projects is how expensive they are?

Are you referring specifically to splitting use cases, or to Modern Projects in general?

It’s true that some use cases can be more expensive, but many are not. I’m not trying to argue for either approach — both Modern and Classic (AI Center) Projects have their own advantages and disadvantages in terms of costs.

For example, in AI Center/Classic Projects, users pay for infrastructure costs — both for training and for model serving. Classification costs are also additive, meaning that if every document needs to be classified and extracted, the total cost will exceed that of Modern Projects (>1 AI Unit per page).

In Modern Projects, however, there are no infrastructure costs, and both classification and extraction together cost 1 AI Unit per page. So, if every document requires both classification and extraction, Modern Projects will actually be cheaper. On the other hand, if only a portion of documents need extraction, then Modern Projects can end up being more expensive.

For splitting use cases, we’re currently exploring alternative pricing models and would really appreciate your input.

Vajrang · December 1, 2025, 3:52pm

is there a video explaining the classification process?

when I use the project after classification, still getting single document type instead of multiple document types.

is this designed to work like intelligent keyword classifier which automatically gives different document types from a single document?

Kanal_Kumanan · March 10, 2026, 7:59pm

Hi, I have trained the classifier with multiple document types, and earlier the predicted type was showing correctly. However, now the predicted type is appearing as “Unknown” for the documents. Previously, those same documents were predicted with the correct type, but now they are being marked as “Unknown.” Could you please help me understand the reason for this issue?

Topic		Replies	Views
Automation Cloud Document Understanding page based classification Document Understanding	5	194	January 20, 2025
UiPath Document Understanding Machine Learning Classifier Public Endpoint release Product News feedback , document_understanding	6	3179	July 8, 2022
How to use classification and splitting option in Latest document understanding feature? Is there any guidelines document provided by Uipath? Document Understanding question , document_understanding	2	60	January 10, 2026
Document Understanding: Machine Learning Document Classification Community Release Document Understanding document_understanding	14	5721	March 3, 2023
Document Understanding: Splitting in Classic project AI Center question , document_understanding , ai_center , classic-project , splitting	5	212	June 20, 2024