Step by step document understanding creation

n.bounoun · December 9, 2025, 11:29am

I’m looking for a step by step to practice for a document understang?
now i’m using studio desktop

thank for advance

nishi_jain · December 9, 2025, 11:49am

Please confirm you need steps which can be follow to extract the data from document or do you need to understand how du is work

prashant1603765 · December 9, 2025, 12:00pm

Hi @n.bounoun

Welcome to UiPath,

I suggest pls go through the below link:

and

If helpful, mark as solution. Happy automation with UiPath

Jp_Khetan · December 9, 2025, 12:08pm

hii @n.bounoun

watch this Videos There is step by step videos are there for DU.

watch here:- https://www.youtube.com/watch?v=VHozzZiybJU&list=PLEYSwx3duQ2BWUnJORX4YVK54geNDJInW

nishi_jain · December 9, 2025, 12:09pm

Hi @n.bounoun

Please follow below steps to extract the data form invoices

Create new project
Install required packages UiPath.IntelligentOCR.Activities and UiPath.DocumentUnderstanding.ML.Activities.
Create folder - place sample invoices there
Add For Each activity (TypeArgument = String) to loop through file paths (use Directory.GetFiles(“path”,“*.pdf”)).Inside loop, use Load Document or just pass file path to Digitize Document.
Digitize Document
Activity: Digitize Document (from IntelligentOCR).
• Properties: DocumentPath = current file, Output = digitizedDocument (type Document).
• Choose OCR engine: Tesseract (offline) or Microsoft/Google OCR (if available). For scanned docs try OmniPage/Abbyy if licensed; otherwise tune Tesseract options.
Classify Document (optional for single type)
If you expect multiple document types, use Classify Document Scope + classifiers (KeywordBasedClassifer or ML Classifier). For single invoice type you can skip classification.
Data Extraction Scope
• Activity: Data Extraction Scope — Input: digitizedDocument.
• Inside Data Extraction Scope add extractors (order matters):
• Regex Based Extractor — good for invoice number, dates, totals.
• Form Extractor or Intelligent Form Extractor — good for structured tables.
• Keyword Based Extractor — fallback for field proximity.
• ML Extractor (pretrained) if you have it — better for variable layouts.
• Configure fields: Use a Taxonomy file or define fields manually (like InvoiceNumber, InvoiceDate, SupplierName, TotalAmount, LineItems).
Present Validation Station
• Add Present Validation Station (shows results to human for confirmation).
• Output: validationResult (Document object with validated answers).
Export / Save Results
• After validation, convert extraction results to a DataTable: iterate validationResult.DataExtractionResult.Documents(0).Taxonomy or use helper methods. Simplest: build a DataTable with columns FileName, InvoiceNumber and Add Data Row with values taken from validated fields (validationResult…GetTaxonomyValue(“InvoiceNumber”) depending on how you stored it).
• Use Write Range to save to Excel.
Cheers

If you find solution is helpful please mark solved .

mkankatala · December 9, 2025, 1:08pm

Hi @n.bounoun

It’s recommend you to start a document understanding and document understanding framework course in UiPath Academy which will give you idea on Document understanding process. If you are interested to learn from watching videos the below playlist is for you,

Hope it helps!!

n.bounoun · December 9, 2025, 2:40pm

do you have some zip file of project ?
thank you

nishi_jain · December 9, 2025, 2:43pm

Hi @n.bounoun

Currently I don’t have you can follow this and try to do some practice it will help you to understand the concept

If this solution worked for you please mark solved

Cheers

DavidMartin · December 10, 2025, 6:02pm

Hi,
Depending on your scenario you might want to consider UiPath IXP - it is the latest way of delivering document processing functionality.

Topic		Replies	Views
Extract Specific Data from Invoices,,,,, Studio studio , question , project_panel	3	202	July 23, 2024
UiPath Document Understanding - Full Tutorial Other activities youtube-video	0	728	April 29, 2024
Use case - DU Document Understanding question , document_understanding	3	88	January 12, 2026
How to extract required information's from different type of PDF invoices? Activities ocr , activities , question	3	955	July 8, 2021
New UiPath Document Understanding features have been released! Document Understanding news , document_understanding	23	14981	March 12, 2021

Step by step document understanding creation

Related topics