Document Understanding in UiPath Studio

Hey, I have a question related to Document Understanding. I am extracting a table from a PDF, and the number of rows may vary. I get extra rows in the results. For example, the table has 2 rows, but it returns 7 rows. The other 5 come from different sections of the document. I re-indicated the table, but the result is the same on 3 different documents. Can someone shed light on it?

Plus, which OCR is better for DU? I am using UiPath Document OCR. I was thinking about to use Omnipage OCR. Thanks a lot.

@Yasir_Yaqoob

You can use document understanding tab to create ML skill by training multiple documents (at least 10).
Once you train your ML skill it will capture only desired rows. Hope all pdf have same structure and rows can be any number.

Crate table fields and click on predict button it will predict it like below.
In case something indicated wrongly, you can update the fields

@Yasir_Yaqoob

  1. Is it having same number of columns and same column names?
  2. train more sample to get accurate date
  3. while annotating make sure you indicate the pages from where you need data only

cheers

You are speaking about training first. I am not a training model. But I would like to do. Could you let me know how can I use that trained model in UiPath Studio?

@Yasir_Yaqoob Sure

click on create project

Create a document type for example invoices

click open document type

import at least 10 files and indicate the fields

I knew that part. Next, If I want to build automation in UiPath Studio, how will I select that trained project?

@Yasir_Yaqoob you have to create ML skill for extraction with automated training option


once training is completed use this ML skill in your project with ML extractor
image

OK, thank you. I will give it a try very soon and will mark your answer as solution. Appreciate your time.

1 Like

This topic was automatically closed 3 days after the last reply. New replies are no longer allowed.