Hi guys, please provide a solution for this issue in Document Understanding

Lokesh_M2 · April 25, 2023, 8:40am

I am trying to do data labelling in AI Center. In some PDF documents some fields are missing, so I am hiding those fields. Now it is not extracting the data for those fields even from the other documents where they are available. Please provide a solution. Thank you.

Anil_G · April 25, 2023, 8:48am

@Lokesh_M2

You need not hide the field. If it is not present, it will simply be blank, with no data.

cheers

arjunshenoy · April 25, 2023, 8:48am

Hi @Lokesh_M2

When labeling data, it is recommended to use documents that actually contain the fields you will be labeling, because the same labeled data is sent to the dataset, which is then used to build the ML model.

If a field is not present in one of the documents being labeled, you can simply leave that field empty for that document, but do not mark it as hidden. (Hiding removes the field when exporting to the dataset, and thus you are not able to extract it.)

Leave it empty like this:

Hope this helps,
Best Regards.
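
To make the difference between an empty field and a hidden field concrete, here is a toy Python model of the behaviour described in this post. It is only a sketch: the field names and the `export_dataset` helper are hypothetical and are not part of any UiPath API. The assumption it illustrates is that hiding applies to the field as a whole, while leaving a field empty only affects the one document.

```python
# Toy model only: field names and export_dataset are hypothetical,
# not a UiPath API. It mimics "hidden fields are dropped on export".

def export_dataset(labeled_docs, hidden_fields):
    """Drop hidden fields from every document; keep empty (None) values."""
    return [
        {name: value for name, value in doc.items() if name not in hidden_fields}
        for doc in labeled_docs
    ]

labeled_docs = [
    {"invoice_no": "INV-001", "po_number": "PO-77"},
    {"invoice_no": "INV-002", "po_number": None},  # absent in this PDF: left empty
]

# Hiding the field removes it from ALL documents, including the one that has it.
with_hidden = export_dataset(labeled_docs, hidden_fields={"po_number"})

# Leaving it empty keeps the labeled value from the document that has it.
with_empty = export_dataset(labeled_docs, hidden_fields=set())

print(with_hidden)
print(with_empty)
```

In this model, `with_hidden` no longer contains `po_number` anywhere, which matches the symptom in the original question, while `with_empty` still carries the labeled value from the first document.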

Lokesh_M2 · April 25, 2023, 8:59am

If I leave fields empty, it gives an error while exporting.

arjunshenoy · April 25, 2023, 9:00am

@Lokesh_M2

This is not because you left the field empty. As the error itself indicates, you need to label at least 10 pages in order to export to the dataset.

Best Regards.

Lokesh_M2 · April 25, 2023, 9:04am

But for my native docs at least one field is missing.

arjunshenoy · April 25, 2023, 9:07am

@Lokesh_M2

You might have to get more data in that case. As per UiPath’s official documentation:

→ For Regular fields, you need at least 20-50 document samples per field. So, if you need to extract 10 regular fields, you need at least 200-500 document samples.

→ For Column fields, you need at least 50-200 document samples per column field, so for 5 column fields, with clean and simple layouts, you might get good results with 300 document samples.

→ Classification fields generally require at least 10-20 document samples from each class.

https://docs.uipath.com/document-understanding/automation-cloud/latest/user-guide/training-high-performing-models
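
The arithmetic behind those numbers can be sketched in a few lines of Python. The per-field ranges are the ones quoted above; the `sample_range` helper and the constant names are only illustrative, and the result is a rough planning estimate rather than a rule.

```python
# Per-field ranges as quoted in this post; the helper is illustrative only.
REGULAR_PER_FIELD = (20, 50)   # document samples per regular field
COLUMN_PER_FIELD = (50, 200)   # document samples per column field

def sample_range(num_fields, per_field):
    """Return (low, high) document samples for num_fields fields."""
    low, high = per_field
    return num_fields * low, num_fields * high

print(sample_range(10, REGULAR_PER_FIELD))  # (200, 500)
print(sample_range(5, COLUMN_PER_FIELD))    # (250, 1000)
```

For 5 column fields this gives a range of 250 to 1000, so the figure of 300 mentioned for clean and simple layouts sits near the low end of that range.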

Best Regards.

Lokesh_M2 · April 25, 2023, 9:16am

Thank you @arjunshenoy for your valuable info.

Lokesh_M2 · April 25, 2023, 9:16am

Thank you @Anil_G

