About the Document Understanding category

Ryan_Rush · June 10, 2019, 6:47pm

Everything you know about UiPath Document Understanding.

Document Understanding is the ability to extract and interpret information and meaning from a wide range of document types (e.g., structured, unstructured), storage formats (e.g., images, PDFs, text), and objects (e.g., handwriting, stamps, logos).

Pankit · June 27, 2019, 11:39am

Is it still in beta? Can we use it in production?

charliefik · July 29, 2020, 7:07pm

Hi,

I’m going through the Document Understanding course at the moment.

This question may not be relevant, as it doesn’t relate to images, PDFs, or text.

Suppose data comes into a process as Excel attachments from a variety of sources, and the files are inconsistent in column positions, table starting row, and header names, with some records spanning multiple rows.

Is there a way to set up rules for finding the data, as you would for non-standard documents in the Document Understanding toolset, or to use ML models that learn to extract the data through training, as you can with scanned documents?

satsoni · September 21, 2020, 8:32am

Hi, this is an interesting question. Did you figure out an answer? As a worst-case fallback, I would suggest saving your spreadsheet as a PDF… I'm not sure this is a generic solution, because some Excel tables will span many pages.

A real and final solution is to work with a data scientist and build your own ML engine for this. It does not sound extremely difficult to accomplish. I stand to be corrected.

regards
Sats

charliefik · October 10, 2020, 6:29am

Hi Sats,

I haven’t got a solution (I ran into this issue ages ago, and at the time it was just one of many issues with the process that made it unsuitable).

I would still like to know if there are any solutions out there. You need a resilient, neat way to store the rules for all the different formats of Excel input data. Maybe an ML model is ideal, but I don’t know how to go about building one.
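
For anyone landing here with the same problem, a rule-based approach can be sketched in plain Python. This is only a minimal illustration, not a UiPath feature: the alias table `HEADER_ALIASES`, the field names, and the helper functions are all hypothetical, and it assumes the sheet has already been read into a list of rows. It keeps the "rules" as data (known header spellings per field), scans the first rows for a header line, and maps columns by name instead of position.

```python
# Hypothetical rule set: canonical field name -> header spellings seen in source files.
HEADER_ALIASES = {
    "invoice_no": {"invoice no", "invoice number", "inv #"},
    "amount": {"amount", "total", "total amount"},
    "date": {"date", "invoice date"},
}


def normalize(cell):
    """Lower-case a cell and collapse whitespace; empty cells become ''."""
    if cell is None:
        return ""
    return " ".join(str(cell).strip().lower().split())


def match_headers(row):
    """Return {canonical field: column index} for cells that match a known alias."""
    found = {}
    for idx, cell in enumerate(row):
        text = normalize(cell)
        for field, aliases in HEADER_ALIASES.items():
            if text in aliases and field not in found:
                found[field] = idx
    return found


def find_table(rows, min_fields=2, max_scan=50):
    """Find the first row that looks like a header; return (row index, column map) or None."""
    for i, row in enumerate(rows[:max_scan]):
        cols = match_headers(row)
        if len(cols) >= min_fields:
            return i, cols
    return None


def extract_records(rows):
    """Read every non-empty row below the detected header into a dict per record."""
    located = find_table(rows)
    if located is None:
        return []
    header_idx, cols = located
    records = []
    for row in rows[header_idx + 1:]:
        record = {f: (row[i] if i < len(row) else None) for f, i in cols.items()}
        if any(v not in (None, "") for v in record.values()):
            records.append(record)
    return records
```

Supporting a new source then means adding a spelling to the alias table, not writing a new workflow. Records that span multiple rows are not handled here and would need an extra merge step.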

Thanks,

Kesavaraj_K · November 11, 2020, 7:58pm

Hi Everyone,

I have been working with DU for some time and still haven’t figured out a way to automate it fully unattended.

There will always be scenarios where a document’s extracted data has a low confidence level. In such cases, we use Validation Station and Action Center.

My question is about activities such as the ones below: using them stops the process until someone responds to the action that was created, which leaves idle time with no execution.

Create Validation Action
Wait and Resume Validation Action

Could anyone please tell me if I’m going about this the wrong way?
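
To make the routing idea concrete, here is a minimal sketch in plain Python. It does not use any UiPath activity or API: the `Extraction` class, the `route` function, and the 0.8 threshold are assumptions for illustration only. It splits a batch so that only documents with a low-confidence field go to human review, while the rest can continue without waiting.

```python
from dataclasses import dataclass


@dataclass
class Extraction:
    doc_id: str
    fields: dict  # field name -> (value, confidence between 0 and 1)


def needs_review(extraction, threshold=0.8):
    """A document needs review if any field's confidence is below the threshold."""
    return any(conf < threshold for _, conf in extraction.fields.values())


def route(extractions, threshold=0.8):
    """Split a batch into (auto-accepted doc ids, doc ids queued for human review)."""
    auto, review = [], []
    for e in extractions:
        (review if needs_review(e, threshold) else auto).append(e.doc_id)
    return auto, review
```

With this split, the review items can all be created up front and the high-confidence documents processed in the meantime, so the waiting time is spent on useful work.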
