UiPath Document Understanding - Build a Document Understanding Automation in Studio - Additional exercise

Submit your additional exercise workflow

  1. Upload the solution zip file.
  2. Openly discuss any issues or doubts by including screenshots for clarity and receive guidance from UiPath MVP if you encounter roadblocks in the advanced exercise.
  3. Expect swift feedback as the MVP reviews workflows and offers suggestions every Wednesday of the week.
1 Like

Having Difficulty extracting the checkbox data i am using a form extractor , Request you all to please guide me @Melisa_Miranda

SpecAI_DU_Performer.zip (151.3 KB)
here is my workflow please let me know where i went wrong, i have configured every classifiers and extractors and yet its not extracting the data from the kyc form @postwick @Anil_G @supermanPunch @Melisa_Miranda @ppr

2 Likes

Hi @Melisa_Miranda

Here is my Workflow

Waiting for your Feedback

DU Forum Challenge.zip (1.1 MB)

2 Likes

Hello @Jai_Pande

I reviewed your workflow. It is pretty good. Great work implementing it :slight_smile:

I however, have few suggestions and guides for you.

Regarding the Date field extraction:

I saw the output files and the date fields were extracted as a number. You did correct by specifying the field as a type “Date” in Taxonomy. However, in forms like this we may need to add an extra setting by specifying the expected date format in the form. This is done because the document itself doesn’t say which format the date is in as we write it in a single line with numbers and no separations for day, month, etc.

So in the Taxonomy, we can set the expected date format

image

Regarding the Checkboxes:

the type set in the Taxnomy (Boolean) is correct. We also highlight the textbox area in the Form Extractor to indicate where it is. These two steps can work. However, there can also be scenarios where it requires a extra step here as well.

In the Form Extractor, we have a place to configure the usual characters that may show up for checkboxes. We can add those automatically and manually. Following is a screenshot.

Click on the “Add recommended” text next to it to automatically add the common characters. You can also edit manually.

Try adding this and running it again. It will most probably work. Feel free to reach out to me if it doesn’t work. We can work together.

I have another suggestion for you regarding the name fields like the “full name”. You can try using the data type in Taxonomy as “Name” instead of “Text”. There is nothing wrong in using “Text”. However, using the property “Name” will give you added formatting functionality. The extracted name will be separated automatically into “First Name”, “Middle Name”, and “Last Name” just like it does for Address and Dates.

Give it a try and see the magic :slight_smile:

Overall, great job friend!! Happy to see your learning… Keeping going!

Thanks
Lahiru

@Lahiru.Fernando

First of all it means alot when you responded sir.
I managed to solve the conundrum by first splitting the PDF into single pages and then inserting a single type of PDF into the template to enhance the Template of Form Extractor

Regards

1 Like

@sanjay3 Thank you for your workflow submission.
We will review and provide feedback shortly.

Hello @sanjay3

Great to see your efforts in learning AI. I have reviewed your submission and here is some feedback.

Your solution includes the Dispatcher (REFramework) and the Document Understanding flow.

Feedback on the Dispatcher

The use of REFramework is a good choice for the dispatcher. However, there are few things that you can improve as a part of best practices. You included the folder reading activity in the Process.xaml to capture all files that require processing. Ideal scenario would be to include this in the INIT state and push it out as a DataTable from the InitAllApplications.xaml. If you look at the Main.xaml, there is a predefined datatable that we can use as transaction items. If we can assign the values to that, we can pass them as Data Rows to the Process state.

The other approach is, instead of converting to Datatable, use a List. Either way, it is always good to define it in the INIT itself. The reason to define in INIT has multiple benefits:

  • in case of errors, we can always refer here and get the next item
  • We can maintain a status (if we use datatables)
  • We can ignore the follow up steps if there are no new files to run

Feedback on the Document Understanding flow

Here I assume that you had the idea of triggering the DU job multiple times for each queue item?

I say this because I don’t see the looping back to get the next queue item. Can you also share whether you planned to trigger the DU job based on a Queue Trigger? If so, that is perfect :slight_smile:

I also liked the fact that you have included the DU activities inside a Try Catch.

Taxonomy Manager specifics

Good naming methods.
I noticed a few fields where you can use better data types. For example, a few fields were referring to names of people with the data type “Text”. You can use the data type “Name” here. It will also help to automatically format the names into First, middle, and last names.

Its great that you added the date format for those tricky dates :slight_smile:

Flow specifics

Your classification method is correct. Can you explain a bit on how you trained your classifier? What was your approach in training?

The Train Classifier Scope has to be included after the manual classification step. For example, after Present Classification Screen. So you need to swap the two activities to reorder it. The reason is, we do the training only after the manual validation. and based on manual validation results If we use the auto-classified results for training, we may also train using the wrong data (if the automatic one is wrong).

Form Extractor config

Signature Fields option is only applicable for the fields that include the human signature. We only configure it when we need to detect whether a signature is present or not. It should point only that field. In your scenario, it is mentioned for most of the fields. Hence, you need to fine-tune that setting to point only to the field that requires the signature.

This is probably the reason why you get “YES” “NO” as the output for most of those fields in the Excel.

Though that is the practice, I kind of see a pattern in how you have configured it. Can you explain your thinking behind this? I would like to understand better and be a bit more specific :slight_smile:

Try Catch

The use of a try catch is wonderful. I just noticed that in the Catch section you are setting the transaction status to “Success”. I believe it should be “Failed” with Application Exception :slight_smile:

That’s what I have for now… I look forward for your reply so we can talk about those little pointers I mentioned.

Thanks again and keep learning!

Thanks
Lahiru

1 Like

Hi @Lahiru.Fernando

Thank you for the feedback

  1. For the Dispatcher , based on your feedback I will work on that and follow the best practice

  2. The Document Understanding flow, yes my idea was to process one file at a time.

  3. Taxonomy Manager specifiers , I will work on that.

  4. For Classifiers training, I manually split the document which are merged i.e. (AC Open and KYC From). After that when I run my flow it was automatically splitting the document which are merged.

For the Train Classifier Scope , I will make changes as per your feedback

  1. Form Extractor , I will work on that

6.Try Catch , yeah It should be failed, it was mistake from my side

Once again thank you for the feedback

Regards,
Sanjay

1 Like

Awesome… That is perfect…
Feel free to text here in case you need any assistance :slight_smile:

1 Like