I can't annotate a signature in document type during training/annotate

Luca_Spaccatrosi · March 11, 2025, 12:48pm

Hello.

My scenario is:
1- Automation Cloud with Enterprise plan and Document Understanding license
2- Training phase of a Document Understanding Modern project with “UiPath Document OCR” as OCR method to extract data from pdf and OCR applied to all pdf
3- A custom document type (structured italian privacy policy) with one language in settings
4- I generated all the necessary fields to extract data and signatures (I only need signatures presence information)
5- The blu boxes are to hide sensible information and are irrelevant to the problem

My problem is:

The extraction engine extracts all the necessary fields which I then validate, including the signatures (green circle).
In some cases a signature (the document contains multiple signatures) is not pre-annotated by the extractor engine (red circle) and is not possible to draw a box around the signature to link it to the correct field.
I am therefore forced to mark the field as missing, thus obtaining a false negative regarding the presence of the signature!

I am at the beginning of my experience with document understanding and training of a project/model, but I believe that if the project has this behavior during training, I will have the same problem when I will use the project in automation with Studio, false negative in the presence of the signature!
Or am I missing something?

Please help.
Thanks in Advance.

Anil_G · March 11, 2025, 2:51pm

@Luca_Spaccatrosi

Welcome to the community

May I know why you are unable to draw around the signature? Because ideally you should be able to indicate anywhere on doc

Also if you want signature from sifferent areas thwn you need to give both type of documents with enough samples and train

Cheers

Darshan_Sable · March 11, 2025, 3:24pm

Go to project settings, there you can see multiple OCR Methods. Try other OCR methods to check if they can identify the signature

Also instead of Auto keep apply OCR option as Yes

Luca_Spaccatrosi · March 11, 2025, 3:57pm

Hi @Anil_G and thanks for the response.

Drawing a box with mouse pointer around the red circled signature of the first post, does not appare the dotted red box and the relative pop-up to link to a field.
This signature IS NOT pre-annotated by the engine.

Drawing a box with mouse pointer around the green circled signature of the first post is possible to draw a box to link to the field.
This signature IS pre-annotated by the engine,

The document is only of one type, a structured one with ten page, with two to three signatures on page one, two to four signatures on page two and one to two signatures on page nine.
Composing the data set (fields) with the first uploaded document, i generated all of nine fields for all possible signatures, with a sample document in which the all nine signatures are present (not all signatures are always present in the real documents).

Any thougs?

Luca_Spaccatrosi · March 11, 2025, 4:02pm

Hi @Darshan_Sable and thanks for the response.

I’ve already tried to use UiPath Extended Languages OCR (Preview) and Yes on Apply OCR on PDFs.
With no result at all.

Luca_Spaccatrosi · March 11, 2025, 4:15pm

Another case.
Clipboard_03-11-2025_04
In this case the signature is not pre-annotated by the engine, but is possible to draw a box and link to his specific field.

Luca_Spaccatrosi · March 11, 2025, 4:23pm

Other information, if it can be useful.
The documents are not scanned, but are “digital” pdfs containing only objects.
Even the signatures, when present, are objects placed in the correct place.

Darshan_Sable · March 11, 2025, 4:31pm

@Luca_Spaccatrosi try to draw larger box around signature. some of signature might have large background size and OCR might consider those extra space as well.

Luca_Spaccatrosi · March 11, 2025, 4:51pm

@Darshan_Sable
Good idea, but I’ve already tried it and it doesn’t work.
Also I risk capturing/hooking information that I don’t want.

Darshan_Sable · March 12, 2025, 4:52am

@Luca_Spaccatrosi is it possible to share the file so that I can check from my end?

Luca_Spaccatrosi · March 12, 2025, 9:57am

Hi @Darshan_Sable.
I’m waiting permission to share with you from my manager.
Just in case, how can i privately share with you a certain number of documents (the documents contains sensible data that i can’t publicy share)?

Darshan_Sable · March 12, 2025, 10:48am

@Luca_Spaccatrosi You can send message directly to my profile or you can email me darshan29sable@gmail.com

better edit the file and blur or remove sensitive data just keep the signature things for test

Darshan_Sable · March 12, 2025, 10:59am

@Luca_Spaccatrosi I tried with this photo and I was able to fetch the signature with UiPath Document OCR

It might be because we are using a image.

Luca_Spaccatrosi · March 12, 2025, 12:45pm

Interesting and yes, I think so too.
I’ll try to convert this document in ten jpg/png, build a new pdf and upload into document type to elaborate it with the model.
I will let you know.

I till waiting for permission to share.

Luca_Spaccatrosi · March 12, 2025, 2:12pm

And yes, now that the document is a flat pdf (image) the model is able to pre-annotate the signature.

Luca_Spaccatrosi · March 12, 2025, 2:14pm

And even in case of no pre-annotated signature (it depends on short model training) i am able to capture the signature and link it to correct field.

Clipboard_03-12-2025_02

Now i have permission to share with you.
In a few time i’ll send to you some original pdf to try signature recognition.

Luca_Spaccatrosi · March 12, 2025, 4:14pm

I just sent you an email with ten documents with no sensitive data.
Thank you.

Darshan_Sable · March 12, 2025, 4:29pm

@Luca_Spaccatrosi I observe that I was able select all the signature except the signature which are overlapped with the text or too closer to text which is expected. Solution is to ask user to sign properly in free space instead of overlapping it to text or close to text.

Luca_Spaccatrosi · March 12, 2025, 4:59pm

Yes, that pdf is one of the cases with signatures not recognized cause overlapping.
And is not possible to ask user to properly sign; the signatures are cattured digitally on a tablet and then the pdf is composed with signature object.

What for the first signature (first in page 1) of pdf 31131970?

Clipboard01

Darshan_Sable · March 12, 2025, 5:18pm

@Luca_Spaccatrosi I can annotate that

I have used 1040 document type

Topic		Replies	Views
Extract handwritten Signs from different PDF formats AI Center question , ai_center	6	1895	December 13, 2024
UiPath signature extraction Document Understanding question	3	196	May 20, 2024
Detecting signature using Document Understanding Concept Studio studio , question , activities_panel	2	1900	October 9, 2021
Checkbox and signature on scanned pdf (any OCR engine in UiPath) Help	4	9461	January 24, 2023
Checking Existence of signature in pdf file Studio studio , question , designer_canvas	17	3201	August 24, 2021

I can't annotate a signature in document type during training/annotate

Related topics