Facing problem in extracting aadhar card using Document understanding

Jai_Pande · October 9, 2023, 4:37pm

Hi all,

I have doing a self project on extracting aadhar card details.I am using document understanding.Whether i use keyword extractor or machine learning its unable to extract name of the person.

The images are clicked from phone and i am using UiPath document ocr

In some images it is like
First name: Name

Some its like
Name

Thanks!

Shiva_Nikhil · October 9, 2023, 4:52pm

@Jai_Pande

Have you tried with pdf activities

check it once its better if the fields are not constant

or you need to use different extractor for different Aadhar card types

cheers

Jai_Pande · October 9, 2023, 5:33pm

Actually i want to make a project with DU only, so can you kindly help me out

Jai_Pande · October 9, 2023, 5:35pm

Also i need to ask do i have to use separate DU activites as for ex there i a single aadhar card but there are two images of it front and back…front contains name etc…and back contains address

Jai_Pande · October 9, 2023, 6:05pm

@supermanPunch need help man!

supermanPunch · October 9, 2023, 6:05pm

Hi @Jai_Pande ,

Could you let us know upto what stage have you performed the Process Design ?

What is the Model used ? If DU Model, what was the number of documents used for Training (Labelling+Pipeline Run) ?

Jai_Pande · October 9, 2023, 6:12pm

There are 5 docs available. 1 Aadhar card and 1 pan card. Each has 2 documents namely:- Front and Back

Shiva_Nikhil · October 9, 2023, 6:23pm

@Jai_Pande

Then you need to use two extractor one for Pancard other for Aadhar Card

cheers

Jai_Pande · October 9, 2023, 6:30pm

yes but aadhar card too has two pages front and back…and when i am extracting the front page there is data of back page also in present validation station

supermanPunch · October 9, 2023, 6:33pm

@Jai_Pande ,

Not yet fully able to understand the implementation done. Could you let us know the progress of the implementation upto now as previously mentioned ?

Jai_Pande · October 9, 2023, 6:51pm

Oh! the model used is Document Understanding with Flowchart brother, actually i am unable to extract the name its showing something else in validation station

Shiva_Nikhil · October 9, 2023, 6:56pm

@Jai_Pande

try with form extractor or Regex based extractor

cheers

Venkat4 · October 9, 2023, 6:59pm

@Jai_Pande

can you confirm if you have performed labeling the data and trained the model.

Cheers

Jai_Pande · October 9, 2023, 7:02pm

guys its always showing Yes in field of name, that the major problem here

Jai_Pande · October 9, 2023, 7:04pm

i am using trainer after present validation station , the workflow is as follows:- Taxo…digitization…classify…extraction…present validation station…and then classifier trainer

Jai_Pande · October 9, 2023, 7:04pm

using form extractor

Shiva_Nikhil · October 9, 2023, 7:13pm

@Jai_Pande

try once with regex based extractor as you are not getting the appropriate values using Form extractor

cheers

Jai_Pande · October 9, 2023, 7:14pm

ok and should i create separate document types in taxo for aadhar card front and back??

Shiva_Nikhil · October 9, 2023, 7:17pm

@Jai_Pande

see basically we use one taxinomy for one type of pdf,as aadharfront and back as combined will called as a single pdf

for pan card you need to add another one in the taxinomy

Jai_Pande · October 9, 2023, 7:25pm

okay i have two separate images of aadhar card brother…front and back

Topic		Replies	Views
How to extract the Text from AADHAR Card Studio uiautomation	4	7783	August 29, 2023
Form extractor not extracting the whole form! please help Studio orchestrator , activities , studio	5	202	November 20, 2023
Partial Data Extraction from Document AI Center question , ai_center	1	109	June 26, 2024
Document Understanding data not getting extracted Activities excel , uiautomation , studio	5	393	November 17, 2023
Having Issues Extracting Data from semi structured pdf Document Understanding	2	1070	June 21, 2020

Facing problem in extracting aadhar card using Document understanding

Related topics