Data extraction using Taxonomy

rrahat93 · July 22, 2022, 7:00pm

I am extracting one of the elements from my PDF file, and it should contain only numbers. But when the data is extracted to Excel, it includes some letters. For example, I am trying to extract 20-088022, but instead I'm getting this: F2or0th-e0ta8xa8b0le2ye4a2r. How can I resolve this problem?

THIRU_NANI · July 22, 2022, 7:02pm

Hey!

Which extractor are we using?

Can we try with ML - Intelligent OCR?

Regards,
NaNi

rrahat93 · July 22, 2022, 7:05pm

I am using the Form Extractor. Do you want me to use the other one?

rrahat93 · July 22, 2022, 7:15pm

@THIRU_NANI I used the Intelligent Form Extractor and got the same result. Is there any other solution?

THIRU_NANI · July 22, 2022, 7:22pm

Hey!

We can still do string manipulation to get the numbers using RegEx…

System.Text.RegularExpressions.Regex.Match(InputStringVariable, "\d+").ToString
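One caveat with the expression above: "\d+" matches only the first unbroken run of digits, so on the garbled value from the question it would return just "2". A minimal sketch of an alternative, assuming the goal is to keep every digit and hyphen and drop everything else, is shown below in Python so the behaviour can be checked quickly. The function name `strip_to_digits_and_hyphens` is made up for illustration. In a UiPath Assign, the same pattern should work with System.Text.RegularExpressions.Regex.Replace(InputStringVariable, "[^0-9-]", "").

```python
import re


def strip_to_digits_and_hyphens(raw: str) -> str:
    """Remove every character that is not an ASCII digit or a hyphen."""
    return re.sub(r"[^0-9-]", "", raw)


# The garbled value quoted in the original question.
garbled = "F2or0th-e0ta8xa8b0le2ye4a2r"

# What a plain \d+ match returns: only the first run of digits.
first_run = re.search(r"\d+", garbled).group()

# What removing the unwanted characters returns.
cleaned = strip_to_digits_and_hyphens(garbled)

print(first_run)  # 2
print(cleaned)    # 20-0880242
```

Note that the cleaned result, 20-0880242, has one more digit than the 20-088022 quoted in the question, so it is worth checking against the PDF. This approach also keeps any digits that belong to the overlapping text rather than to the field itself, so it is a workaround and not a fix for the extraction.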

But the intelligent OCR should work…

Can you please show me the extraction part, where we are validating the field?

Regards,
NaNi

rrahat93 · July 22, 2022, 7:33pm

@THIRU_NANI

Is this what you want to see? I am trying to get the Federal Employer Identification Number.

Rounak_Kumar1 · July 23, 2022, 7:04am

Hey,

You can go through this video.
It should clear up all your doubts.
UiPath Document Understanding # 6 | Extract and Validate | ExpoHub | By Rakesh - YouTube

Thanks
Rounak

rrahat93 · July 23, 2022, 11:44am

@Rounak_Kumar1 I watched that video. I can extract the elements I want, but one of the fields is adding some letters in between the numbers. Are there any activities I can use before the Write Range activity to format just one column in my data table?
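For reference, cleaning a single column before writing comes down to looping over the rows and rewriting only that column. The sketch below shows the idea in Python, with a list of dictionaries standing in for the data table. The column names, the second row, and the `clean_column` helper are all made up for illustration. In UiPath the equivalent would presumably be a For Each Row loop over the data table with an Assign that applies a regex replace to the one column.

```python
import re


def clean_column(rows, column):
    """Return copies of the rows with non-digit, non-hyphen characters
    removed from the given column; other columns are left untouched."""
    cleaned_rows = []
    for row in rows:
        new_row = dict(row)  # copy so the input rows are not modified
        new_row[column] = re.sub(r"[^0-9-]", "", str(row[column]))
        cleaned_rows.append(new_row)
    return cleaned_rows


# Hypothetical table: only the first EIN value comes from the thread.
table = [
    {"Company": "Example Co", "EIN": "F2or0th-e0ta8xa8b0le2ye4a2r"},
    {"Company": "Sample LLC", "EIN": "12-3456789"},
]

result = clean_column(table, "EIN")
for row in result:
    print(row["Company"], row["EIN"])
```

Values that are already clean pass through unchanged, so the same step can run on every row without checking first.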

Rounak_Kumar1 · July 23, 2022, 11:46am

Hey,

Let's connect on Zoom.
We will resolve it.

Thanks

rrahat93 · July 23, 2022, 11:53am

Sure. Thanks @Rounak_Kumar1 . I am using the same link you sent me last time.
