Data extraction using Taxonomy

I am extracting one of the fields from my PDF file, and it should contain only numbers. But when the data is extracted to Excel, it includes some letters. For example, I am trying to extract 20-088022 but instead I'm getting F2or0th-e0ta8xa8b0le2ye4a2r. How can I resolve this problem?

Hey!

Which extractor are you using?

Can you try it with ML - Intelligent OCR?

Regards,
NaNi

I am using the Form Extractor. Do you want me to use the other one?

@THIRU_NANI I used the Intelligent Form Extractor and got the same result. Is there any other solution?

Hey!

We can still do string manipulation to get the numbers using RegEx…

System.Text.RegularExpressions.Regex.Match(InputStringVariable,"\d+").ToString
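One caveat for the sample value in this thread: the letters are interleaved with the digits, so a single `Regex.Match` on `\d+` returns only the first run of digits (here just "2"). A minimal sketch of an alternative is to delete every character that is not a digit or a hyphen with `Regex.Replace`. The module and function names below (`EinCleanup`, `CleanEin`) are made up for illustration, and the approach assumes the unwanted characters are never digits or hyphens themselves.

```vb
Imports System
Imports System.Text.RegularExpressions

Module EinCleanup
    ' Hypothetical helper: keeps only digits and hyphens from the raw OCR text.
    Function CleanEin(raw As String) As String
        ' [^\d-] matches any character that is neither a digit nor a hyphen.
        Return Regex.Replace(raw, "[^\d-]", "")
    End Function

    Sub Main()
        ' Sample value taken from the question above.
        Console.WriteLine(CleanEin("F2or0th-e0ta8xa8b0le2ye4a2r")) ' prints 20-0880242
    End Sub
End Module
```

In a workflow, the same expression would probably go into an Assign activity, for example `System.Text.RegularExpressions.Regex.Replace(InputStringVariable, "[^\d-]", "")`. This only cleans the text after extraction; it does not explain why the OCR is merging neighbouring text into the field.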

But the intelligent OCR should work…

Can you please show me the extraction part, where we are validating the field?

Regards,
NaNi

@THIRU_NANI

Is this what you want to see? I am trying to get the Federal Employer Identification Number.

Hey,

You can go through this video.
All your doubts will be cleared.
UiPath Document Understanding # 6 | Extract and Validate | ExpoHub | By Rakesh - YouTube

Thanks
Rounak

@Rounak_Kumar1 I watched that video. I can extract the elements I want, but one of the fields has letters added in between the numbers. Are there any activities I can use before the Write Range activity to format just one column in my data table?
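One possible way to clean a single column before writing it out is to loop over the rows and overwrite that column's value. The sketch below is plain VB.NET so it can be run outside a workflow; the names `ColumnCleanup`, `CleanColumn` and the "EIN" column are assumptions for illustration, not something from the actual project.

```vb
Imports System
Imports System.Data
Imports System.Text.RegularExpressions

Module ColumnCleanup
    ' Hypothetical helper: strips everything except digits and hyphens
    ' from one column of the table, leaving other columns untouched.
    Sub CleanColumn(table As DataTable, columnName As String)
        For Each row As DataRow In table.Rows
            If Not row.IsNull(columnName) Then
                row(columnName) = Regex.Replace(row(columnName).ToString(), "[^\d-]", "")
            End If
        Next
    End Sub

    Sub Main()
        ' Small stand-in for the extracted data table.
        Dim dt As New DataTable()
        dt.Columns.Add("Name", GetType(String))
        dt.Columns.Add("EIN", GetType(String))
        dt.Rows.Add("Example Co", "F2or0th-e0ta8xa8b0le2ye4a2r")

        CleanColumn(dt, "EIN")
        Console.WriteLine(dt.Rows(0)("EIN")) ' prints 20-0880242
    End Sub
End Module
```

Inside a workflow, the equivalent would likely be a For Each Row in Data Table activity with one Assign in its body that sets `row("EIN")` to the `Regex.Replace` expression, placed before Write Range.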

Hey,

Let's connect on Zoom.
We will resolve it.

Thanks

Sure. Thanks @Rounak_Kumar1 . I am using the same link you sent me last time.