I have scanned copies of PDFs and I am trying to extract particular fields such as name, valid-from date, and valid-to date. These fields appear twice on a single page, so they are extracted twice into the simple fields.

How can I extract each field only once?

On a single page I have two ValidFrom fields, so the value is extracted twice. I used the form extractor with anchors.

Can you help me extract the field only once?

If you use the activity shown in the image instead, and then apply a regex to its output, I believe the following regex statement should work:

Assign:
"(?<=FROM)(.*\n?)(?=UNTIL)"
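As a rough illustration outside UiPath, the same pattern can be tried in Python. This is only a sketch: the sample text and the helper name `first_valid_from` are made up for the example, and it assumes the OCR output really contains the words FROM and UNTIL around the date. The point is that a single search returns only the first occurrence, even when the field appears twice on the page.

```python
import re

# Pattern from the reply above: the text between "FROM" and "UNTIL".
PATTERN = re.compile(r"(?<=FROM)(.*\n?)(?=UNTIL)")


def first_valid_from(ocr_text):
    """Return the first value found between FROM and UNTIL, or None."""
    match = PATTERN.search(ocr_text)  # search() stops at the first match
    return match.group(1).strip() if match else None


# Hypothetical OCR output in which the same fields appear twice on one page.
sample = (
    "NAME John Doe\n"
    "VALID FROM 01/02/21 UNTIL 01/02/22\n"
    "NAME John Doe\n"
    "VALID FROM 01/02/21 UNTIL 01/02/22\n"
)

print(first_valid_from(sample))
```

Taking a single match (rather than all matches) is what avoids the duplicate value; in .NET the equivalent idea would be to use one match instead of iterating over every match.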


Actually, I am using OCR (UiPath Document OCR) to extract the PDF, so the data is not being extracted properly by the regex.

Could you attach your workflow? Regex shouldn't be an issue even if you are using OCR to read the PDF.


It should read "valid from dd/mm/yy to dd/mm/yy", but when the data is extracted with OCR it is displayed differently.
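If the OCR text differs from the expected layout only in case, spacing, or the date separator, a more tolerant pattern may still find it. The Python sketch below rests on that assumption, which the thread does not confirm; the pattern, the accepted separators, and the helper name `first_validity` are illustrative, not part of any UiPath activity.

```python
import re

# Assumed tolerant pattern: any letter case, flexible whitespace, and
# "/", "-" or "." as the date separator. "to" or "until" may sit between
# the two dates.
DATE = r"(\d{1,2})\s*[/.\-]\s*(\d{1,2})\s*[/.\-]\s*(\d{2,4})"
VALIDITY = re.compile(
    r"valid\s*from\s*" + DATE + r"\s*(?:to|until)\s*" + DATE,
    re.IGNORECASE,
)


def first_validity(ocr_text):
    """Return (valid_from, valid_to) from the first match, joined with '/', or None."""
    match = VALIDITY.search(ocr_text)  # only the first occurrence is used
    if match is None:
        return None
    parts = match.groups()
    return "/".join(parts[:3]), "/".join(parts[3:])


# Hypothetical noisy OCR text with the field repeated on the same page.
noisy = "Valid  From 01-02-21 TO 03 / 04 / 22\nvalid from 01/02/21 to 03/04/22"
print(first_validity(noisy))
```

If the OCR misreads digits or drops the keywords entirely, a pattern like this will not help, and the actual OCR output would need to be inspected first.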