Fixed Form Data Extraction selecting Text over Date in Date Field

Hi All,

I’ve got a curious case where I have a clearly labelled date field which is consistently selecting a Text value over a date value

This is the settings I’ve applied in the Taxonomy

Here is the selected values in the Document
image

And the detected options
image

Is there a simple way to force the Data Extraction to select the actual date data over the text data?

For additional information, the Extract model is an Old Fixed Form extraction model, which appears to have lost its “Template” document a while ago which means I can no longer update it. I’ve added in a couple of additional Fixed Form extraction models, however these don’t appear to be getting selected over the old extraction model

@Stefan.He_Enlift,

Try changing OCR Engine and Apply OCR on PDF.

Thanks,
Ashok :slight_smile:

I gave that a go, but unfortunately its made my results even worse

image

image

@Stefan.He_Enlift,

Which OCR engine you are using?

My original settings were

  • UiPath OCR Engine (For Digitization and all Classification/Extraction training)
  • ApplyOCROnPDF = Yes (For Digitization, and all Classification/Extraction training)

I tried out the Tesseract OCR engine which is free to use specifically for a new template. And also tried variations of “ApplyOCROnPDF” = Yes and “ApplyOCROnPDF” = Auto with no luck.

@Stefan.He_Enlift

looks like your indicated region also looks wrong…can you try checking that

cheers