I am trying to use Tesseract OCR in UiPath to read a PDF document in Dutch. I followed the recommended steps to add support for the Dutch language by placing the nl.traineddata file in the C:\Program Files\UiPath\Studio\tessdata folder. However, I am still encountering the following error:
Read PDF With OCR: Error performing OCR: InvalidInputLanguage
Here’s what I’ve done so far:
I downloaded nl.traineddata for Tesseract OCR.
Placed the nl.traineddata file in both the ‘C:\Program Files\UiPath\Studio\tessdata’ folder and the UiPath\Vision folder (though I'm not entirely sure this is the correct location).
In UiPath, I configured the Tesseract OCR engine and set the Language property to nl.
Has anyone successfully used Tesseract OCR with other languages in UiPath, and if so, could you share the correct steps to configure it?
I have done both steps, for legacy as well as normal, and restarted, but I am still getting the invalid language error! ;(
Language set to “nld.traineddata”
tessdata\nld.traineddata file is now present in:
C:\Program Files\UiPath\Studio\net461\tessdata\nld.traineddata
C:\Program Files\UiPath\Studio\tessdata\nld.traineddata
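For anyone checking the same setup, here is a minimal Python sketch that lists which language codes are actually present in the two tessdata folders above. It assumes the usual Tesseract convention that a language code is the file name without the `.traineddata` extension (so `nld.traineddata` gives `nld`). Whether UiPath's Language property expects that bare code is an assumption here, not something confirmed in this thread. The helper names `available_languages` and `normalize_language` are made up for illustration.

```python
from pathlib import Path

SUFFIX = ".traineddata"


def available_languages(tessdata_dir):
    """Return the language codes found in a tessdata folder.

    A code is the file name minus '.traineddata' (Tesseract convention).
    A missing folder yields an empty list instead of raising.
    """
    folder = Path(tessdata_dir)
    if not folder.is_dir():
        return []
    return sorted(p.stem for p in folder.glob("*" + SUFFIX))


def normalize_language(value):
    """Strip a trailing '.traineddata' so 'nld.traineddata' becomes 'nld'."""
    return value[: -len(SUFFIX)] if value.endswith(SUFFIX) else value


if __name__ == "__main__":
    # Folders taken from the post above; adjust to your install.
    folders = [
        r"C:\Program Files\UiPath\Studio\tessdata",
        r"C:\Program Files\UiPath\Studio\net461\tessdata",
    ]
    # Assumption: the engine wants the bare code, not the file name.
    wanted = normalize_language("nld.traineddata")
    for folder in folders:
        langs = available_languages(folder)
        print(f"{folder}: found {langs}, contains '{wanted}': {wanted in langs}")
```

If the folders do list `nld` but the error persists, comparing the printed code with the exact value in the Language property (for example `nld` versus `nld.traineddata`) may help narrow things down.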