Error with Tesseract OCR: InvalidInputLanguage for Dutch (nl.traineddata)

uiStijn · November 6, 2024, 10:40am

Hey there,

I am trying to use Tesseract OCR in UiPath to read a PDF document in Dutch. I followed the recommended steps to add support for the Dutch language by placing the nl.traineddata file in the C:\Program Files\UiPath\Studio\tessdata folder. However, I am still encountering the following error:

Read PDF With OCR: Error performing OCR: InvalidInputLanguage

Here’s what I’ve done so far:

I downloaded nl.traineddata for Tesseract OCR.
Placed the nl.traineddata file in both ‘C:\Program Files\UiPath\Studio\tessdata’ and the UiPath\Vision folder (but I'm not entirely sure if this is the correct location).
In UiPath, I configured the Tesseract OCR engine and set the Language property to nl.

Has anyone successfully used Tesseract OCR with other languages in UiPath, and if so, could you share the correct steps to configure it?

Thank you

Greetings,

Stijn

Yoichi · November 6, 2024, 10:45am

Hi,

The following post may help you.

If you are using the Enterprise edition, you need to put it in the similar folder under Program Files.

Regards,

uiStijn · November 6, 2024, 11:14am

Hey @Yoichi!

I have done both steps, for legacy as well as normal, and did a restart, but I'm still getting the invalid language error! ;(

Language set to “nld.traineddata”

tessdata\nld.traineddata file is now present in:
C:\Program Files\UiPath\Studio\net461\tessdata\nld.traineddata
C:\Program Files\UiPath\Studio\tessdata\nld.traineddata

Yoichi · November 6, 2024, 2:07pm

Hi,

Can you try setting just "nld"?

Regards,
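
The suggestion above follows from how Tesseract names its language data: the language code is the file name without the .traineddata extension, so nld.traineddata is selected with "nld". Below is a minimal Python sketch of that mapping, plus a check that the matching file is present in a tessdata folder. The helper names (language_code, missing_languages) are hypothetical and not part of UiPath or Tesseract, and the folder in the usage example is the one mentioned in this thread, assumed rather than verified.

```python
from pathlib import Path, PureWindowsPath

SUFFIX = ".traineddata"


def language_code(value: str) -> str:
    """Reduce a file name or path such as 'nld.traineddata' to the code 'nld'.

    PureWindowsPath is used so that Windows-style paths are split correctly
    regardless of the platform this script runs on.
    """
    name = PureWindowsPath(value).name
    return name[: -len(SUFFIX)] if name.endswith(SUFFIX) else name


def missing_languages(tessdata_dir: str, values: list[str]) -> list[str]:
    """Return the language codes that have no <code>.traineddata file in tessdata_dir."""
    folder = Path(tessdata_dir)
    codes = [language_code(v) for v in values]
    return [c for c in codes if not (folder / f"{c}{SUFFIX}").is_file()]


if __name__ == "__main__":
    # Assumed location, taken from the posts above; adjust to your installation.
    tessdata = r"C:\Program Files\UiPath\Studio\tessdata"
    print(language_code("nld.traineddata"))
    print(missing_languages(tessdata, ["nld.traineddata"]))
```

If missing_languages returns an empty list for the folder your Studio version actually reads, the file is in place and the Language property should contain only the code, here nld. The thread does not confirm whether this resolved the error.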

