Tesseract OCR is not supporting Tamil Language

Hi Team,

I tried to get text from pdf image by using “GET OCR TEXT” activity, but it shows invalid input language. Tesseract Installation verified and version is tesseract v5.5.0.20241111
. If i give language as “eng”, it is working but for “tam” tamil language not working. Please assist.
Note: tam.traineddata file available.

@CodeRays_Info

did you happen to install c++?

needed to unpack and train languages

and once the trainign data fiels are placed in right location for them to load we need to reload studio as well

cheers

Hi,

The following post may help you.

If you use Enterprise edition (system-install) , please replace “c:\Users\[AccountName]\AppData\Local\Programs” with “c:\Program Files” in the above post.

Regards,

I have installed Visual C++ but still getting same error as invalid input language.

Yes, I do have that file in the below path

C:\Users\game2\Documents\UiPath\Studio\tessdata

Hi,

Please check the above content. it says tessdata folder should be created under the following path.

 c:\Users[AccountName]\AppData\Local\Programs\UiPath\Studio

or

c:\Users[AccountName]\AppData\Local\Programs\UiPath\Studio\net461 

Regards,

Hi,

Filepath changed but still same error. Please check this attached snapshot.

Hi,

Can you try to reboot your machine then try again?

Regards,

Yes, after restarting the system, got this error. please check this attached snapshot.

Hi,

Can you share screenshot of property for TesseractOCR activity?

Regards,

Attaching snapshot of Tesseract OCR property.

Hi,

In my environment (24.10.6 per user install), it works well, if put tam.traineddata under c:\Users\[AccountName]\AppData\Local\Programs\UiPath\Studio\tessdata , as the following.

So, I’d like to confirm your Studio version and install mode: per user install or system install. (we can know it from path UiPath.Studio.exe is installed)

Regards,

Version is Studio 2024.10.6

Community License

per user installation

attaching snapshot

Hi,

It’s same as my environment.

To isolate cause, can you try to run the following sample?

Sample
Sample20241128-1.zip (8.3 KB)

Regards,

I have tried this issue but same error. Language given is “tam”

Hi,

I checked your above image again and it’s not same path what i mentioned.

Can you check your path? C:\Users\[userAccount]\AppData\Local\Programs\UiPath\Studio should already exist.

Regards,

Thank you for the clarification…After doing this, reloaded the studio and now i got different error.

Hi,

The following may help you.

However, as you mentioned it works well in Eng as lang settings, it seems strange…
It may be also good if you try another target with tam and/or eng.

Regards,


Install vcredist_x86 installed but same issue facing. Please check the attached snapshot.