Ocr is not working with japanese language

suresh_polinati · October 9, 2017, 6:51am

Hi,
I’m not able to get the data using Google ocr (Japanese language) From Scanned pdf file.
I’m getting the following error.

ddpadil · October 9, 2017, 6:54am

did you try with Microsoft from OCR engine option?

suresh_polinati · October 9, 2017, 6:54am

yes. its not working

suresh_polinati · October 9, 2017, 7:22am

finally I got solution.

suresh_polinati · October 9, 2017, 7:23am

here the language pack. This one working

tango · October 16, 2017, 7:14am

Thank you @suresh_polinati
I tried jpn.traineddata and can fixed same issue.
But it seems there are many incorrect texts.
eg. １００万円 ==> １。。円

galbeath123 · October 17, 2017, 11:08am

OCR isn’t perfect. Try scale option or Microsoft OCR.

suresh_polinati · November 14, 2017, 6:26am

Using Microsoft Ocr is not I’m Not able to read Japanese data.

galbeath123 · November 14, 2017, 10:54am

Hi.
Language Pack might be the solution. Hope this helps

Topic		Replies	Views
OCR different language Help	3	4409	March 8, 2017
Tesseract OCR 日本語対応できない問題フォーラム robot , question	5	5325	April 17, 2021
GoogleOCRのインストール方法についてフォーラム studio	8	5324	April 20, 2019
OCR Japanese Help ocr , studio	6	3875	July 1, 2019
Do UiPath provide the Korean language package? Help	8	4658	September 28, 2017