Reading scanned PDF with different languages

Devbrath_Rajkhua · September 30, 2020, 5:01am

Hey Guys,

Hope you are doing fine in this lockdown. Need a help in on of the POCs I am working on.
I have a scanned PDF and I am trying to read it using Microsoft OCR. The issue here is with the language in the PDF, which could be any language. When I try to read it using OCR it picks up only the English alphabets and skips any other language present in the PDF.

Am I missing something while reading the PDF or going somewhere wrong here. Please help.

P.S. : The POC is to identify the language the PDF is in and I have already figured that part out. I just need to send the output of the PDF to my component.

Cheers
Dev

Pablito · October 6, 2020, 12:08pm

Hi @Devbrath_Rajkhua,
Hopefully this could help you

Topic		Replies	Views
Microsoft OCR Help	1	2847	January 31, 2018
How to read different languages from pdf other than english Help activities	5	5442	February 18, 2020
How to set up Google OCR for Portuguese Language Help ocr , studio	6	6835	January 2, 2020
Installing OCR Languages Tutorials ocr , studio	0	2846	December 20, 2017
试用版的OCR可以识别中文吗？中文 studio	3	3234	April 29, 2019

Most Active Users - Yesterday
ashokkarale
sharazkm32
sonaliaggarwal47
LamaX
More details...

Reading scanned PDF with different languages

Related topics