How to convert a PDF (mixed Hindi and English) to a Word document

Dear Developers,

I have a PDF that contains both Hindi and English text.
I want to convert this PDF into a Word file.

I am using the Read PDF Text activity, but it only reads the English text.

Please help.
Attached is the sample PDF.


Test-1-162-pages-2 (1).pdf (30.6 KB)

@manish_patel

The Read PDF Text activity will only work for the English text.

To read other languages you need an OCR engine that supports the language (Hindi in your case).

I suggest finding a good OCR engine that can read Hindi text and then integrating it with UiPath.

Thanks
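
For anyone who wants to try the OCR route outside UiPath first, here is a minimal Python sketch under stated assumptions. It is not a UiPath activity. It assumes Tesseract is installed with the `hin` and `eng` language data, and that the third-party packages `pdf2image` (which needs Poppler), `pytesseract`, and `python-docx` are available. The function names `contains_devanagari` and `pdf_to_docx_via_ocr` are made up for illustration. The first helper checks whether an extraction result dropped the Hindi part; the second rasterizes each page, runs OCR, and writes plain paragraphs to a Word file (layout and formatting are not preserved).

```python
def contains_devanagari(text: str) -> bool:
    """Return True if any character lies in the Devanagari Unicode block."""
    return any("\u0900" <= ch <= "\u097f" for ch in text)


def pdf_to_docx_via_ocr(pdf_path: str, docx_path: str, lang: str = "hin+eng") -> None:
    """OCR every page of a PDF and save the recognized text as a .docx file."""
    # Third-party imports stay local so the helper above works without them.
    from pdf2image import convert_from_path  # assumption: Poppler is installed
    import pytesseract                       # assumption: Tesseract with hin/eng data
    from docx import Document                # python-docx

    document = Document()
    # Render each page to an image; 300 dpi is a common choice for OCR.
    for page_image in convert_from_path(pdf_path, dpi=300):
        page_text = pytesseract.image_to_string(page_image, lang=lang)
        for line in page_text.splitlines():
            if line.strip():  # skip empty lines produced by OCR
                document.add_paragraph(line)
        document.add_page_break()
    document.save(docx_path)
```

Running `contains_devanagari` on the output of a plain text-extraction step is a quick way to confirm whether the Hindi portion was lost before switching to OCR.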

Hi @manish_patel,

As suggested by @Srini84, for languages other than English you can use one of the built-in OCR engines to get the results.

Thanks,
Shikhar

Hi @manish_patel ,

The documentation below describes how to install the OCR engine and how to select and install the required language pack.

I haven't tried this myself, but even after the installation you may have to run OCR on the PDF files twice: once with English (en) and once with Hindi (hi).

Let us know if this works.
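
As a possible alternative to two separate passes: the Tesseract command line accepts several languages joined with `+` (for example `-l hin+eng`), so a single pass may be enough. Whether the UiPath OCR activity accepts the same combined value is an assumption to verify. The small Python sketch below builds that value from the two-letter codes mentioned above; the mapping table and the function name `tesseract_lang_arg` are hypothetical helpers, not part of any library.

```python
# Assumed mapping from two-letter codes to Tesseract language data names.
TESSERACT_NAMES = {"en": "eng", "hi": "hin"}


def tesseract_lang_arg(codes):
    """Join language codes into a Tesseract-style value such as 'hin+eng'."""
    names = []
    for code in codes:
        # Unknown codes are passed through unchanged.
        name = TESSERACT_NAMES.get(code, code)
        if name not in names:  # keep order, drop duplicates
            names.append(name)
    return "+".join(names)
```

With Tesseract itself, the resulting string would be passed as the language argument for a single run instead of running English and Hindi separately.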