Can not read "đ" by Read PDF Text

Hi, I am Vietnames and trying to read pdf text by Read PDF Text Activity.
My problem is this activity can not read character “đ” like this
image
The rectangle is where the “đ” character should be in.
Please help me to resolve it
Thank you

Hi @Duy_Luong_Minh

Use Read PDF with OCR activity and with in that place the Tesseract OCR. In Tesseract OCR there is option for scale and for the first time just run the code without giving any value in the scale option. If it works fine then it’s okay if not then give the value in scale option. You can start the value from 0 and can go till the max value of 5. The value should be increased by 0.5 in each try. You will achieve the exact output at a particular value.

Regards

Hi,

Can you share your input pdf as file? It’s no problem if dummy data.

Regards,

Hi @Duy_Luong_Minh Duy,
Read PDF Text
it works well with Vietnamese
my in/output


my activity

Regards,