Which scrapping methods to get non-unicode text?

I can not see OCR method than
fulltext,
text attribute,
native,

when the windows application is non-unicode, (actually EUC-KR)
then
what is best recommendation to get text from non-unicode windows application?

Hi @mountie

You can use Screen Scraping Method to extract the uni-code. Since in the latest update Claasic experience has been removed so follow the below process to get Screen Scraping option.

Open your Project.json file of your project and add the below give line below "fileInfoCollection": [],

 "modernBehavior": false,

Refer the screenshot below for better understanding:

Save the json file and reopen your project, you will be able to to see Screen Scraping and Data Scraping option.
Refer the below image for better understanding:

When dealing with non-Unicode windows applications, especially those using character encodings like EUC-KR, the standard text extraction methods like “FullText,” “Text Attribute,” and “Native” may not always provide accurate results. In such cases, Optical Character Recognition (OCR) becomes a practical solution.

Here are some recommendations for extracting text from non-Unicode windows applications with OCR in UiPath:

  1. Use the “Screen Scraping” Wizard
  2. Use Tesseract OCR Engine. You can configure UiPath to use Tesseract OCR specifically for non-Unicode text extraction. In the “Screen Scraping” wizard, choose Tesseract OCR and select the appropriate language (EUC-KR in your case).
  3. Fine-Tune OCR Settings depending on the application, you may need to fine-tune OCR settings such as the OCR engine, language, scale, and other parameters. Experiment with different settings to find the combination that provides the best accuracy for your specific application.

Regards

Hi @mountie,

Thanks for reaching out UiPath Community.

Can you try using screen scrapping?
while scrapping try to play with Profile and and scale.
It may get what you want.

Hope this will help.

Happy Automation,
@Vinit_Kawle