Which scrapping methods to get non-unicode text?

mountie · January 6, 2024, 11:44am

I can not see OCR method than
fulltext,
text attribute,
native,

when the windows application is non-unicode, (actually EUC-KR)
then
what is best recommendation to get text from non-unicode windows application?

vrdabberu · January 6, 2024, 11:51am

Hi @mountie

You can use Screen Scraping Method to extract the uni-code. Since in the latest update Claasic experience has been removed so follow the below process to get Screen Scraping option.

Open your Project.json file of your project and add the below give line below "fileInfoCollection": [],

 "modernBehavior": false,

Refer the screenshot below for better understanding:

Save the json file and reopen your project, you will be able to to see Screen Scraping and Data Scraping option.
Refer the below image for better understanding:

When dealing with non-Unicode windows applications, especially those using character encodings like EUC-KR, the standard text extraction methods like “FullText,” “Text Attribute,” and “Native” may not always provide accurate results. In such cases, Optical Character Recognition (OCR) becomes a practical solution.

Here are some recommendations for extracting text from non-Unicode windows applications with OCR in UiPath:

Use the “Screen Scraping” Wizard
Use Tesseract OCR Engine. You can configure UiPath to use Tesseract OCR specifically for non-Unicode text extraction. In the “Screen Scraping” wizard, choose Tesseract OCR and select the appropriate language (EUC-KR in your case).
Fine-Tune OCR Settings depending on the application, you may need to fine-tune OCR settings such as the OCR engine, language, scale, and other parameters. Experiment with different settings to find the combination that provides the best accuracy for your specific application.

Regards

Vinit_Kawle · January 6, 2024, 11:54am

Hi @mountie,

Thanks for reaching out UiPath Community.

Can you try using screen scrapping?
while scrapping try to play with Profile and and scale.
It may get what you want.

Hope this will help.

Happy Automation,
@Vinit_Kawle

Topic		Replies	Views
How to copy visible texts only Help studio , data_scraping , question	4	1007	December 16, 2019
Full text vs Native text scraping Help data_scraping	2	26558	June 16, 2017
Text scrapping Help uiautomation , studio	15	18034	October 10, 2019
Best way to read Image on Screen Help studio	6	2659	May 16, 2019
Screen Scraping in uipath studio not able to scrap all data even if it do it showing encoded character Help studio	5	1012	December 27, 2018

Most Active Users - Yesterday
lrtetala
ashokkarale
yangyq10
esongi_park
pratiksha_gulhane
Muthuraj_Chidhambaram1
muhammet.sezer2
More details...

Which scrapping methods to get non-unicode text?

Related Topics