Replace some chinese or japanese characters which is in description column

manjesh_kumar · December 18, 2023, 5:52am

Hello All,

I downloading data from SAP and uploading data into Oracle database, though I was successful however when I am trying to read the data in Power Bi I am getting an error

ORA-29275: partial multibyte character

Upon checking I saw in one of the columns the description contains non English description looks like chinese or Japanese characters, how can I replace these with null values.

二极管
TYRES
集成电路
多元件集成电路

Though the description column CAN contain the following characters.

**a-z A-Z 0-9 !@#$%^&()-_=+[]{};:',.<>/?`~*

I tried the following in a for each row in a data table and then an Assign activity, however I was not successful.

System.text.RegularExpressions.Regex.Replace(CurrentRow(“Description”).ToString,“^[a-z A-Z 0-9 !@#$%^&*()-_=+{};:',.<>/?`~]”,“”)

Appreciate your feedback for the same.

Regards,
Manjesh

neha.upase · December 18, 2023, 8:53am

Hi,
You can try below query

System.Text.RegularExpressions.Regex.Replace(CurrentRow(“Description”).ToString, “[^a-zA-Z0-9!@#$%^&*()-_=+{};:',.<>/?`~\p{IsBasicLatin}\p{IsLatin-1Supplement}]”, “”)

Yoichi · December 18, 2023, 9:33am

Hi,

Do you have any error in the above expression? Or still have ORA-29275?
At least, as there are some special regex characters such as “-”, “[” in the above pattern, it’s necessary to escape as the following.

System.text.RegularExpressions.Regex.Replace(CurrentRow(“Description”).ToString,"^[ a-zA-Z0-9!@#$%^&*()\-_=+\[\]{};:',.<>/?`~]","")

Regards,

N_Mounika · December 18, 2023, 1:03pm

Hi,

I hope this may help you.
Main.xaml (6.8 KB)

Regards,
Mounika

Mariemily_Silva · December 18, 2023, 1:50pm

manjesh_kumar:

Hello All,

I downloading data from SAP and uploading data into Oracle database, though I was successful however when I am trying to read the data in Power Bi I am getting an error

ORA-29275: partial multibyte character

Upon checking I saw in one of the columns the description contains non English description looks like chinese or Japanese characters, how can I replace these with null values.

二极管
TYRES
集成电路
多元件集成电路

Though the description column CAN contain the following characters.

a-z A-Z 0-9 !@#$%^&*()-_=+[]{};:',.<>/?`~

I tried the following in a for each row in a data table and then an Assign activity, however I was not successful.

System.text.RegularExpressions.Regex.Replace(CurrentRow(“Description”).ToString,“^[1]”,“”)

Appreciate your feedback for the same.

Regards,
Manjesh

It looks like you’re trying to replace non-English characters in the “Description” column with null values. Your attempt with regular expressions seems to be on the right track, but there are a couple of adjustments needed. The regular expression you provided is designed to match only the characters specified in the square brackets, so it would remove any character that is not in that list.

Here’s an updated version of your regular expression to remove non-English characters:

System.Text.RegularExpressions.Regex.Replace(CurrentRow(“Description”).ToString, “[^a-zA-Z0-9 !@#$%^&*()-_=+{};:',.<>/?`~]”, “”)

Changes made:

Removed the space between “a-z” and “A-Z” to include all letters in one character class.
Added the caret (^) inside the square brackets to negate the character class, meaning it will match any character not in the specified list.

This regular expression should replace any character in the “Description” column that is not a letter, number, or the specified special characters with an empty string.

Please note that this will remove all non-English characters, so make sure it aligns with your requirements. If you want to replace non-English characters with null values specifically, you can modify the expression accordingly.

a-z A-Z 0-9 !@#$%^&*()-_=+{};:',.<>/?`~ ↩︎

manjesh_kumar · December 19, 2023, 3:19am

dear @Mariemily_Silva ,

Thank you very much, it worked as expected.

Regards,
Manjesh

manjesh_kumar · December 19, 2023, 3:21am

@Yoichi

I was able to detect the actual root cause, there was another column where there was extra spaces after & before the text which was causing the issue.

Thank you for the support as always.

Regards,
Manjesh

system · December 22, 2023, 3:21am

This topic was automatically closed 3 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
How can I get the string.length meet Oracle system requirement when the string contains both chinese characters and english characters? Studio studio , question , activities_panel	10	1310	December 25, 2021
How to smartly remove some special characters and some strings (if those exist) from column data in a datatable? Help	3	3634	July 18, 2019
How can get substring when the string have both Chinese characters and English characters to meet Oracle requirements? Studio studio , question , activities_panel	3	1850	December 25, 2021
[Database][Oracle]-The query results data in the Japanese language which is encoded as "?" in the output within UiPath Knowledge Base activities	0	519	January 3, 2023
検索結果のセル位置から列移動した位置の値を取得する方法フォーラム	3	1974	November 9, 2021

Replace some chinese or japanese characters which is in description column

Related topics