How to split a string based on a certain format using Regex?

Psyence · December 15, 2019, 9:56am

hey folks!

i’ve been working on a process that performs a data entry function after i’ve extracted certain key information from a string of data given in a statement.

So far, i’ve been able to get some success by breaking down my string manipulation into a few steps i.e. first splitting the string by spaces, then searching for the keyword that i need from the array and further split them up before i validate the information i need using regex statements. However, it seems like there’s a more efficient way of doing this by splitting and validating the information at the same time using regex statements.

A sample of the type of string i receive would be:

Inward PayWave LBC T17FC0060B PTE 01 SM3P191118448667 C110013055337 OTHER ABC ENGINEERING PTE. LTD. AUD 600

And the only information that i need starts from “LBC” and “T17FC0060B PTE 01”. Although instructions have been given to users to enter this in a format of LBC-<9/10 alphanumeric characters>-PTE-01 but as the source comes from a vendor, we have no control over how these information gets entered.

Thanks to the help of other users in this forum, i’ve been able to validate after splitting such entries with the following regex statement: System.Text.RegularExpressions.Regex.IsMatch(strData,“^\w{9,10}-?PTE-?0\d$”).

However, by first splitting this string by spaces, i would have failed to validate entries that were entered using the format given in the sample above. Therefore, would then be possible to split the chunk of string according to “^\w{9,10}-?PTE-?0\d$” instead?

many many thanks in advance!

Palaniyappan · December 15, 2019, 10:30am

Hi

str_output = System.Text.RegularExpressions.Regex.Match(str_input,”(LBC).*(PTE\s\d+)”).ToString

This will give us that value like
LBC T17FC0060B PTE 01

Cheers @Psyence

lakshman · December 15, 2019, 10:39am

@Psyence

Try below Regular expression.

              requiresStr = System.Text.RegularExpressions.Regex.Match(inputStr,"LBC\s*\w+\sPTE\s\d{2}").ToString

Psyence · December 16, 2019, 3:38am

Hi both! Thanks for the replies!

Both statement works however, the correct format that users are supposed to enter is along the lines of “LBCT17FC0060B-PTE-01” where LBC is connected to the first character and PTE and 01 are both separated by hyphens.

As this input i purely determined by our external users we have no control over what they key in. So i would like to cater to as many variations as i possibly can. Is there a way to amend this regex statement to cater for different variations like missing hypens, extra spaces etc.?

One way that i’ve worked out was to first remove hyphens and spaces then run and edited regex statement like:

requiresStr = System.Text.RegularExpressions.Regex.Match(inputStr,"LBC\w+PTE\d{2}").ToString*

Are there better methods to go about achieving what i want?

lakshman · December 16, 2019, 3:44am

@Psyence

Yes it also better solution.

But other alternative is to you can split it based on hypen and can read required values. I guess it will be easiest solution.

str = "LBCT17FC0060B-PTE-01"

str.Split("-“C)(0) - LBCT17FC0060B
str.Split(”-“C)(1) - PTE
str.Split(”-"C)(2) - 01

Palaniyappan · December 16, 2019, 4:54am

Yah of course
In same expression with Regex

str_output = System.Text.RegularExpressions.Regex.Match(str_input,”(LBC).+(PTE\W\d+)|(LBC).+(PTE\s+\d+)|(LBC).+(PTE\d+)”).ToString

Cheers @Psyence

Tanzill_Ahsan · January 26, 2021, 6:53pm

can you please share the xaml? or a ss?

Topic		Replies	Views
How to split a string based on Regex. But keep the "splitter" Studio studio , question , activities_panel	6	1869	October 22, 2021
Using split or regex Studio studio , question , output_panel	8	369	May 13, 2023
Extract a string using split, with strings as a delimiter Help activities	13	3108	July 1, 2019
Splitting a string into array by # and [] Academy Feedback lp_developer_fnd	4	324	July 20, 2023
Split a string based on Regex Help activities , regex	50	20746	October 29, 2018

Most Active Users - Yesterday
ashokkarale
Anil_G
Yoichi
yangyq10
postwick
chandreshsinh.jadeja
aravindbalineni123
Parvathy
aya
PRASHANT_GABHANE
More details...

How to split a string based on a certain format using Regex?

Related Topics