RegEx for extracting value from string with trim

agnesv · February 11, 2020, 2:13pm

Hello! I have a string with a lot of information that I need to extract a value from.

The string contains a lot of information from a website, and the values in it look like this:

“Number 1. Amount: 43 dollars”
“Number 2. Amount: 22 dollars”
“Number 3. Amount: 23 dollars”
etc.

I only want the value for Number 3. What I want to extract is only (in this example) 23 dollars. The amount can vary, and so can the order of the list (Number 1, 2, 3, etc.). So what I want to do is use a regex that starts with "Number 3. Amount: " and trims the rest.

My current code takes everything on the rest of the page, starting from Number 3. and ending with the last time "dollars" occurs. Can someone please help me alter it so that it trims the rest instead?

System.Text.RegularExpressions.Regex.Match(EntirePage, "((?<=Number 3. Amount:).*(?= dollars))").Value

bcorrea · February 11, 2020, 2:19pm

Hi, welcome to the community!
This seems too simple to need regex. If the part you need to remove is always the same, I recommend just removing it from the string, like this:
result = "Number 3. Amount: 23 dollars".Remove(0, 17)

agnesv · February 11, 2020, 2:23pm

Hi! I posted a simplified version of the string. The only thing that is unique is that I want to extract the value after "Number 3. Amount:" (its size and the values around it can vary).

What does 17 indicate?

bcorrea · February 11, 2020, 2:25pm

Remove function => 0 is the position to start deleting and 17 is how many chars to remove.
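
For readers following along, the effect of Remove(0, 17) can be sketched in Python with slicing. The `remove` helper below is hypothetical and only mimics the .NET method; it also assumes the prefix always sits at the very start of the string, which is not the case for the full page text discussed later in the thread.

```python
# Python stand-in for the .NET call "...".Remove(0, 17):
# delete 17 characters starting at index 0.
text = "Number 3. Amount: 23 dollars"
prefix = "Number 3. Amount:"  # exactly 17 characters, colon included


def remove(s: str, start: int, count: int) -> str:
    """Hypothetical helper that mimics String.Remove(start, count)."""
    return s[:start] + s[start + count:]


result = remove(text, 0, len(prefix))
print(repr(result))    # the space after the colon is still present
print(result.strip())  # strip() drops that leading space
```

Because the count stops at the colon, the remainder keeps a leading space, so a trim is still needed afterwards.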

MaxyArthes · February 11, 2020, 2:27pm

I'm pretty new to regex, but I looked online for some examples and came up with this:

(?<=Number 3. Amount: )(.*)(?= dollars)

agnesv · February 11, 2020, 2:28pm

What if I don't know how many chars to remove?
What I mean is, the values could vary from time to time. Sometimes it could be:

Number 2: Amount 24 dollars
Number 5: Amount 26 dollars
Number 8: Amount 22 dollars
Number 3: Amount 55 dollars
(could go on)

and another could be:

Number 1: Amount 22 dollars
Number 3: Amount 14 dollars
Number 8: Amount 43 dollars

agnesv · February 11, 2020, 2:29pm

This is why I want to trim everything after the value: if the end marker is not unique, the match takes everything up to the last time "dollars" occurs.

If this is the string:

Number 1: Amount 22 dollars
Number 3: Amount 14 dollars
Number 8: Amount 43 dollars

The result would be this:

14 dollars Number 8: Amount 43 dollars

supermanPunch · February 11, 2020, 2:30pm

@agnesv Is it a . or : after Number 3?

bcorrea · February 11, 2020, 2:31pm

None of the examples you gave had a number different from 17…

agnesv · February 11, 2020, 2:33pm

Sorry, I wasn't being specific. It's actually neither. This is the exact format:

Number 3 Amount:0,23 dollars

(Just an example, Amount can vary)

supermanPunch · February 11, 2020, 2:36pm

@agnesv Try this: (?<=Number 3 Amount:)\s*(.*)(?=dollars) Actually, your original regex is correct; you just needed to modify it a bit.

MaxyArthes · February 11, 2020, 2:39pm

Is that better then? It stops as soon as it hits a letter.

agnesv · February 11, 2020, 2:53pm

I’ve tried both the alternatives now, but both times it just starts with the right value and then takes everything that comes after. Does anything look wrong with either of them?

System.Text.RegularExpressions.Regex.Match(EntirePage, "(?<=Number 3 Amount:)\s(.*)(?= dollars)").Value

System.Text.RegularExpressions.Regex.Match(EntirePage, "(?<=Number 3 Amount:)(.*)(?= [^a-z] *)").Value

supermanPunch · February 11, 2020, 2:55pm

@agnesv Why do you use . after Number 3?

agnesv · February 11, 2020, 2:56pm

Haha, sorry. The string actually contains something else, but I cannot write it out here, so I've had to modify it. I will edit the post to show the correct format.

supermanPunch · February 11, 2020, 2:56pm

@agnesv Can you share the text file?

agnesv · February 11, 2020, 3:00pm

I cannot share it, but the only difference between my example and the real one is that the string I have does not contain rows. It's structured this way:

Number 6 Amount:44 dollars Number 3 Amount:0,45 dollars Number 8 Amount:9,5 dollars Number 4 Amount:88 dollars

Could that be why?
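
The missing line breaks do matter here. By default `.` does not match a newline, so a greedy `.*` stops at the end of a row, but on a single line it runs to the end of the text and then backs up to the last " dollars". A minimal Python sketch of that behaviour follows, using the single-line sample above; the variable names are illustrative, and the same patterns can be passed to Regex.Match.

```python
import re

# Single-line sample modelled on the structure described in the thread.
page = ("Number 6 Amount:44 dollars Number 3 Amount:0,45 dollars "
        "Number 8 Amount:9,5 dollars Number 4 Amount:88 dollars")

# Same text, but with one entry per row (assumed layout of the earlier examples).
rows = page.replace(" dollars ", " dollars\n")

greedy_pattern = r"(?<=Number 3 Amount:).*(?= dollars)"
lazy_pattern = r"(?<=Number 3 Amount:).*?(?= dollars)"

# Greedy .* on one line: runs up to the LAST " dollars" in the text.
greedy_single = re.search(greedy_pattern, page).group()

# Greedy .* on rows: '.' cannot cross the newline, so it stays on the row.
greedy_rows = re.search(greedy_pattern, rows).group()

# Lazy .*? on one line: stops at the FIRST " dollars" after the prefix.
lazy_single = re.search(lazy_pattern, page).group()

print(greedy_single)
print(greedy_rows)
print(lazy_single)
```

Making the quantifier lazy (`.*?`) is one way to stop at the first " dollars"; restricting the allowed characters is another.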

supermanPunch · February 11, 2020, 3:01pm

@agnesv Yes. You should give the correct information.

supermanPunch · February 11, 2020, 3:02pm

@agnesv Try this:
(?<=Number 3 Amount:)\s*[0-9.,]*(?= dollars)

agnesv · February 11, 2020, 3:05pm

That worked! Thanks a million
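
The accepted pattern works because `[0-9.,]*` can only consume digits, dots, and commas, so the match cannot run past the amount into the next entry. As a rough sketch, the same idea can be wrapped in a small function that looks up any entry number. This is written in Python for illustration; `amount_for` is a hypothetical helper, and the sample text is the single-line structure given earlier in the thread.

```python
import re
from typing import Optional

page = ("Number 6 Amount:44 dollars Number 3 Amount:0,45 dollars "
        "Number 8 Amount:9,5 dollars Number 4 Amount:88 dollars")


def amount_for(number: int, text: str) -> Optional[str]:
    """Return the amount that follows 'Number <number> Amount:', or None."""
    # Lookbehind anchors on the prefix, the character class limits the match
    # to the amount itself, and the lookahead requires " dollars" right after.
    pattern = rf"(?<=Number {number} Amount:)\s*[0-9.,]*(?= dollars)"
    match = re.search(pattern, text)
    # strip() removes any whitespace that \s* picked up before the amount.
    return match.group().strip() if match else None


print(amount_for(3, page))
print(amount_for(8, page))
print(amount_for(7, page))  # no such entry in the sample
```

The value comes back as text; any conversion to a number (for example handling the comma as a decimal separator) is left to the caller.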
