Regex expression to find value in block of text

Robert_Schauer · October 23, 2019, 8:00pm

Hello! I am reading in PDF data and need to pick out a specific value from a block of text. The value is static in it’s position within the block no matter what page I read in, however, the amount of characters is dynamic. The block of text I am working with is as follows:

Writing to a text file:

Residue mmbtu 3,787.11 100.00% 3,787.11 $1.623163 $6,147.10 
Total 3,787.11 3,787.11 $6,147.10

From UiPath message:

Residue mmbtu 3,787.11 100.00% 3,787.11 $1.623163 $6,147.10 \r\nTotal 3,787.11 3,787.11 $6,147.10 \r\n

I am trying to pull out the second dollar amount so in this example I should get 6,147.10 as an answer.

Using a regex expression generator (https://regexr.com/) I am able to pull the value out successfully using the expression: (?<=$)\S+$|(?<=$)\S+ $

However, when I try it in UiPath by assigning a variable to the code: system.Text.RegularExpressions.Regex.Match(str_ResVolAmt,“(?<=$)\S+$|(?<=$)\S+ $”).ToString, no value is pulled out.

SowmyaLeo · October 23, 2019, 8:24pm

@ Robert_Schauer
Welcome to the community.

Hope this helps.
RegEx.xaml (6.0 KB)

Robert_Schauer · October 23, 2019, 8:34pm

Thank you for jumping on this A couple of quesitons for follow up:

I need to build the processing using the enterprise version of UiPath, so is the UiPath.Core.Activities.Matches activity part of that edition?
The expression successfully parsed out the values after every $ and assided them to each index. I have a more advanced problem where the position or index of the value might change. The example I have is as follows:

Fees 
Description Fee Unit Fee Quantity Fee Rate Fee Value 
Electric gal 15,524.01 0.011391 $176.83 
Low Volume USD 0.00 400.000000 $0.00 
Marketing Fee gal 15,524.01 0.057579 $893.85 
Marketing Fee mmbtu 3,787.11 0.046063 $174.45 
Processing mmbtu 5,690.48 1.957675 $11,140.11 
Transportation gal 15,524.01 0.000000 $0.00 
Total $12,385.24

I need to pull 174.45 from the line that starts with “Marketing Fee mmbtu”, but the position of that line will change occasionlly. Any thoughts?

SowmyaLeo · October 23, 2019, 8:44pm

Hi,

Answer to Question1: Matches is availble in the Enterprise Edition.
Question 2: Is it data from the Fee value column that you need from all rows?

Robert_Schauer · October 23, 2019, 8:50pm

@SowmyaLeo,

Yes, techincally I do need to pull values from the “Fee Value” column. I want to be careful saying that however, becasue I need value from that column as well as a select amount of lines such as “Marketing Fee mmbtu”.

SowmyaLeo · October 23, 2019, 9:02pm

Is it possible to share a sample pdf please.

Robert_Schauer · October 23, 2019, 9:14pm

Apparently new users don’t have the ability to post files, so hopefully the code below works!

Robert_Schauer · October 23, 2019, 9:16pm

@SowmyaLeo,
The code did not work, so here is an image for now at least:

Dave · October 23, 2019, 9:30pm

@Robert_Schauer The following expression would pull out all non-white space characters from the line containing “Marketing Fee mmbtu”: (?<=Marketing Fee mmbtu.*\$)\S+

Assign YourNumber = Regex.Match(YourInputString,"(?<=Marketing Fee mmbtu.*\$)\S+",RegexOptions.IgnoreCase)

Add system.text.regularexpressions to your imports tab so you don’t have to type it in each time

Robert_Schauer · October 23, 2019, 9:40pm

@Dave,

Brilliant! That worked, and additially, I should just be able to change the string that is being used for the positive lookbehind function to find my other values. @SowmyaLeo thank you for your help as well!

system · October 26, 2019, 9:41pm

This topic was automatically closed 3 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Get a Block of text from start of a word until the same text is not found Studio uiautomation , activities , data_scraping , string , question	4	955	January 21, 2021
Find specifig word and catch following then send to excel in batch Studio uiautomation	5	1192	July 20, 2021
Regular expression works in regexr.com and UiPath matches activity but end result is blank Activities uiautomation , studio	9	1828	March 31, 2021
Regex to get specific value Help	11	1169	September 26, 2019
UiPath only able to read blocks of text in PDF instead of specific values Help uiautomation , studio	7	2430	October 24, 2019

Most Active Users - Yesterday
ashokkarale
Anil_G
Yoichi
yangyq10
postwick
chandreshsinh.jadeja
aravindbalineni123
Parvathy
aya
PRASHANT_GABHANE
More details...

Regex expression to find value in block of text

Related Topics