Extract Specific numbers from PDF

Justine · October 27, 2022, 3:04pm

How do I extract the highlighted text using IndexOf?

(Screenshot: Capture)
Below is the expression I have used so far:
strInvoiceText.Substring(strInvoiceText.IndexOf("PAID-UP ,ORDINARY SINGAPORE")+3869).Split(Environment.NewLine.ToCharArray)(0).Trim

I was trying to extract a specific string of numbers from a PDF file.

I tried the two regex expressions below:

System.Text.RegularExpressions.Regex.Match(strInvoiceText,"(?<=PAID-UP ,ORDINARY SINGAPORE, DOLLARS\s\s)(.*)\d(?=\s)").Value

System.Text.RegularExpressions.Regex.Match(strInvoiceText,"(?<=PAID-UP ,ORDINARY SINGAPORE, DOLLARS\s\s)(\d.+\d)").Value.Replace(" ","")

but somehow UiPath is not able to extract the value, so I resorted to using IndexOf.

Attaching the PDF file for reference.
1.pdf (136.1 KB)

Appreciate the help in advance!

supermanPunch · October 27, 2022, 6:29pm

Hi @Justine ,

Could you try using the regex below:

(?<=PAID-UP\s+,ORDINARY\s+SINGAPORE, DOLLARS\s+)[\d.,]+

Expression:

System.Text.RegularExpressions.Regex.Match(pdfText,"(?<=PAID-UP\s+,ORDINARY\s+SINGAPORE, DOLLARS\s+)[\d.,]+",RegexOptions.IgnoreCase).Value
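
Without the actual PDF text this is only a guess, but one plausible reason the earlier patterns returned nothing is that the whitespace in the extracted text is not exactly what the pattern spells out (for example a line break between the words of the label, or more than two spaces before the number). The sketch below illustrates the difference on a made-up sample string; the sample text, the amount in it, and the module and variable names are all assumptions for illustration, not taken from the attached file.

```vbnet
Imports System
Imports System.Text.RegularExpressions

Module RegexSketch
    Sub Main()
        ' Hypothetical sample: the amount is invented, and the label is split
        ' across a line break the way PDF text extraction sometimes does it.
        Dim pdfText As String = "PAID-UP ,ORDINARY" & Environment.NewLine &
                                "SINGAPORE, DOLLARS   1,000.00 " & Environment.NewLine &
                                "next line"

        ' Earlier attempt: literal single spaces inside the label and exactly
        ' two whitespace characters before the number.
        Dim strictPattern As String =
            "(?<=PAID-UP ,ORDINARY SINGAPORE, DOLLARS\s\s)(\d.+\d)"

        ' Suggested pattern: \s+ accepts any run of spaces, tabs or line breaks,
        ' and [\d.,]+ takes only digits, dots and commas.
        Dim flexiblePattern As String =
            "(?<=PAID-UP\s+,ORDINARY\s+SINGAPORE, DOLLARS\s+)[\d.,]+"

        Dim strictMatch As Match = Regex.Match(pdfText, strictPattern)
        Dim flexibleMatch As Match =
            Regex.Match(pdfText, flexiblePattern, RegexOptions.IgnoreCase)

        Console.WriteLine("Strict pattern matched: " & strictMatch.Success.ToString())
        Console.WriteLine("Flexible pattern value: " & flexibleMatch.Value)
    End Sub
End Module
```

On this sample the strict pattern should find no match, because the label contains a line break where the pattern expects a single space, while the flexible pattern should return only the number. Inside a UiPath Assign activity, RegexOptions may need to be written as System.Text.RegularExpressions.RegexOptions if that namespace is not imported in the workflow.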

Let us know if this doesn’t work.

Justine · October 28, 2022, 1:08am

Hi @supermanPunch, it works perfectly. Thanks for the help!

