Extract PDF Regex Match

We are extracting the text from PDF using Read PDF Text activity. We would like to capture ONLY the score of Quality ,Delivery ,Cost ,Technology from pdf text output.

Score will be always next to 25% (highlighted in BOLD) and we wants to capture only the score and ignore remaining text.

Note : Quality always starts with 1) , delivery starts with 2) , Cost starts with 3) and Technology starts with 4)

Please help me to create regex to capture the score of each below element ?

  1. Quality 25% 83 A 50

  2. Delivery 25% 83 A 0

  3. Cost 25% 17 C 68 Improvement Plan Required

  4. Technology 25% 62 B 0 Improvement Plan Required

1 Like

Hi,

Can you try the following sample?

m = System.Text.RegularExpressions.Regex.Match(yourString,"Quality 25% (?<QUALITY>\d+)[\s\S]*?Delivery 25% (?<DELIVERY>\d+)[\s\S]*?Cost 25% (?<COST>\d+)[\s\S]*?Technology 25% (?<TECHNOLOGY>\d+)")

note: m is System.Text.RegularExpressions.Match type.

Sequence.xaml (6.4 KB)

Regards,

2 Likes

Sure…Let me try. Thank you

This topic was automatically closed 3 days after the last reply. New replies are no longer allowed.