How to extract the data next to a repeating number in a PDF

Hi All,
I have a 5-page PDF with multiple columns, such as date, time, number, and duration.
The number repeats, so I want to extract the data next to each occurrence of that number. How can I do this?

For example, the mobile number 6026050250 appears several times in the number column, and the duration is written next to it each time. I want only the duration. Can you guide me?

Hi,
Use the Read PDF activity, which returns the document text as a string, and then manipulate that string. You can extract all the lines you want with the Matches activity and a regular expression, and afterwards remove the fields you don't need.
Attached is an example that returns all the lines that start with five characters, a space, and three characters (for example, 12345 009).
ExampleExtractFromPDF.xaml (6.5 KB)
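
For anyone who cannot open the attachment, here is a minimal sketch of the same regex idea in Python, not the contents of the .xaml file. It assumes the PDF text has already been read into a string, that each record sits on one line, and that the duration is the next whitespace-separated value after the number. The sample text, its column layout, and the function name durations_for are made up for illustration.

```python
import re

# Hypothetical sample of the text a PDF-to-text step might return.
# The real column order, separators, and duration format are assumptions.
sample_text = """\
01/02/2020 10:15:30 6026050250 00:02:15
01/02/2020 11:20:05 6021234567 00:00:45
02/02/2020 09:05:10 6026050250 00:10:03
"""


def durations_for(text, number):
    """Return the value that follows each occurrence of `number` on its line."""
    pattern = re.compile(
        r"(?<!\d)"             # the number must not be the tail of a longer number
        + re.escape(number)    # the number itself, matched literally
        + r"(?!\d)"            # ...and must not be the head of a longer number
        + r"[ \t]+(\S+)"       # spaces or tabs, then capture the next field
    )
    return pattern.findall(text)


print(durations_for(sample_text, "6026050250"))  # ['00:02:15', '00:10:03']
```

If the columns in your PDF come out in a different order, or the duration contains spaces, the capture group at the end of the pattern would need to change. The same pattern should also work as the regex in a workflow, since the lookarounds used here are supported by the .NET regex engine as well.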
