Regex to extract particular information from Pdf

Hi All,

Any help on the below

I want to extract particular text between two sections of pdf using Regex. What will be the regex expression?

I want to extract between "Performance and Milestone section and before “Payment” section.

I want to extract the Tabular data between “Performance and Milestone” section and “Payment section”

Hi @dutta.marina

Try this way,

But it will give only string output value.

(?<=Performance and Milestone section)(?s).*(?=Section 4 Payment)

image

Regards,
Gowtham K

@Gowtham_K115

Its not working

@dutta.marina in case if you want to extract the tabular data I would strongly recommend you to use any other method or services in UiPath like DU or some other integration services as Regex for this kind of scenario are not commendable or not reliable.

@dutta.marina Slightly modified format from @Gowtham_K115:

(?<=SECTION\s+3\s+Performance\s+and\s+milestone\s+Schedule).*(?=SECTION\s+4\s+Payment)

When you extract text from PDF enable formatted text option. After the first regex, you’ll have to do a lot more work to create something useful. Regex approach may not be the best here

@sppal.c

Trying with DU but form extractor is unable to extract the table. any help on this?