Matches activity problem

So I need to extract the contents of a pdf file and divide it by lines. I do that by reading it and then for the line division i use a regex ( (?<=[0-9]{2}.[0-9]{2}.[0-9]{4}\s[0-9]{2}.[0-9]{2}.[0-9]{4}).*?(?=[0-9]{2}.[0-9]{2}.[0-9]{4}\s[0-9]{2}.[0-9]{2}.[0-9]{4}) ).The regex is correct but the matches activity remains empty. Can anybody suggest a solution?

Hi @velizar,

Can you provide your PDF if possible and the expected output ?

Hi Unfortunately no because it contains very sensitive info but essentially i know that a line ends with a double date and starts with a double date

Understand that, if possible a single line will be sufficient.

something like this:09.09.2009 09.09.2009 some text
09.09.2009 09.09.2009 some more text
09.09.2009 09.09.2009 even more\r\n09.09.2009 09.09.2009 even even moree \r\n09.09.2009 09.09.2009 dsfsdfm,m,5, 546
01.02.2019 01.02.2019 Операция No: ****** stuff .* 0.00
04.02.2019 04.02.2019 Операция No: ****** more stuff*.** .*
04.02.2019 04.02.2019 Операция No: ****** some stuff

But uipath replaces newlines with \r\n

So what are you expecting from this input

To return some text,some more text,even more etc.

And i put the text from uipath (Got from the debugger ctr+c the variable) into regex101 and that regex highlights the things i need uipath also does in the regex builder. However when it comes to working it outputs an empty array

Hi @velizar,

Refer the below workflow, i had stored the above input in a text file and see the corresponding output.

Main (25).xaml (6.9 KB)

1 Like

This topic was automatically closed 3 days after the last reply. New replies are no longer allowed.