Get lines from text and filter the lines


#1

Hi, I am reading a PDF with OCR and writing the result to a text file (basically everything is chunked together). Some lines in it follow a repeating pattern. For example, a line starts with "Apple", ends with "Ok", and has other words in the middle that vary from line to line. Right now I can only get one such line using the index function, but I would like to get all of the repeating lines too. Any suggestions?

Thank you xoxo


#2

Hi Sanga,

I would suggest using Regex to find your strings, in particular the “Matches” activity.

This activity returns a collection containing all of the matches found in the input string, so I think it should fit your needs.

Best,

Evan


#3

@Evan_Bruns, that seems doable, but I am not sure what to write for the Pattern input.


#4

test78.xaml (14.0 KB)

Please follow this example and let me know if this works for you.
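
In case the attachment is not at hand, here is a rough sketch of the kind of pattern that could go into the Pattern input. It is written in Python only to show the idea and is not taken from the attached workflow. The sample text and the names `ocr_text` and `find_lines` are made up for illustration. It assumes each repeating entry sits on its own line in the OCR output, starts with "Apple", and ends with "Ok".

```python
import re

# Hypothetical OCR output; the real text would come from the PDF.
ocr_text = (
    "Invoice header\n"
    "Apple 12 boxes shipped Ok\n"
    "Some unrelated line\n"
    "Apple 7 crates pending review Ok\r\n"
    "Apple without the closing word\n"
    "Footer"
)

# With MULTILINE, ^ and $ anchor to the start and end of each line.
# ".*" covers the varying middle part and never crosses a line break.
# "[ \t\r]*" tolerates trailing spaces or a stray carriage return from OCR.
LINE_PATTERN = re.compile(r"^Apple\b.*\bOk[ \t\r]*$", re.MULTILINE)


def find_lines(text):
    """Return every line that starts with 'Apple' and ends with 'Ok'."""
    return [m.group(0).strip() for m in LINE_PATTERN.finditer(text)]


matching_lines = find_lines(ocr_text)
for line in matching_lines:
    print(line)
```

The same pattern text, `^Apple\b.*\bOk[ \t\r]*$`, should also be valid .NET regex syntax, so it can probably be pasted into the Matches activity as long as the Multiline regex option is turned on. Without that option, `^` and `$` only match at the very start and end of the whole text. If the OCR output really is one long string with no line breaks, a non-greedy pattern such as `Apple.*?Ok` may be a better starting point.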