Regular expression - PDF Data Extraction

Hi, I have attached the pic of pdf . I need to extract all the values by row by row. can anyone help on this.


Extracted Text file:
extracted text.txt (226 Bytes)
if there is no value, the expected output will be ^“”^“”^“”^“$”^“” Here the 2nd row and 3rd row doesn’t have any value. so it should print like this.

Hi @Lalitha_Selvaraj

Try this

image

Sequence7.xaml (23.7 KB)

Regards,

Hi @lchirathapudi . Row 1 output should be like this “ST DAVID ## Yes ## No 123 $250,000 2015”. the entire row value should get printed .

@Lalitha_Selvaraj

How about the following?

Sequence7.xaml (23.7 KB)

Regards,

1 Like

Let me check and inform you back.

1 Like

Since Im not able to open the .xaml file. can you please send the pic of regex and variable types which you have used in the .xaml.

@Lalitha_Selvaraj

Check this

Pattern : "(.*Yes.*)"
Match = If(System.Text.RegularExpressions.Regex.IsMatch(currentMatch.ToString(), "(^[A-Z].*)"), System.Text.RegularExpressions.Regex.Match(currentMatch.ToString(), "(^[A-Z].*)").Value, "^""^""^""^""$^""")

Regards,

Try using Computer Vision(CV) Activities, use extract data for this type of texts.

Hi. I dont find “find matching pattern” activity in activities panel.

@Lalitha_Selvaraj

Search Matches activity

Regards,

Its working fine. can you pls tell how to print output values in csv file.

@Lalitha_Selvaraj

image

Output:

image

Regards,


image
image
Please look into this

@Lalitha_Selvaraj

Place the Build Data Table activity above for each activity and check Include headers option

image

Regards,

1 Like

It’s working fine. Thanks.

1 Like

This topic was automatically closed 3 days after the last reply. New replies are no longer allowed.