Extracting data from a PDF using regex

Hi all,
Please check the attached text file. I want to extract the vendor names from it, for example:
1) BATCHAS SLEEP ZONE
2) JYOTI ENTERPRISES
3) SAI THIRUMALA FURNITURE
The customer reference number repeats after each vendor name. Is there any way to extract just the vendor names?
Data Extraction.txt (357 Bytes)

Try this

Extract Vendor Name.xaml (6.0 KB)

Hi @Harish_pavuluri ,

Could you try this Regex and see if it works out for you?

System.Text.RegularExpressions.Regex.Matches("YourString","(?<=FROM\s)[A-Z\s]+(?=\s)")

Kind Regards
Ashwin A.K

Hi,

Another solution:

System.Text.RegularExpressions.Regex.Matches(yourString,"(?<=FROM\s).+(?=\s\w+\s\w+\sREF NO)")

Main.xaml (7.6 KB)

Regards,

Thanks to all.
I will check all of the code.

Thanks for the reply.
The output is coming out like this: "System.Text.RegularExpressions.MatchCollection"
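
A note on that output: Regex.Matches returns a MatchCollection, and writing the collection itself to a string shows only its type name. The matched text is in the Value property of each Match inside it. Below is a minimal VB.NET sketch of reading those values. The sample input is hypothetical (the attached file is not shown in this thread, so the lines only imitate a "FROM <vendor> ... REF NO" layout), and the variable names are illustrative.

```vb
Imports System
Imports System.Linq
Imports System.Text.RegularExpressions

Module VendorNameDemo
    Sub Main()
        ' Hypothetical input: invented lines that only imitate the layout
        ' implied by the patterns in this thread (FROM <vendor> ... REF NO).
        Dim yourString As String =
            "FROM BATCHAS SLEEP ZONE 1001 CUSTOMER REF NO 1" & Environment.NewLine &
            "FROM JYOTI ENTERPRISES 1002 CUSTOMER REF NO 2"

        ' Same pattern as in the reply above.
        Dim matches As MatchCollection =
            Regex.Matches(yourString, "(?<=FROM\s).+(?=\s\w+\s\w+\sREF NO)")

        ' matches.ToString() would give only the type name,
        ' so read Value from each Match instead.
        Dim vendorNames As String() =
            matches.Cast(Of Match)().Select(Function(m) m.Value.Trim()).ToArray()

        ' Write each extracted vendor name on its own line.
        For Each vendorName As String In vendorNames
            Console.WriteLine(vendorName)
        Next
    End Sub
End Module
```

In a workflow, the same idea should apply without the console code: store the result of Regex.Matches in a MatchCollection variable, then either loop over it with a For Each and use item.Value, or read a single result with matches(0).Value once you have checked that matches.Count is greater than zero.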

Thank you.

Your code is also working fine.
