Hello,
Can someone help me extract information from a PDF? I know there are a lot of examples, but I can't find the right one.
I have a .pdf invoice file containing a table with 7 columns and 4 rows, and I need to extract it. The steps I have done so far:
- Read PDF Text (output variable: OutPdfText)
- OutPdfText.Split(Environment.NewLine.ToArray,System.StringSplitOptions.RemoveEmptyEntries)
- Then assign the variables From and To
- In the Matches activity I used: Input: OutPdfText, Pattern: "(?<=" + From + ")" + "(.*)" + "(?=" + To + ")"
The pattern should capture everything from near the top to almost the bottom of the file text, but the result is empty.
If I set the From and To variables so that they only cover the first line, it works perfectly, but when I try to get text spanning the whole document it finds nothing. I think the problem is with the new lines.
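
To show what I mean about the new lines, here is a minimal sketch in plain VB.NET, outside UiPath. The sample text and the marker values are made up for illustration and are not from my real invoice. It relies on the documented .NET behaviour that "." does not match a line feed unless RegexOptions.Singleline is set. I have not confirmed this is the cause in my workflow, but it reproduces the same symptom:

```vb
Imports System
Imports System.Text.RegularExpressions

Module NewlineRegexDemo
    Sub Main()
        ' Hypothetical stand-in for OutPdfText: three lines separated by Environment.NewLine.
        Dim outPdfText As String = "Invoice 42" & Environment.NewLine &
                                   "Item Qty Price" & Environment.NewLine &
                                   "Total 100"

        ' Hypothetical From/To markers. Regex.Escape protects any special characters in them.
        Dim fromText As String = Regex.Escape("Invoice 42")
        Dim toText As String = Regex.Escape("Total")

        ' Same shape as the pattern in my Matches activity, with a lazy quantifier.
        Dim pattern As String = "(?<=" & fromText & ")(.*?)(?=" & toText & ")"

        ' Default options: "." cannot cross a line feed, so a span over several lines fails.
        Dim plain As Match = Regex.Match(outPdfText, pattern)
        Console.WriteLine("Default options matched: " & plain.Success.ToString())

        ' Singleline: "." also matches line feeds, so the multi-line span can be found.
        Dim multi As Match = Regex.Match(outPdfText, pattern, RegexOptions.Singleline)
        Console.WriteLine("Singleline matched: " & multi.Success.ToString())
        Console.WriteLine(multi.Value.Trim())
    End Sub
End Module
```

If this really is the cause, the equivalent inside UiPath would presumably be selecting Singleline in the RegexOption property of the Matches activity, or starting the pattern with the inline flag "(?s)".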