could you please advise:
I need to get some information from pdf attachments (various customers, so each pdf can vary) like Invoice no, Delivery date, Amount etc.
I am able to split pdf to lines (with output.Split(Environment.NewLine.ToArray, StringSplitOptions.RemoveEmptyEntries))
I can find for example “Delivery date” string and get delivery date, but my questions are:
“Delivery date” might 3 times on pdf, how to find 1st occurrance and get date next to it?
“Delivery date” string and the date as such can be always in different structure on pdf, like right next to the string, or right below, or at the end of the row (due to different customers) - how to assure that I always get the date? Do I need to create code for each variation?