Extract multiple PDF data and store into Excel using String operations

Hey all,
I want to extract data from multiple PDF files and store into excel sheet using Regular expression and string operations.

I have already found a similar post, but I don’t know how to apply the one published there to my requirements.

Here is the link to the similar post:

On the picture you can see what information is needed from the PDF’s and as a note next to it you can see the Excel columns into which the values are to be extracted. As a note, if this is important, you should know that the values for column name: “Bruttogehalt” are always between 40000 and 90000 and the values for column name: “Bruttobonus” are always between 0 and 2000".

In the attached files you will find my previous workflow, the described image with the annotations and all PDF files to be read.

I hope you can help me and thank you very much in advance :slight_smile:

Projekt.zip (2.3 MB)

In your loop, insert a new array to the row as follows

{ System.Text.RegularExpressions.Regex.Match(item, "(?<=Herrn )(\w+\s){2,3}").Value, System.Text.RegularExpressions.Regex.Match(item, "(?<=Vertragsnummer: )\d+).Value, System.Text.RegularExpressions.Regex.Match(item, "(?<=Jahresgrundgehalt\svon\s)\d+").Value, System.Text.RegularExpressions.Regex.Match(item, "(?<=hrlich\s)\d+").Value }