I’m working on an automation that extracts information from PDFs into an Excel file. I have already extracted the ‘easy’ fields with substrings and regex. My problem is with this table: when I convert the PDF to a text file, the table contents end up misplaced. I think I could use document understanding, but I’ve never worked with it.
What I want exactly is to extract the information inside the table and relate each value to its column and row. The tables I work with have a varying number of rows, but the number of columns stays the same. So I want to extract, for instance, “Procedimiento VFR Incumplido” and record that it belongs to the column “AERONAVE”. Another example: extract “Separación inadecuada” and relate it to “CONSECUENCIAS”.
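To make the goal concrete, here is a minimal sketch in Python of the result I’m after. It is not my actual workflow: it assumes the rows have already been read correctly as lists of cells, the function name `label_rows` is made up for illustration, and only two of the columns are shown.

```python
# Only two of the table's columns are shown here; the real table has more.
COLUMNS = ["AERONAVE", "CONSECUENCIAS"]


def label_rows(rows, columns=COLUMNS):
    """Pair every cell with its column header, producing one dict per row."""
    labeled = []
    for index, row in enumerate(rows):
        # The number of columns is fixed, so a different cell count means
        # the row was read incorrectly.
        if len(row) != len(columns):
            raise ValueError(f"row {index} has {len(row)} cells, expected {len(columns)}")
        labeled.append(dict(zip(columns, row)))
    return labeled


# Example row built from the two values mentioned above.
rows = [["Procedimiento VFR Incumplido", "Separación inadecuada"]]
records = label_rows(rows)
print(records[0]["AERONAVE"])
print(records[0]["CONSECUENCIAS"])
```

The number of rows can vary freely, since each row is handled on its own; the part I don’t know how to do is getting the cells out of the PDF in the right order in the first place.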
I know this isn’t easy to do, but I hope someone can help me.