Extract only some columns in a PDF data table to excel

Hello to everyone,
i’m building a sequence to automate the import of employees’s salaries to the bank but in order to be able to make this, I have to extract this data from a data table in a pdf file like this:


the columns that interested me are “cognome”, “nome”, “importo”.
As you can see, the table could be split in two or more pages of pdf and this fact create some difficult for me.

How can i solve?
Tks guys

1 Like

Hey @daniele.giunta94

Please try the below steps,

  1. Read the PDF text with Preserve Format enabled

  2. Use Generate Data Table activity to create the table

  3. Then you can use DefaultView to select only required columns from the table

Hope this helps

Thanks
#nK

i attached you the sequence. I follow your steps but it doesn’t work.
Main.xaml (7.3 KB)

Can you check where is the problem in my sequence?
tks

1 Like