How to Extract table from non-native pdf?

Hello all !
I have pdf file that that 100+ pages, now I want to find where is table in pdf and extract that table and save in Excel.

Can someone give me idea? how can i find table is present in which page

@RobotUi

Welcome to the community

Two ways

  1. Use read pdf and get the data of pdf and then search for table headers in it…and then try to use regex to get the data under it
  2. Use document understandingn and train a model

Cheers

Headers may differ in every pdf

@RobotUi

Then even du will not hep unless you train each type of pdf…if you have what all types of tables can cone…then need to train all those different models in du and use them

See if it is a tagged pdf…then not a good way but we can try to use frontend as well to get the table

Cheers

Hi @RobotUi

Welcome to UiPath community

Check out the Video link

Regards
Gokul

Hi @RobotUi ,

Could you maybe also check with the below post :

1 Like

I do have pdf that has table as image. I need to extract table from image.