Scapping issue on multiple template

Hi Team,

We have multiple type of pdf files from which we need to extract table data into excel files. All the table structure are different.

Please suggest scrapping approach at here ,Is we create master template in excel.

Hi @Kaushal_Maru1 you can check the below link for scrapping table from the pdf

Hope it helps you

Regards

Nived N

Happy Automation

1 Like

Thanks for reply but we need to extract from multiple PDF and each pdf table have some different column. Now how we proceed on this.

Like u can loop.through each PDF files and extract the tables using the above activity

Hello Kaushal,
In this video, I have 17 use-cases for extracting tables from PDF and write data in Excel:

2:00 GitHub free code for all the files
2:20 Logic of general workflow
4:40 File 1 simple PDF
9:50 File 2 PDF with a column with multiple lines
20:10 File 3 PDF with a column with multiple words ON the LAST column
27:00 File 5 PDF with a column with multiple words ON inside column (2 columns)
31:40 File 6 PDF with a column with multiple lines
39:10 File 8 simple PDF
42:15 File 9 PDF with multiple spaces on that need to be correct
45:50 File 10 PDF with multiple columns that have multiple lines + multiple pages
55:50 File 11 simple PDF with protection empty Cells
58:35 File 12 Big PDF with an empty line and Empty columns and partial total
1:02:25 File 13 PDF with multiple columns that have multiple words and hard to define a rule
1:10:15 File 15 PDF with multiple columns that have multiple lines
1:12:50 File 17 simple PDF remove spaces from headers also remove space from Data
1:16:05 File 18 simple PDF
1:17:10 File 19 PDF with multiple pages and columns with multiple lines
1:22:10 File 20 PDF with multiple columns that have multiple lines
1:25:00 File 21 PDF with empty columns and subtotal

Code:

Thanks,
Cristian Negulescu