How to extract table details when a document format has two tables side by side

charantej · January 14, 2023, 8:12pm

Please see the screenshot below and the attached PDF for reference; it has two tables side by side. Line items are not extracted properly from PDFs in this format: the first line of both tables is treated as a single line, and the same happens for lines 2, 3, 4, 5, and so on.
Template-19.pdf (152.3 KB)

Anil_G · January 15, 2023, 3:52am

@charantej

If both tables are being read into the same table and all of the data is there, you can copy the DataTable into another one and use the Remove Data Column activity to remove the columns you don't need from each copy. That should give you the two tables separately.
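As a rough illustration of that idea outside of UiPath, here is a minimal Python sketch. It assumes the merged table is available as a header plus a list of rows, and that you know how many columns belong to the left-hand table. The function name `split_side_by_side`, the `left_width` parameter, and the sample data are all hypothetical and not taken from the attached PDF.

```python
from typing import List, Sequence, Tuple

Row = List[str]


def split_side_by_side(
    header: Sequence[str],
    rows: Sequence[Sequence[str]],
    left_width: int,
) -> Tuple[List[Row], List[Row]]:
    """Split a merged table into left and right tables at column index left_width."""
    if not 0 < left_width < len(header):
        raise ValueError("left_width must fall strictly inside the header")

    # Each output table starts with its own slice of the header row.
    left: List[Row] = [list(header[:left_width])]
    right: List[Row] = [list(header[left_width:])]

    for row in rows:
        left_cells = list(row[:left_width])
        right_cells = list(row[left_width:])
        # The two tables may have different lengths, so drop halves that
        # contain only empty cells instead of keeping blank rows.
        if any(cell.strip() for cell in left_cells):
            left.append(left_cells)
        if any(cell.strip() for cell in right_cells):
            right.append(right_cells)

    return left, right


# Made-up sample data: two 2-column tables that were read as one 4-column table.
merged_header = ["Item", "Qty", "Item", "Qty"]
merged_rows = [
    ["Bolt", "10", "Nut", "4"],
    ["Screw", "25", "", ""],
]

left_table, right_table = split_side_by_side(merged_header, merged_rows, left_width=2)
```

This only works when every value was captured and landed in the right column; if cells from the two tables were mixed together during extraction, splitting by column will not recover them.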

Hope this helps

Cheers

supermanPunch · January 15, 2023, 6:13am

Hi @charantej ,

Could you let us know whether you have performed data labelling for these documents and then trained the model?

When labelling, you should be able to select and label each row of both tables separately, marking them as separate rows if that is how you want the data to be fetched.

Let us know what steps you have taken up to now. Are you using a custom DU model or a pre-trained model?

charantej · January 15, 2023, 4:34pm

I’m using a custom DU model. I have labelled the data, trained the model, and am now using it. I labelled around 35 PDFs covering all the different formats; these are the only files I have right now. Only 2 of the PDFs have adjacent tables, and in those I labelled the rows of each adjacent table separately.

Anil_G · January 15, 2023, 6:02pm

@charantej

It is recommended to train on at least 5 samples for each layout type.

Cheers
