How to extract one table from a PDF that contains multiple tables without using Document Understanding?

Purvai_Marwaha · February 2, 2021, 10:17am

How do I extract a table from a PDF without using Document Understanding and the epilson method?

NIVED_NAMBIAR · February 2, 2021, 10:20am

Hi @Purvai_Marwaha

If the PDF is not a scanned one, you can open the PDF file and use data scraping to get the table and store it in a DataTable.

Regards

Nived N

Happy Automation

Purvai_Marwaha · February 2, 2021, 10:27am

There are no selectors in the PDF, so I can't do it with data scraping.

NIVED_NAMBIAR · February 2, 2021, 10:29am

Try that @Purvai_Marwaha

When I tried to scrape a PDF using data scraping, it worked.

Pradeep_Shiv · February 2, 2021, 10:32am

Hello @Purvai_Marwaha,

You can use data scraping. Just indicate any cell in the first column and you should get structured data; based on your requirement, you can then change or modify the metadata.

cheers

Purvai_Marwaha · February 2, 2021, 10:33am

I tried; it's not working.

NIVED_NAMBIAR · February 2, 2021, 10:34am

Or try the screen scraping option.

Screen scraping will give you the table as a string, which you can then pass to Generate Data Table to get a DataTable.
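
The string-to-table step can also be sketched outside UiPath. This is only a rough Python illustration of the idea, not the Generate Data Table activity itself. It assumes the scraped text has one row per line with tab-separated columns; `generate_table`, `column_sep`, and the `Column1`-style fallback names are hypothetical names made up for this example.

```python
def generate_table(text, column_sep="\t", has_header=True):
    """Split scraped text into (header, rows); blank lines are skipped."""
    rows = [
        [cell.strip() for cell in line.split(column_sep)]
        for line in text.splitlines()
        if line.strip()
    ]
    if not rows:
        return [], []
    if has_header:
        # First non-blank line is treated as the column names.
        return rows[0], rows[1:]
    # No header line: invent placeholder column names (assumption).
    width = max(len(row) for row in rows)
    header = [f"Column{i + 1}" for i in range(width)]
    return header, rows


scraped = "Name\tQty\nPen\t2\n\nBook\t1\n"
header, body = generate_table(scraped)
print(header)
print(body)
```

If the real text uses a different separator (for example `;` or `|`), pass it as `column_sep`.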

Regards

Nived N

Happy Automation

Purvai_Marwaha · February 2, 2021, 10:34am

It says this control does not support data extraction.

Pradeep_Shiv · February 2, 2021, 10:38am

Can we see how the data looks?

Purvai_Marwaha · February 2, 2021, 11:17am

Marius_Puscasu · February 2, 2021, 12:00pm

Hi @Purvai_Marwaha ,

For data extraction, try opening the PDF file with Adobe Acrobat Reader. Please avoid using PDF Exchange or similar tools.

Best regards,
Marius

Cristian_Negulescu · February 24, 2021, 2:46pm

Hello Purvai,
In this video, I extract tables from PDF and write data in Excel:

0:25 Install PDF Activities
1:10 READ PDF text, Get PDF page count, Extract PDF
5:40 Read PDF with OCR
6:55 Join PDF and Manage PDF passwords
9:30 Extract Images From PDF and Export PDF as Image
12:00 Extract table from PDF, use case 1: replace some spaces with | (one column has multiple words)
24:00 Run the robot to see the result
25:40 Extract table from another PDF, use case 2: the delimiter is two spaces, easy split
31:50 Extract table from a complex PDF, use case 3: unstructured data, the logic is based on IsUpper and IsLower
40:25 Extract the price value from PDF
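
For readers who want the two-space idea from use case 2 in code form, here is a small Python sketch. It is not taken from the video and is not a UiPath workflow. It assumes the PDF text has already been read into a string, that columns are separated by two or more spaces, and that each table ends at a blank line. `extract_table` and `header_marker` are hypothetical names used only here. It also shows one way to pick a single table out of several: look for a word that appears only in that table's header line.

```python
import re


def extract_table(text, header_marker):
    """Return the rows of the first table whose header line contains header_marker."""
    rows = []
    inside = False
    for line in text.splitlines():
        if not inside:
            if header_marker not in line:
                continue  # still before the wanted table
            inside = True
        elif not line.strip():
            break  # a blank line ends the table (assumption)
        # Two or more whitespace characters separate columns;
        # single spaces stay inside a cell.
        rows.append(re.split(r"\s{2,}", line.strip()))
    return rows


pdf_text = (
    "Orders\n"
    "Item  Qty  Price\n"
    "Blue pen  2  1.50\n"
    "Notebook  1  3.00\n"
    "\n"
    "Payments\n"
    "Date  Amount\n"
    "2021-02-02  6.00\n"
)
for row in extract_table(pdf_text, "Item"):
    print(row)
```

Cells with a single space inside them ("Blue pen") stay in one piece. When columns are separated by only one space, this split is not enough, which is where the other use cases in the list come in.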

Thanks,
Cristian Negulescu
