Extract a table from a PDF as it is

Suraj_Gaikwad_Nuvama_Grou · April 24, 2023, 12:19pm

How can I extract table-formatted data from a PDF to Excel?

Ashish_Soni · April 24, 2023, 12:23pm

Hi @Suraj_Gaikwad_Nuvama_Grou ,

You can follow these general steps:

1. Install the UiPath.PDF.Activities package from the Manage Packages option in UiPath Studio.
2. Use the Read PDF Text activity to extract the text from the PDF file.
3. Use the Generate Data Table activity to convert the extracted text into a DataTable.
4. Use the Filter Data Table activity to remove any unwanted rows or columns from the DataTable.
5. Use the Write Range activity to write the filtered DataTable to an Excel file.
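
The steps above are UiPath activities. Purely as an illustration of the same flow (text, then rows and columns, then filter, then write), here is a minimal Python sketch. It assumes the extracted text keeps its layout and that columns are separated by two or more spaces. The function names and the sample text are made up for the example, and it writes a CSV file (which Excel can open) rather than a real workbook.

```python
import csv
import re


def text_to_rows(text):
    """Split layout-preserved text into rows of cells.

    Assumption: columns are separated by two or more whitespace characters.
    """
    rows = []
    for line in text.splitlines():
        line = line.strip()
        if line:
            rows.append(re.split(r"\s{2,}", line))
    return rows


def keep_table_rows(rows, n_columns):
    """Filter step: keep only rows that have the expected number of cells."""
    return [row for row in rows if len(row) == n_columns]


def write_csv(rows, path):
    """Write the rows to a CSV file that Excel can open."""
    with open(path, "w", newline="", encoding="utf-8") as f:
        csv.writer(f).writerows(rows)


# Invented sample standing in for text read from a PDF.
sample_text = """
Invoice summary for April

Item        Qty    Price
Widget A    2      10.00
Widget B    5      4.50

Thank you for your business
"""

table = keep_table_rows(text_to_rows(sample_text), n_columns=3)
for row in table:
    print(row)
```

Calling `write_csv(table, "table.csv")` would then save the filtered rows. Filtering by cell count is only one possible rule; a real PDF may need a different one.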

Thanks

arjunshenoy · April 24, 2023, 12:23pm

Hi @Suraj_Gaikwad_Nuvama_Grou

Please check out the following thread:

Hope this helps,
Best Regards.

Suraj_Gaikwad_Nuvama_Grou · April 24, 2023, 12:31pm

I didn't understand point 4.

Ashish_Soni · April 24, 2023, 12:37pm

Hi @Suraj_Gaikwad_Nuvama_Grou ,

Remove the unwanted columns or rows and keep only the ones you need. If there is no unwanted data, you can skip this step.

Thanks

Palaniyappan · April 24, 2023, 12:44pm

Hi
We just had a similar discussion about PDF table extraction.

Have a look at this thread for more details.

Cheers @Suraj_Gaikwad_Nuvama_Grou

Srini84 · April 24, 2023, 12:45pm

@Suraj_Gaikwad_Nuvama_Grou

Check the post below for your reference.

You can also use Python code with the Camelot library.
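
As a rough sketch of the Camelot route (not a tested workflow for your files): the function below reads the tables Camelot finds and writes each one to its own sheet. It assumes `camelot-py`, `pandas` and `openpyxl` are installed and that the PDF is text-based rather than scanned. The function name, sheet names and file names are made up for the example.

```python
def pdf_tables_to_excel(pdf_path, xlsx_path, pages="1", flavor="lattice"):
    """Write every table Camelot detects to its own sheet; return the table count.

    flavor="lattice" targets tables with ruled lines; flavor="stream" targets
    tables whose columns are separated only by whitespace.
    """
    # Imported here so this file still loads when the packages are missing.
    import camelot
    import pandas as pd

    tables = camelot.read_pdf(pdf_path, pages=pages, flavor=flavor)
    if len(tables) == 0:
        return 0  # nothing detected, so no workbook is written

    with pd.ExcelWriter(xlsx_path) as writer:
        for i in range(len(tables)):
            # tables[i].df is a pandas DataFrame holding the cell text.
            tables[i].df.to_excel(
                writer, sheet_name=f"table_{i + 1}", index=False, header=False
            )
    return len(tables)
```

A call would look like `pdf_tables_to_excel("report.pdf", "report.xlsx")`, where both file names are placeholders. If no tables are found with one flavor, trying the other is usually the first thing to check.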

Hope this may help you

Thanks,
Srini

Suraj_Gaikwad_Nuvama_Grou · April 24, 2023, 12:56pm

Could you share an example? That would be helpful for me.

Palaniyappan · April 24, 2023, 12:58pm

There are some demos on this.
Hope they help you build the workflow.

@Suraj_Gaikwad_Nuvama_Grou

supermanPunch · April 24, 2023, 3:08pm

Hi @Suraj_Gaikwad_Nuvama_Grou ,

Maybe you could also check whether the post below suits your requirements:

Suraj_Gaikwad_Nuvama_Grou · April 25, 2023, 7:18am

I don't understand.

Suraj_Gaikwad_Nuvama_Grou · April 25, 2023, 7:20am

Please share more information on how to extract a table from a PDF file as it is.

supermanPunch · April 25, 2023, 7:22am

@Suraj_Gaikwad_Nuvama_Grou ,

We do not know which points you are unable to understand.

It would be better if you could provide a sample data file along with the expected extraction result, formatted in an Excel file for example. That way we can help you better and suggest the proper approach.

Suraj_Gaikwad_Nuvama_Grou · April 25, 2023, 7:37am

The scenario is:
1. We have 5 PDF files.
2. Select some specific text data and a table from the PDF files.
3. Put the selected data into the mail body.
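
For the last step, one possible shape (a sketch only, written in Python rather than as UiPath activities) is to turn the extracted rows into an HTML table and place it in the mail body. The function names, addresses and sample data below are invented for illustration, and nothing is actually sent.

```python
from email.message import EmailMessage
from html import escape


def rows_to_html_table(rows):
    """Render a list of rows (lists of cells) as a simple HTML table."""
    body = "".join(
        "<tr>" + "".join(f"<td>{escape(str(cell))}</td>" for cell in row) + "</tr>"
        for row in rows
    )
    return f'<table border="1">{body}</table>'


def build_message(sections, sender, recipient, subject):
    """Build a mail whose HTML body has one block per PDF.

    sections: list of (title, text, rows) tuples, one per PDF file.
    """
    html_parts = [
        f"<h3>{escape(title)}</h3><p>{escape(text)}</p>{rows_to_html_table(rows)}"
        for title, text, rows in sections
    ]
    msg = EmailMessage()
    msg["From"] = sender
    msg["To"] = recipient
    msg["Subject"] = subject
    msg.set_content("This message contains HTML tables.")  # plain-text fallback
    msg.add_alternative("".join(html_parts), subtype="html")
    return msg


# Invented data standing in for what was extracted from one PDF.
sections = [("file1.pdf", "Client: ACME Ltd", [["Scrip", "Qty"], ["ABC", "10"]])]
message = build_message(sections, "bot@example.com", "user@example.com", "PDF extract")
```

The same idea carries over to any mail step that accepts an HTML body: build the table markup as a string and pass it as the body.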

supermanPunch · April 25, 2023, 9:53am

@Suraj_Gaikwad_Nuvama_Grou ,

The highlighted point above is where we need more information. Could you provide some sample data (with the original data masked) after it has been extracted to a text file?

Also keep the PreserveFormatting option checked in the Read PDF Text activity.

We need to know which specific text you want to extract and whether there is an anchor or keyword we can refer to in order to extract it. The same applies to the table data.
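
To show what is meant by an anchor or keyword, here is a small Python sketch (illustrative only; the keywords and sample text are made up, and the same logic could be built with string or regex operations in a workflow). One helper reads the value that follows a keyword on the same line, and the other collects the lines between a start keyword and an end keyword, which is one way to isolate a table.

```python
import re


def value_after(text, keyword):
    """Return the rest of the line that follows `keyword`, or None if absent."""
    match = re.search(re.escape(keyword) + r"[ \t]*:?[ \t]*(.+)", text)
    return match.group(1).strip() if match else None


def lines_between(text, start_keyword, end_keyword):
    """Return the non-empty lines after the line containing `start_keyword`
    and before the line containing `end_keyword`."""
    collected = []
    inside = False
    for line in text.splitlines():
        if not inside:
            if start_keyword in line:
                inside = True
            continue
        if end_keyword in line:
            break
        if line.strip():
            collected.append(line.strip())
    return collected


# Invented sample standing in for text extracted from a PDF.
sample = """Client Name: ACME Ltd
Holdings
Scrip      Qty
ABC        10
XYZ        25
Total      35
"""

client = value_after(sample, "Client Name")
table_lines = lines_between(sample, "Holdings", "Total")
```

This only works when the keywords appear consistently in every file, which is why a masked sample of the extracted text is needed.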

We provided some examples above that also suggest possible solutions.

Note that there are already many posts about extracting data from PDF files in general, but for a specific case that has not come up before, we would need to check the format of the input data.

Ambika_Singh1 · March 4, 2024, 6:16pm

Hi,

You can watch this video for your reference

Thanks,
Ambika
