How to extract data from PDF and save it in excel

Vincent_Nuestro · February 22, 2023, 1:09pm

How can I extract data from a PDF and save it in an Excel file?

I have some records that I need to get from inside a PDF file.
Please see the image below.

Here is the output I copied and pasted into Notepad.

How can I arrange the records and write them to Excel?

Thank you in advance
Vincent

GANESH_BALAGAM · February 22, 2023, 1:14pm

Hi @Vincent_Nuestro
If you have items in bulk, it is better to use a Document Understanding process to extract the data from the documents and import it into a DataTable.

Vincent_Nuestro · February 22, 2023, 1:20pm

Hi, this process only runs every Monday and handles around 30 to 50 items each Monday.

GANESH_BALAGAM · February 22, 2023, 1:25pm

If the documents have a fixed format, whether structured or unstructured, you can get the data by using Regex or string manipulation, or with a Document Understanding process.
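As a rough sketch of the string manipulation route, here is a Python illustration (not a UiPath workflow). It assumes the text copied out of the PDF has one record per line with columns separated by two or more spaces. The sample text and column names are made up, since the actual PDF layout is not shown in this thread.

```python
import re

# Hypothetical sample: text copied out of the PDF, one record per line.
SAMPLE_TEXT = """\
Item No   Description      Qty
1001      Blue Widget      5
1002      Red Gadget       12
"""

def split_rows(text):
    """Split each non-empty line into cells on runs of two or more spaces."""
    rows = []
    for line in text.splitlines():
        line = line.strip()
        if line:
            # A single space stays inside a cell (e.g. "Blue Widget").
            rows.append(re.split(r"\s{2,}", line))
    return rows

rows = split_rows(SAMPLE_TEXT)
header, records = rows[0], rows[1:]
```

This only works while the column gaps stay wider than the spaces inside a value, so it is worth checking against a few real PDFs first.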

supermanPunch · February 22, 2023, 1:48pm

Hi @Vincent_Nuestro ,

If the table always contains values and never has blank cells, we could adopt the regex method: retrieve the records, add them to a DataTable, and then write it to an Excel sheet.

Could you confirm whether data will always be present for each column in the PDF table? If there are cases with empty values, could you provide sample data for that case along with a case where all columns are filled?
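To illustrate the idea outside UiPath, here is a minimal Python sketch of the regex approach. The sample text, the column names, and the pattern are all assumptions, because the real PDF content has not been shared yet. It writes CSV text (which Excel can open) instead of a native .xlsx file, to keep the example dependency-free.

```python
import csv
import io
import re

# Hypothetical extracted text; the real PDF layout is not shown in the thread.
PDF_TEXT = """\
INV-001 2023-02-20 150.00
INV-002 2023-02-21 89.50
"""

# One named group per column; a line matches only if every column is non-empty.
ROW_PATTERN = re.compile(
    r"^(?P<invoice>INV-\d+)\s+(?P<date>\d{4}-\d{2}-\d{2})\s+(?P<amount>\d+\.\d{2})$",
    re.MULTILINE,
)

def extract_records(text):
    """Return one dict per matching line, keyed by column name."""
    return [m.groupdict() for m in ROW_PATTERN.finditer(text)]

def to_csv(records, fieldnames):
    """Serialise the records as CSV text, which Excel can open."""
    buffer = io.StringIO(newline="")
    writer = csv.DictWriter(buffer, fieldnames=fieldnames)
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()

records = extract_records(PDF_TEXT)
csv_text = to_csv(records, ["invoice", "date", "amount"])
```

Because each group requires at least one character, a row with a blank column would be skipped silently, which is why the blank-value case matters. In a workflow, the same steps would map to a regex match, a DataTable, and an Excel write activity.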

Providing samples of different scenarios would help us better analyse the problem and suggest the appropriate approach.

Vincent_Nuestro · February 22, 2023, 3:59pm

Hi, to confirm, the data will always be present for each column in the PDF table.

Thank you

supermanPunch · February 22, 2023, 4:53pm

Hi @Vincent_Nuestro ,

Could you check the workflow provided in the post below:

If the PDF has a simple table, this workflow should be able to extract it and save it in Excel. It might not work for all types of PDF.

Do try it, though, and let us know if you get any errors.

Also, for the regex method of extraction, we would need a sample PDF or the extracted text so that we can check the patterns, analyse them, and provide a better solution. You could also read up on the regex part, as there are many tutorials available on the Forum.
