Gather Data from different PDF files and paste it into One excel sheet

How can I gather data from 3 PDF files and paste it into a single Excel sheet?

Do the PDFs contain structured data, for example in tabular form?
Cheers @saikumar


–I hope you have those three PDF files stored in one folder.
If so:
–Use an ASSIGN activity like this:
arr_filepath = Directory.GetFiles("yourfolderpath","*.pdf")
where arr_filepath is a string array variable that will store the file paths of all the PDFs in that folder.

–Next, create a datatable with the BUILD DATATABLE activity. Give it the same column names as the table in the PDFs, and store the output in a variable of type DataTable named Finaldt.
–Now use a FOR EACH loop, pass arr_filepath as its input, and change the TypeArgument to String in the Properties panel of the For Each activity.
–Inside the loop, use a READ PDF TEXT activity, or READ PDF WITH OCR if the PDF is a scanned image.
Pass item as the input and store the output in a variable of type String named str_output.
–Now use a GENERATE DATA TABLE activity, pass str_output as the input, and store the output in a variable of type DataTable named outdt.
–Now use a MERGE DATA TABLE activity. In the Properties panel, set the Source property to outdt and the Destination property to Finaldt (the output of the Build Datatable activity).
–Finally, after the For Each loop, use a WRITE RANGE activity and pass Finaldt as the datatable input. By then it holds the merged data of all three PDFs. Make sure that ADD HEADERS is enabled in the Properties panel of Write Range.
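For anyone who finds the flow easier to follow as code, here is a rough sketch of the same logic in plain Python. It is only an analogy, not UiPath code: the function names are made up for illustration, the text extracted from each PDF is assumed to be comma-separated with a header line, and a CSV file stands in for the Excel sheet.

```python
import csv
import io
from pathlib import Path


def list_pdfs(folder):
    """Rough stand-in for Directory.GetFiles(folder, "*.pdf")."""
    return sorted(str(p) for p in Path(folder).glob("*.pdf"))


def generate_rows(text, delimiter=","):
    """Stand-in for Generate Data Table: split text into header and rows."""
    reader = csv.reader(io.StringIO(text.strip()), delimiter=delimiter)
    lines = [row for row in reader if row]
    if not lines:
        return [], []
    return lines[0], lines[1:]


def merge_tables(extracted_texts):
    """Stand-in for running Merge Data Table once per file in the loop."""
    final_header, final_rows = None, []
    for text in extracted_texts:
        header, rows = generate_rows(text)
        if not header:
            continue  # nothing could be read from this file
        if final_header is None:
            final_header = header
        elif header != final_header:
            raise ValueError("column names differ between files")
        final_rows.extend(rows)
    return final_header or [], final_rows


def write_csv(path, header, rows):
    """Stand-in for Write Range with Add Headers enabled."""
    with open(path, "w", newline="") as handle:
        writer = csv.writer(handle)
        writer.writerow(header)
        writer.writerows(rows)


if __name__ == "__main__":
    # Hypothetical text, as if it had been read from two PDFs.
    texts = ["Name,Amount\nA,10\nB,20", "Name,Amount\nC,30"]
    header, rows = merge_tables(texts)
    print(header, rows)
```

The header check reflects the same requirement as in the workflow: the merge only gives a clean sheet when every file produces the same columns.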

That's all, you are done.
Hope this helps.
Kindly try this and let me know if you have any queries or need clarification.
Cheers @saikumar

Hi Sai,

If the data is structured and has the same layout in all 3 files, you can use data scraping.

Steps:

Step 1: Read all the files from the folder using Directory.GetFiles(path).
Step 2: Use a For Each to iterate through them.
Step 3: Inside the For Each, pass each file to Start Process to open it, then do data scraping (use it for structured data) or screen scraping if you want to get specific contents.
Step 4: Then write the result into Excel using Append Range.

If the data is unstructured:

Build a data table with the required columns using the Build Data Table activity.
Then follow the first 2 steps above and use Read PDF Text inside the loop.
Use string manipulation to pick out the values, use Add Data Row to add them to that table, and then write the datatable into Excel.
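To give an idea of what the string manipulation step could look like, here is a small sketch in plain Python (an analogy only, not UiPath code). The field names and patterns are hypothetical; you would replace them with whatever labels actually appear in your PDFs.

```python
import re

# Hypothetical columns and patterns; adjust them to the labels in your files.
FIELD_PATTERNS = {
    "InvoiceNo": re.compile(r"\bInvoice\s*No[:.]?\s*(\S+)", re.IGNORECASE),
    "Total": re.compile(r"\bTotal[:.]?\s*([\d.,]+)", re.IGNORECASE),
}


def extract_row(text):
    """Pull one value per column out of the text read from a single PDF."""
    row = {}
    for column, pattern in FIELD_PATTERNS.items():
        match = pattern.search(text)
        # Leave the cell empty when the label is not found in this file.
        row[column] = match.group(1) if match else ""
    return row


def build_table(texts):
    """Stand-in for calling Add Data Row once per PDF."""
    return [extract_row(text) for text in texts]


if __name__ == "__main__":
    sample = "Invoice No: INV-001\nSubtotal: 1,000.00\nTotal: 1,250.00"
    print(build_table([sample]))
```

Each dictionary plays the role of one data row, so the list returned by build_table corresponds to the datatable you would write to Excel at the end.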

Another option would be to use the Computer Vision activities (here the first two steps would be the same).

Vishnuvarthan P

Were you able to get it working now, buddy?
Cheers @saikumar

Hi @vishnuvarthanp … I have got stuck on some errors, please help me out.

Hi Sai, please let us know what the issue is.