How to Extract from multiple Pdf and store in temporary database then in Excel file

I have multiple filed in a folder where I have to extract the data from all pdf and store in temporary and then into excel… Does anyone have the same workflow or guide?

@balkishan

You can follow the below steps :

  • Read all files with commmand Directory.GetFiles() method.
  • Iterate through each file using for each activity.
  • Read PDF with Read PDF Text activity if there’s not any image inside that else use Read PDF Text with OCR acitivity. Both output’s will be of string type.
  • By doing string operations as per your scenario, retrieve all values from string and use write range activity to store into excel.

Let me know incase of any queries.

1 Like