How to extract PDF data using REFramework

I need to create a workflow that processes PDF invoices and extracts their data according to a Data Mapping Excel sheet. The data mapping sheet has various categories, and the data to extract from each PDF file depends on its category.
I am confused about how to design the workflow using REFramework, because I don't understand what the Transaction Item for my process should be.
Hi @chauhan.rachita30

Since you are processing each invoice, taking its data, and filling it into Excel, each Transaction Item should be one invoice file, i.e. the invoice file path.
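
To make the idea concrete, here is a minimal sketch of the "one file path per transaction" pattern. It is written in Python only to show the logic; in REFramework the same thing is done with workflow activities, and the names `get_transaction_item` and `run_all` are made up for this illustration. It assumes the transaction number starts at 1 and that returning nothing means there are no more items.

```python
from typing import Callable, List, Optional


def get_transaction_item(file_list: List[str], transaction_number: int) -> Optional[str]:
    """Return the invoice path for this transaction, or None when no files remain.

    transaction_number is assumed to start at 1.
    """
    if 1 <= transaction_number <= len(file_list):
        return file_list[transaction_number - 1]
    return None


def run_all(file_list: List[str], process: Callable[[str], None]) -> int:
    """Call process(path) once per file and return how many files were handled."""
    transaction_number = 1
    while True:
        item = get_transaction_item(file_list, transaction_number)
        if item is None:
            break  # no more invoices, so processing ends
        process(item)
        transaction_number += 1
    return transaction_number - 1
```

For example, `run_all(["inv1.pdf", "inv2.pdf"], print)` would hand each path to `print` in turn and return 2.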

For more details on working with files in REFramework:

Nived N

Thanks for sharing the video.

How do I do email automation in REFramework? Please reply.

Hi @pallavireddy077
Can you explain your process?

First, thank you for your reply.
Actually, I need to do PDF automation. I need to read each file and extract some data from it, such as invoice number, date, name, and bill amount, and finally put all this data into an Excel file. How can I do this in REFramework?

Hi @pallavireddy077

Check the sample process (1015.1 KB)
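
Separately from the attached sample, here is a rough sketch of the "extract fields, then write rows" step. It is Python for illustration only and makes several assumptions: the PDF text has already been read into a string, the label patterns in `FIELD_PATTERNS` are hypothetical and will need adjusting to the real invoices, and a CSV file stands in for the Excel output.

```python
import csv
import re
from typing import Dict, List

# Hypothetical label patterns; real invoices will need their own.
FIELD_PATTERNS = {
    "invoice_no": r"Invoice\s*No\.?\s*[:#]?\s*(\S+)",
    "date": r"Date\s*:?\s*(\d{1,2}[/-]\d{1,2}[/-]\d{2,4})",
    "name": r"Name\s*:?\s*(.+)",
    "bill_amount": r"(?:Bill\s*Amount|Total)\s*:?\s*([\d,]+\.?\d*)",
}


def extract_fields(text: str) -> Dict[str, str]:
    """Return one row per invoice; a field that is not found is left empty."""
    row = {}
    for field, pattern in FIELD_PATTERNS.items():
        match = re.search(pattern, text, flags=re.IGNORECASE)
        row[field] = match.group(1).strip() if match else ""
    return row


def write_rows(rows: List[Dict[str, str]], out_path: str) -> None:
    """Write all extracted rows to a CSV file with a header line."""
    with open(out_path, "w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=list(FIELD_PATTERNS))
        writer.writeheader()
        writer.writerows(rows)
```

Leaving missing fields empty, rather than failing, makes it easy to spot afterwards which invoices did not match a pattern.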

Nived N

Hey, thank you for sharing the process here. I have one question.
Did you assign the file list (or folder file list) directly in the Assign activity because you didn't put it in the config file?

Well, you can add the file path to the config file and then use it in the Assign activity.

The file I created here is only a demo, to help you understand how the process works.
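
As a sketch of reading the folder from the config instead of hard-coding it, something like the following could work. It is Python for illustration only; the config is modelled as a plain dictionary, and the key name `InvoiceFolder` is an assumption, not something taken from the demo.

```python
from pathlib import Path
from typing import Dict, List


def build_file_list(config: Dict[str, object]) -> List[str]:
    """Build the list of invoice PDFs from a folder path stored in the config."""
    folder = Path(str(config["InvoiceFolder"]))  # hypothetical config key
    if not folder.is_dir():
        raise FileNotFoundError(f"Invoice folder not found: {folder}")
    # Sort so the files are processed in a repeatable order.
    return sorted(str(p) for p in folder.glob("*.pdf"))
```

Keeping the path in the config means the folder can be changed without editing the workflow itself.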


Thank you so much, it's really helpful for me.

Hey, I have one question: here we create the file_list variable and assign it to TransactionItem so that we can use that TransactionItem for each file, right?


Yes @pallavireddy077

Hey, I am not getting the extracted values in the Excel file, and the file is not moving to the successful folder either. Please check the attached file (2.4 MB).

Hello, I am not getting values in Excel in PDF automation.
Are you able to extract the data from the PDF?

Thank you for your quick response.
The problem is solved now. I am getting the values.

Do you know how we can add a delay to the workflow using the config file?
I want to add a delay because the PDF extraction is picking up only some of the values, not all of them.
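
One possible approach, sketched in Python only to show the logic: keep the delay and retry count in the config, re-run the extraction if any required value is still empty, and wait between attempts. The key names `ExtractDelaySeconds` and `ExtractRetries` and the function `extract_with_retry` are assumptions made up for this example.

```python
import time
from typing import Callable, Dict, List


def extract_with_retry(
    extract: Callable[[], Dict[str, str]],
    config: Dict[str, object],
    required_fields: List[str],
    sleep: Callable[[float], None] = time.sleep,
) -> Dict[str, str]:
    """Re-run extract() until every required field is non-empty or retries run out."""
    delay = float(config.get("ExtractDelaySeconds", 2))  # hypothetical config key
    retries = int(config.get("ExtractRetries", 3))       # hypothetical config key
    row: Dict[str, str] = {}
    for attempt in range(1, retries + 1):
        row = extract()
        if all(row.get(field) for field in required_fields):
            return row
        if attempt < retries:
            sleep(delay)  # wait before trying again
    return row  # last attempt, possibly still incomplete
```

Checking that the values are actually present, rather than only waiting a fixed time, avoids slowing down the invoices that extract correctly on the first try.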