Hello RPA developers,
I am trying to read and convert the bank statement “.PDF” file into an Excel spread sheet format “.xlsx”. The requirement is to go through the each row in the PDF and extract values and use them for validation purpose.
I appreciate your ideas and inputs in how to extract and read/convert PDF to xlsx/xls/csv in to perfect structured Data table.
Eg: Any bank statement with the list of GL transactions.
Hi Ashwin, Thanks for the reply. I did try using “read pdf with OCR” its actually working well when targeted at extracting one single page from the PDF. But here in my scenario i would need the transactions extracted from multiple pages lets say(page 4-10) The format is not same , it differs from page to page. Thus this method of extraction has not helped me.
Appreciate your suggestion,I welcome more suggestions please.
Using start process activity open the PDF file and using send hot key copy the content from pdf file.
Using excel application scope paste it into excel file using send hot key.