How to convert from PDF to XLS?

excel
pdf
studio

#1

Hello RPA developers,
I am trying to read and convert the bank statement “.PDF” file into an Excel spread sheet format “.xlsx”. The requirement is to go through the each row in the PDF and extract values and use them for validation purpose.

I appreciate your ideas and inputs in how to extract and read/convert PDF to xlsx/xls/csv in to perfect structured Data table.

Eg: Any bank statement with the list of GL transactions.


#2

hi @vijaygrpa
Use read pdf with ocr activity it will extract all the data in a Single string variable
and Use Write range or write cell activity to paste the content in .xlsx file
Need reference check this link https://www.uipath.com/tutorials/pdf-data-extraction-and-automation

Thanks
Ashwin S


#3

Hi Ashwin, Thanks for the reply. I did try using “read pdf with OCR” its actually working well when targeted at extracting one single page from the PDF. But here in my scenario i would need the transactions extracted from multiple pages lets say(page 4-10) The format is not same , it differs from page to page. Thus this method of extraction has not helped me.

Appreciate your suggestion,I welcome more suggestions please.