How to read pdf file in to the tabular format?

kiran · April 12, 2017, 9:55am

Hi how to read pdf file into the tablular format? is there any activity for that?

acaciomelo · April 12, 2017, 12:52pm

Basically, UiPath provides three ways of extracting data from PDF:

Get text activity with anchor base activity
Read PDF Text activity
Read PDF With OCR activity (this option is suggested when it’s not an original PDF file and is the last recommended since it’s prone to errors)

In your case, I would suggest to test those three options and check which one fulfill better your requirements. In my case, I had a project where I had to extract specific elements from the PDF and then I used the Read PDF Text activity to extract the whole text. After that, I split the text into an array of text lines and started to search the text I needed with functions like Substring, IndexOf, Split and so on.

I hope it helps.

Cosmin_Ion_Nicolae · April 12, 2017, 1:12pm

Hello.

Depending on the PDF structure, on 2016.2, the new Data Scraping wizard might be able to directly extract a table if the PDF is native (you can select the text). So you can also test that.

kiran · April 13, 2017, 4:48am

Thanks acaciomelo, will try this also.

kiran · April 13, 2017, 4:49am

Thanks Nicolae, so with the data scrapping we can get table.Hmm, i think this also good option. thanks again:)

Serran_Neru · April 13, 2018, 2:59am

Hi @kiran,

Could you please let me know, what is the data type for reading each line in for each activity?

Regards,
Serran

Topic		Replies	Views
UiPath PDF Structured Table Extraction Help pdf , activities , data_scraping , question	1	953	January 6, 2020
Read PDF Text activity is not working for PDF in Text format Help	4	6679	September 18, 2018
PDF to excel Help	7	9079	October 17, 2018
Extract tabular data from PDF Help pdf , activities , data_scraping , question , data_manipulation	7	1623	December 14, 2019
Extract format of file with data Help	1	981	January 13, 2019

How to read pdf file in to the tabular format?

Related topics