How to extract blocks of tabular data which spans multiple lines from PDF

Hi all,

May I ask what would be the best method to extract blocks of tabular data which spans multiple lines from PDF?
A sample of how the data looks like in PDF is attached below.
Please note that the “blocks” of data can also span multiple pages.

Will need to extract into DataTable where columns can be like:
“Seq”, “Serial Number”, “Reading 1 Start”, “Reading 1 End”, “Reading 2 Start”, “Reading 2 End”…

2021-06-06_10h36_13

Thanks!

Hello KL Low

Welcome to the UiPath Community.

Try to use Read PDF text activity to convert it into string and then use regex to extract the required data. Please share the pdf file or the text extracted for further help. This activity is available under UiPath.PDF.Activities package that you can download from manage package section.

Hi Varun Kumar,

Thanks for the guidance. Let us try to make it work and update back here.

Regards,
KL Low

Hello Rana,
In this video, I have 17 examples with code on how to extract data from PDF (try some VB.NET logic on your PDF ):

Thanks,
Cristian