How to Read PDF table data next to Keyword

Niranjan_k · January 8, 2024, 7:35pm

Hi All,

I would like to read pdf table next keyword found. for below example i would like to read table data next to “Example of Table Data2” keyword found in pdf file. any suggessions plese.

Image 1-9-24 at 12.59 AM

pdf2.pdf (166.8 KB)

Thanks in Advance,
Niranjan

vrdabberu · January 9, 2024, 12:43am

Hi @Niranjan_k

Use the read pdf with ocr activity if it’s a scanned file and use use the read pdf text if it’s and normal pdf. After using the above mentioned activity you will be able to read the entire pdf and store that data in a string variable.

So after getting the data into a variable use the regex expressions and extract the required data.

Regards

Niranjan_k · January 9, 2024, 2:00am

@vrdabberu I tried with the option but have 55 pages of pdf with unstructured table details

vrdabberu · January 9, 2024, 4:13am

Hi @Niranjan_k

Yeah you can use the regex expression to extract all the tables and then place the regex expression in matches activity. Use the for each activity and give the output of the matches activity as the input for the for each and by iteration you are able to get the data tables.

Regards

Niranjan_k · January 9, 2024, 6:30am

@vrdabberu i tried same logic with real data. The problem is column space and data space we have same. We are unable to differentiate data and columns. I’m trying to use the extract data table concept which is not working.
When key word found immediately I want to extract the table

Anas-p-v · January 9, 2024, 7:28am

One of option would be to use read pdf activities and using regex to get the table from that.
Or else, you have UiPath Document Understanding which can be used to extract tables and fields from documents.

Topic		Replies	Views
Extract specific table within PDF Form with RegEx Studio studio , question , activities_panel	12	1663	March 8, 2023
How to read the specific data in pdf Activities pdf , activities , question	33	4925	June 2, 2021
How to Read a Table data from PDF and store in Excel or Word? Help excel , uiautomation , pdf , activities	7	3014	October 7, 2019
Extract certain key words from multiple pdfs Activities pdf , activities , question	8	913	February 8, 2022
Unable to get 'PDF Table Data' Using Regex,String manipulation Help studio , question	4	886	February 24, 2021

Most Active Users - Yesterday
sharazkm32
sonaliaggarwal47
martin.parovski
prashant1603765
postwick
Darshan_Sable
ashokkarale
Anil_G
arivu96
V_Roboto_V
More details...

How to Read PDF table data next to Keyword

Related topics