Extract data from a PDF based on whether a keyword exists

I have a PDF with more than 100 pages, but I only want the pages whose data contains a keyword, based on an account number.
Most of the pages are digital, but a few may be scanned.

Hi @Chandra_Sekhar_K ,

Please follow the steps below.

Get the page count of the PDF.
Loop from 1 to the total number of pages.
In each iteration, use the Read PDF Text activity with its Range property set to the current page number, and store the output in a string variable.
Check that string for the keyword with the .Contains method or the .IndexOf method.
If the keyword exists on the page, read the data from that page.

If a page is scanned, use the Read PDF With OCR activity with the same page range instead.
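
For anyone who wants to see the logic outside the workflow designer, here is a minimal Python sketch of the same loop. It is only an illustration, not the UiPath implementation: `find_pages_with_keyword`, `page_texts` and `ocr_page` are hypothetical names, and it assumes the text of each page has already been read one page at a time. A page that comes back empty is treated as scanned and passed to the OCR callback.

```python
from typing import Callable, List, Optional


def find_pages_with_keyword(
    page_texts: List[str],
    keyword: str,
    ocr_page: Optional[Callable[[int], str]] = None,
) -> List[int]:
    """Return the 1-based page numbers whose text contains the keyword."""
    matches = []
    for page_number, text in enumerate(page_texts, start=1):
        # Assumption: a scanned page yields no text from a digital read,
        # so fall back to OCR for that page if an OCR reader was supplied.
        if not text.strip() and ocr_page is not None:
            text = ocr_page(page_number)
        # Equivalent of the .Contains check in the steps above.
        if keyword in text:
            matches.append(page_number)
    return matches


# Made-up sample data: page 3 is "scanned" (no digital text).
pages = ["Statement summary", "Account No: 12345678\nBalance: 250.00", ""]
fake_ocr = lambda page_number: "Account No: 12345678 (scanned copy)"

print(find_pages_with_keyword(pages, "12345678", ocr_page=fake_ocr))  # [2, 3]
```

Treating an empty page as scanned is a simple heuristic; if the scanned pages carry a thin text layer, a minimum character count may work better.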

Hope the above steps help.



On those pages, I have specific fields from which I need to extract data.

In that case, after following the steps above, try using regular expressions (Regex) to extract the required fields.
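
As a rough illustration of the Regex step, here is a small Python sketch. The field labels ("Account No", "Balance"), the patterns and the sample text are all made up for the example, since the actual layout of the pages is not shown in this thread; the patterns would need to be adjusted to match the real labels.

```python
import re

# Hypothetical field patterns; adjust the labels to match the real pages.
FIELD_PATTERNS = {
    "account_number": re.compile(r"Account\s*No\.?\s*:\s*(\d+)"),
    "balance": re.compile(r"Balance\s*:\s*([\d,]+\.\d{2})"),
}


def extract_fields(page_text):
    """Return a dict of field name -> captured value (None if not found)."""
    fields = {}
    for name, pattern in FIELD_PATTERNS.items():
        match = pattern.search(page_text)
        fields[name] = match.group(1) if match else None
    return fields


# Made-up text standing in for one matching page.
sample = "Account No: 12345678\nBalance: 1,250.00"
print(extract_fields(sample))
# {'account_number': '12345678', 'balance': '1,250.00'}
```

Returning None for a missing field makes it easy to spot pages where OCR noise broke a label, instead of failing on the first miss.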


