I am trying to extract data from a PDF. I used read PDF activity to get the entire PDF data into a string. Now the extracted data is of format :
01166355A
CP98133
KALILA ABDURRAHMANN/
013004744524
013004744524 16D293577700
05/14/2016-
05/14/2016
HC:99283 $347.00 – -$294.62 $52.38 – $52.38 45 –
Subtotal $347.00 $0.00 -$294.62 $52.38 $0.00 $52.38 $0.00
01166419A
CP98081
YASMINE ABDUSSAMAD/
233029641221
233029641221 16D287324000
05/14/2016-
05/14/2016
HC:99283 $347.00 – -$294.62 $52.38 – $52.38 45 –
Subtotal $347.00 $0.00 -$294.62 $52.38 $0.00 $52.38 $0.00
Here from the above mentioned text I want KALILA ABDURRAHMANN and YASMINE ABDUSSAMAD data to be extracted. Please help!
Hi,
To extract the specific value you need to find the start index and end index of the value and pass these index and get the specific value by using substring
Thank you for your suggestion.
I tried this way.
stringToExtract.Substring(stringToExtract.IndexOf(“0”),stringToExtract.IndexOf(“/”)-stringToExtract.IndexOf(“/”)+19)
But the output am getting is 01166355A CP98133 K
how will i extract a dynamic variable from pdf? i need to extract “name” value from all the available pdfs and the length/ start or end index will not be known. pls provide a solution for it.
Yeah, @Sravenco I was facing the same issue ,people kindly help regarding above issues,
is it possible to Extract dynamic content from pdf. @badita@ddpadil need some help asap.
Does this works for Scanned Pdf and if yes how can i capture the data from pdf by their margin or after the fields such as after Name : xxxxx please suggest thanks in advance