Hope you are doing good, I came across one new scenario (in PDF) and finding it little complicated to get the solution. Below is the scenario:
I am trying to extract the number which is in the format like 2343…-23-2(n number-2 digit-1 digit) from PDF sheet, can you please advise how it is possible to extract all this number which are in the above mentioned format from the PDF sheet which is having multiple pages. Please guide me the process.
Thanks A lot Ashwin, Could you be kind enough to let me know from the starting like how to do it
Adding read PDF text in the sequence then what next i should do, i mean how to proceed
@Rahulsinha In your case you can use 2 ways either using matches activity with below regex pattern
or
use below code in for each
like below
Each time loop runs item.ToString prints new value.
Regarding regex pattern.
Here [\d] matches any digit between 0 to 9
[\d]+ matches 1 or more digits
followed by -
[\d]{2} matches exactly 2 digits
followed by -
[\d]{1} matches exactly 1 digit.
if you still have doubts contact me and check about regex in google once.
I want all that number(format which i told earlier in our conversation) to be extracted in an excel sheet from PDF file that is what i am trying to do bro
Still the same error, could you please elaborate from the starting, what Activities I need to add, what variable should I create, etc (from initial point). below is an example of what I am trying to extract in an excel sheet.
Still Unable buddy to process, getting the error. If you dont mind can you please elaborate from starting I mean I put Read PDF text what variable do I need to create then putting For Each what I need to do.
after that do I need to put Build Data Table , Add Data Row and Write Range to extract the details in PDF, What all variable do I need to create.
Please check whether you have ‘System.Regex’ package or not. If not please install the package from ‘Package Manager’. And if you have scanned pdf then please use ‘Read PDF with OCR’ activity to read the pdf.