I have a PDF file from which I need to extract the two dates in this line: Data richiesta 15/06/2019 Scadenza validità 13/10/2019
How can I do that with regex?
Thank you so much,
Cami
@CamiCat,
In your string, "Data richiesta" is a fixed label; it will always be present.
So use a Split String activity with Data richiesta as the separator. The date is in the part after the separator, i.e. outPut(1), not outPut(0).
Left(outPut(1).Trim, 10) will give you 15/06/2019. Similarly, use Scadenza validità as the separator in another Split String activity, and Left(outPut(1).Trim, 10) will give you 13/10/2019.
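The split idea can be sketched outside UiPath as well. Below is a minimal Python illustration, not the actual workflow: the helper name `date_after` is hypothetical, and it assumes each date is exactly 10 characters (dd/MM/yyyy) and directly follows its label. Splitting on a label leaves the date in the part after the label, so that is the part the sketch reads.

```python
# Sample line taken from the question.
line = "Data richiesta 15/06/2019 Scadenza validità 13/10/2019"


def date_after(text, label):
    """Return the first 10 non-blank-led characters after `label`, or None.

    Hypothetical helper: assumes a dd/MM/yyyy date follows the label.
    """
    parts = text.split(label, 1)  # parts[0] is before the label, parts[1] after
    if len(parts) < 2:
        return None  # label not found
    return parts[1].strip()[:10]


request_date = date_after(line, "Data richiesta")
expiry_date = date_after(line, "Scadenza validità")
print(request_date, expiry_date)
```

If the label text comes out of the PDF with a damaged accent, the second lookup will find nothing, so it is worth checking the extracted text first.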
– Before this, to get the text from the PDF, use the Read PDF Text or Read PDF With OCR activity and store the output in a variable of type String.
– Since this line starts with the words Data richiesta, we can also use string manipulation, like this:
– Use Read PDF Text or Read PDF With OCR and store the output in a String variable named out_string.
– Use an Assign activity like this: out_string_array = out_string.Split(Environment.NewLine.ToArray())
– Use a For Each loop, pass the above variable as its input, and change the TypeArgument property of the For Each loop to String.
– Inside the loop, use an If condition like this: item.Contains("Data richiesta")
If the condition is true, the flow goes to the Then branch, where we can put the expression from the image mentioned above, replacing the in_value variable with item (the For Each loop variable, which holds the value "Data richiesta 15/06/2019 Scadenza validità 13/10/2019").
That's all, you are done.
Hope this helps.
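Since the original question asked about regex, here is a rough Python sketch of the loop above combined with a regular expression. It is only an illustration under assumptions: the sample text in `out_string` is made up, `extract_dates` is a hypothetical name, and the dates are assumed to be in dd/MM/yyyy form. The pattern matches `validit\S*` rather than the accented word so it still works if the PDF extraction damages the accent. The same pattern syntax should also be accepted by .NET's Regex class, though that is not tested here.

```python
import re

# Made-up sample standing in for the text read from the PDF.
out_string = (
    "Some header text\r\n"
    "Data richiesta 15/06/2019 Scadenza validità 13/10/2019\r\n"
    "Some footer text"
)

DATE = r"\d{2}/\d{2}/\d{4}"  # dd/MM/yyyy, digits only (no range validation)
PATTERN = re.compile(
    r"Data richiesta\s+(" + DATE + r")\s+Scadenza validit\S*\s+(" + DATE + r")"
)


def extract_dates(text):
    """Return (request_date, expiry_date) from the first matching line, or None."""
    for item in text.splitlines():      # one item per line, like the For Each loop
        if "Data richiesta" in item:    # same check as the If condition
            match = PATTERN.search(item)
            if match:
                return match.group(1), match.group(2)
    return None


print(extract_dates(out_string))
```

With a regex like this the line-by-line loop is optional, since `PATTERN.search(out_string)` would find the same line directly; the loop is kept here only to mirror the steps described.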
Cheers @CamiCat
@CamiCat, in the recorder there is an image recording mode, and within its screen scraping there is a Scrape Relative option. You can get the values this way; have a look at the attached workflow: rough3.xaml (10.3 KB). Hope this helps.