Extract address from scanned PDF

Hello Guys,

I’m trying to extract an address from different PDFs. The problem is that the addresses in the PDFs are in different formats.

Hi @abdel, please refer to the given link and try it with your PDF files.

If the PDF template stays the same, we can give it a try.

There should be some constant values in the documents, because you need them as anchors to find the address.

Unfortunately, there is no constant value.

I have another question: if you have a string like “dfsffdfsdfsf dsdfsf M. Jean Durand” and you want to extract only “Jean Durand”, how do you proceed?

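(?<=M\.\s).*
If you do this you’ll get Jean Durand. The dot is escaped so that it matches a literal “.”, and \s consumes the space after it so the result has no leading space.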
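To check the lookbehind idea outside the workflow, here is a minimal sketch in Python rather than VB.NET (the pattern syntax used here is shared by both regex engines). The function name `extract_after_title` is made up for illustration, and the sample string is the one from the question with the name spelled “Durand”.

```python
import re

def extract_after_title(text):
    """Return everything after the literal 'M.' and one whitespace character."""
    # (?<=M\.\s) is a lookbehind: it requires "M." plus a whitespace character
    # right before the match, without including them in the result.
    match = re.search(r"(?<=M\.\s).*", text)
    # Fall back to an empty string when the title is not present.
    return match.group(0) if match else ""

sample = "dfsffdfsdfsf dsdfsf M. Jean Durand"
print(extract_after_title(sample))

# For comparison: with an unescaped dot and no \s, the match starts right
# after "M." and therefore keeps the space in front of the name.
print(repr(re.search(r"(?<=M.).*", sample).group(0)))
```

Note that this only works when the name is preceded by “M.”; other titles would need their own alternatives in the lookbehind.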

You can use the regex below to get the name. It starts at the first capital letter that is followed by a word character, so “M.” is skipped:
([A-Z]\w.*)

How do I use it in an Assign activity?

System.Text.RegularExpressions.Regex.Match(StrVariable, "(?<=M\.\s).*").Value

Use the code below in the Assign (note the .Value at the end, since Matches returns Match objects, not strings):
StringVar = System.Text.RegularExpressions.Regex.Matches("dfsffdfsdfsf dsdfsf M. Jean Durand", "([A-Z]\w.*)")(0).Value
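The capital-letter pattern can be tried the same way. This is a rough Python sketch, not UiPath code; `first_capitalized_run` is a hypothetical helper name. It also guards against the case where nothing matches, which in the Assign above would make the `(0)` index fail.

```python
import re

def first_capitalized_run(text):
    """Return the text from the first capital letter followed by a word character."""
    # [A-Z]\w skips "M." because "." is not a word character,
    # then .* takes the rest of the line.
    matches = re.findall(r"[A-Z]\w.*", text)
    # Avoid indexing into an empty result when there is no match.
    return matches[0] if matches else ""

print(first_capitalized_run("dfsffdfsdfsf dsdfsf M. Jean Durand"))
print(repr(first_capitalized_run("no capitals here")))
```

One limitation to keep in mind: if the text before the name contains any capitalized word, the match starts there instead, so this pattern is only reliable when the leading text is all lowercase.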
