String manipulation can sometimes be very hard to handle.
This is my string:
1.2. Nome Robert Negula
1.2. Endereço Rua do circuito do estoril N12DT
1.3 Telefone 916778495
1.4. N° ID Civil 302541244151551
1.5. N° ID Fiscal 2520145741444141
It comes from a PDF file, with the text extracted using Tesseract.
As usual, I put the data into a variable and then assign it to a String array list variable.
But the value after Nome is dynamic, right? So after 1.2 you want to skip the first word (the label)? If yes, which values other than Nome do you also need to get, and is a line like 1.2 Nome ABCD Rober Negula possible too?
Hi @Andre_Vieira
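If you decide to do it with regex:
Assign STRING_VARIABLE = System.Text.RegularExpressions.Regex.Match(INPUT, "(?<=Nome\s)(\S.+)").Value.Trim
Note: INPUT is the variable that stores the text you posted above. Regex.Match returns the first match and .Value gives its text, so the result can be assigned to a String variable (Regex.Matches would return a MatchCollection instead).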
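To see what that pattern captures, here is a minimal sketch in Python rather than a UiPath Assign activity (that swap is my assumption, made only so the example can be run on its own). The sample text is the string from the question, and the variable names are illustrative.

```python
import re

# Sample OCR text from the question (the INPUT variable in the answer above).
INPUT = """1.2. Nome Robert Negula
1.2. Endereço Rua do circuito do estoril N12DT
1.3 Telefone 916778495
1.4. N° ID Civil 302541244151551
1.5. N° ID Fiscal 2520145741444141"""

# Same pattern as in the answer: look behind for "Nome" plus one whitespace
# character, then capture everything up to the end of that line.
PATTERN = r"(?<=Nome\s)(\S.+)"

match = re.search(PATTERN, INPUT)
name = match.group(1).strip() if match else ""
print(name)  # Robert Negula
```

Because the capture runs to the end of the line, a line such as 1.2 Nome ABCD Rober Negula would give ABCD Rober Negula without any extra condition.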
You can use the same code. Let me know which values you don't want to extract.
Run it without the If condition and it will work for all of them; right now it is picking only the values at index 1 and 2.
But how will the bot know what the person's full name is?
In the last example we did the string manipulation for Nome, and because you told us which part is the full name, we picked two index values. For the bot, though, it is hard to identify full names on its own.
You would have to put it in an If condition: if the person's name is this, the bot has to pick this many index values.
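One way to avoid a per-name If condition, assuming each field sits on its own line and the label text is known in advance, is to keep everything that follows the label instead of a fixed number of index positions. A rough sketch in Python (not UiPath; `value_after_label` is a hypothetical helper name):

```python
def value_after_label(line: str, label: str) -> str:
    """Return the text that follows `label` on the line, or "" if the label is absent."""
    # partition splits at the first occurrence of the label:
    # (text before, the label itself, text after).
    _before, found, after = line.partition(label)
    return after.strip() if found else ""


print(value_after_label("1.2. Nome Robert Negula", "Nome"))
print(value_after_label("1.2 Nome ABCD Rober Negula", "Nome"))
print(value_after_label("1.4. N° ID Civil 302541244151551", "N° ID Civil"))
```

This way the number of words in the name no longer matters, and multi-word labels such as N° ID Civil work too. It would still fail if the OCR misreads the label itself.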