Regex Fuzzy logic?

I got a pdf. I read out the text. now i want to look for values in that text by using regex, i.e. “(invoice\s+number)[:\s]+(?inv-[a-zA-Z0-9]+)\s+(order\s+number)[:\s]+(?[a-zA-Z0-9_.-]+)”.

yet sometime in the pdf it’s not the exact word “invoice number” or “order number”. it’s something like “ivoice number”, “invoice numbr”, etc. - so let’s say typos. how can i still find matches in that case?


Regex doesn’t support fuzzy matching.So we need to use other way to achieve it such as Levenshtein distance, i think.


1 Like

Ok that’s good to know! Thanks for your help :slight_smile:

Check this article @rpadevquick it might help.

Regards, :slight_smile:

1 Like

Amazing, thank you! :slight_smile:

1 Like

You are most welcome :slight_smile:

This topic was automatically closed 3 days after the last reply. New replies are no longer allowed.