I have a PDF and have extracted its text. Now I want to find values in that text using a regex, e.g. “(invoice\s+number)[:\s]+(?inv-[a-zA-Z0-9]+)\s+(order\s+number)[:\s]+(?[a-zA-Z0-9_.-]+)”.
However, sometimes the PDF does not contain the exact words “invoice number” or “order number”, but something like “ivoice number” or “invoice numbr”, i.e. typos. How can I still find matches in that case?
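
To make the goal concrete, here is a minimal sketch of one possible approach, assuming Python (the question does not name a language). Instead of hard-coding the label in the regex, it matches any two-word label followed by a value and then accepts the label if its edit distance to the expected one is small. The names `edit_distance`, `find_field`, `CANDIDATE` and `max_typos` are made up for illustration, and the two-word label shape is an assumption based on the examples above.

```python
import re


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: minimum single-character edits turning a into b."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            curr.append(min(
                prev[j] + 1,               # delete a character from a
                curr[j - 1] + 1,           # insert a character into a
                prev[j - 1] + (ca != cb),  # substitute (free if equal)
            ))
        prev = curr
    return prev[-1]


# Loose candidate pattern (an assumption): two words as the label,
# then ":" and/or whitespace, then a value token.
# The lookahead lets candidates overlap, so every word is tried as a label start.
CANDIDATE = re.compile(
    r"(?=\b([a-z]+\s+[a-z]+)[:\s]+([a-z0-9_.-]+))", re.IGNORECASE
)


def find_field(text: str, label: str, max_typos: int = 2):
    """Return the value after the first label within max_typos edits of `label`."""
    for m in CANDIDATE.finditer(text):
        found = " ".join(m.group(1).lower().split())  # normalise case and spacing
        if edit_distance(found, label) <= max_typos:
            return m.group(2)
    return None


text = "ivoice number: inv-A12 order numbr: PO_77.1"
print(find_field(text, "invoice number"))  # inv-A12
print(find_field(text, "order number"))    # PO_77.1
```

The threshold is a trade-off: a larger `max_typos` tolerates more damage but also accepts unrelated labels, so the value itself may still need checking (for example that an invoice number starts with `inv-`). As far as I know, the third-party `regex` package on PyPI also supports approximate matching directly in the pattern, which might be an alternative to the hand-written distance check.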