i want to extract data from a PDF containing multiple invoices with atable which is not constant
Invoice Number
MSHCR78420005670000247288
Invoice Generated on 31/07/2019
Recipient Details
ID 784200056785
TRN 100347258400003
Name MAJID AL FUTTAIM CINEMAS LLC
Address 9TH FLOOR MAF TOWERS
DEIRA
DUBAI
112222
DUBAI
Dubai
United Arab Emirates
Registrant Details
Bank TRN 100282764800003
Bank Name Mashreqbank psc.
Address Omar Bin Al Khattab Street
Deira
BOX 1250
Dubai
United Arab Emirates
For the period 01/07/2019 - 31/07/2019
Summary
SL. No Tax Code Tax Rate % No.of
Transactions
Taxable
Supply in
AED
VAT Paid in
AED
1 SR 5 7 15.87 0.78
1 - Read PDF Text
2 - use matches
to extract Name (?<=Name).*
to extract ID (?<= ID).*
to extract Address (?<=Address),*
and which amount you want and date there are multiple dates !