I have a regex that extract the data from a PDF and below is the result but there is dynamic data in between that I dont want like the Co-Applicant , No inquiry records found and Applicant USTIN CORDARYL Co-Applicant LONG INFILE REPORT : Page 3 of 7".
If data exists like that I wanna remove it and jsut return data that matches the pattern
Based on my current progress I already have extracted the data I need which is NOVUS HOME MORTGAGE , FACTUAL DATA and all the names.
But I mannually removed text that I dont need like Co-Applicant , No inquiry records found. etc using regex like
System.Text.RegularExpressions.Regex.Replace(text,"Applicant.*INFILE REPORT : Page \d+ of \d+.*","")
But I dont wanna do it manually cause those data that will pop is dynamic , Is there a way in regex to remove all the data that did not match the pattern without mannualy deleting each text using regex like what I did on Co-Applicant text.
Help would be much appreciated. Thank you
Result data :
" 08/03/2020 NOVUS HOME Mortgage Company TRU MORTGAGE 07/08/2020 FACTUAL DATA Mortgage Reporter XPN 07/08/2020 FCTUALDATA EFX 07/08/2020 NOVUS HOME Mortgage Company TRU MORTGAGE 07/07/2020 CROSSCOUNTRY Mortgage Loan TRU MORTGAG 07/07/2020 FACTUAL DATA Mortgage Reporter XPN 07/07/2020 FCTUALDATA EFX 05/21/2020 CAP ONE NA Bank Credit Card XPN 05/21/2020 CAPITAL ONE Credit Card TRU 05/21/2020 CAPITALONE Bank EFX 05/20/2020 CROSSCOUNTRY Mortgage Loan TRU MORTGAG 05/20/2020 FACTUAL DATA Mortgage Reporter XPN 05/20/2020 FCTUALDATA EFX 05/20/2020 FINGERHUT/WEBBANK Finance Company XPN 05/07/2020 EMS EFX 05/07/2020 GROW FINANCIAL CREDI Credit Bureau/Mortgage TRU Processing Co-Applicant No inquiry records found. Applicant USTIN CORDARYL Co-Applicant LONG INFILE REPORT : Page 3 of 7"
Current progress :