Using taxonomy for different pdf alignment

The pdfs for which I want to extract the data are having somewhat different alignment, according to the characters length. I have also tried using present validation station. So some of the is data is not getting extracted, how to resolve this.

hey @aditit ,

you can use the activity called regex based extractor with the use of regex.it can help you remaining part which are not extracting.

regards,
sainath