How to extract the data from pdf between two names

Hi All,

I want to extract table data from PDF between two names start name and end name. If data in multiple pages I want to use loop how can I build the logic. Suggestions please

Thanks in advance
Niranjan

@Niranjan_k

you can read multiple pages of pdf at same time using all pages setting in the read pdf text

and then use the regex for rest split/extract

cheers

@Anil_G will this be work for table data exact from PDF

@Niranjan_k

Try it out with out looking at for mat and all we might not say…but if keywords are constant it would extract all data inbetween in a string

after that may be you need to use generate table or etc to convert data to table

cheers

@Anil_G could please help me with workflow not sure how to implement

@Niranjan_k

  1. Read pdf with property as all pages
  2. regex activity with the above regex

cheers

@Anil_G Anil I have tried with option Regex for is not working for me. As I do not have specific character to separate the data. I want to extract the full table data if it is multiple pages based on start name and end name. Any suggestions please

@Niranjan_k

I am little confused…You are saying start and end and then again you are saying not constant

please share a sample pdf

or try with document understanding

cheers

@Anil_G Anil Actaully I’m using data scrapping to extract the table details, it is not extracting table data when table details in multiple pages. Any way to fix this. One thought I got is t possible to extract table with two strings characters using any loop function. Suggestions please. Sorry for my poor explanation

@Niranjan_k

We need some sample

cheers

@Anil_G My bad luck here I do not have access to upload any data. I’m discussing on my mobile