Read pdf multiple pdf files

How extract text from multiple pdf files some specific data


you have multiple options here

  1. Read pdf and use regex
  2. Use ml models
  3. use form extractors


Hi @suraj_gaikwad

Can you be more specific about your requirements? Maybe the input & expected output will help us analyze your requirements.

Best Regards.

Hi @suraj_gaikwad

Check out the link to read multiple PDF

Check out this video to extract the particular data


Multiple pdf are downloaded after moving in different folder now I have to read that all pdf but some specific data from pdf files

Any example?
Ml models?


Please check this.I hope those are invoices …the steps would be same, there are more models you can check

Invoice Processing with Document Understanding | Community Blog.


Pdf reading done using Du but bot is reading all pdf pages not able extract text which I have configured text

Hi @suraj_gaikwad

Use string manipulation methods or regular expressions to search and extract the specific data you need from the extracted text. Depending on the structure of the PDF and the data you want to extract, you may need to use techniques such as string splitting, pattern matching, or custom logic to extract the desired information.


Could you please tell me about what string i have to put and where I can use that