I would like to shred each line in my PDF to successive rows in an excel file. Here is what I’m looking for:
- Each line should be written into new rows in excel.
- A column against each row which identifies the paragraph number, line number and page number of line that was shredded.
- I have a dictionary of keywords on excel, I want to check if each shredded line contains ANY of my keywords. If the keyword is detected, the corresponding cell in excel needs to be highlighted.
I’m a beginner and so far I have been able to read the PDF file, write data into a text file and then to the excel. However, the entire data from the text file is being written into the first row. I want each line to be written into new rows. I tried using the split function, but am not able to achieve this.
Could one you please help me out?
I’m actually working on an enterprise project, if anyone of you is interested to collaborate please do let me know.