I have a list of documents in my folder, around 3000, which consists of doc, docx, rtfs and pdfs. I ahve a list of keywords to search on each of the doc (keywords remain the same). Which is the best way to do it ? Document understanding seems a vast topic , any specific leads would be really appreciated.
i have 12 keywords to search in each document and want to categorise them based on their match.
I tried getfiles from folder and reading them as a text. But using the option of ‘Read text’ and ‘Read pdf’ I am only able to read .docx and pdf. I am unable to read .doc and .rtf files. How do i read them ?