I’m working on a project where I need to delete any duplicate records of a document in a folder. We’ve decided the best way to do this is to check the file sizes of the documents in the folder, and delete any duplicates (retaining one original).
I’ve been given a bit of a lead, but I wasn’t sure what to do with it, or where to put it -
I believe this is meant to be an Int64?
Can I have an idea of a xml that could achieve this? The documents are all going to be PDFs
@Sheri
I dont think i will be a good idea to assume two files are same if they have same size…
imaging two files having content 11112222 and 12121212 respectively…
these two file will have same size but data is not same…
Instead of checking file size, go for content of PDF files…
dt = new DataTable
For each pdf in Folder
str = Read PDF text
DT.AddRow = {str,pdf.path}
Next