Skip downloading the same file

Hello community!


In the image above I have highlighted the subject name. These are different files with the same subject name, but the bot downloads the same file 4 times. The only unique value is the notice URL, and I am downloading the files through that URL. What logic can I add so that the same file is not downloaded again and again?

Hi @Priyesh_Shetty,
If the file headers are the same, merge all the files and remove duplicates using the notice URL. This way you can avoid duplicate downloads.

Let me know if you have any questions.

Thanks
Muthu

When the same notice URL occurs multiple times within the Excel file shown above, we can deduplicate the data based on the notice URL with the help of a LINQ Group By.
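To illustrate the group-by idea in a language-neutral way, here is a minimal Python sketch (not the LINQ statement itself). It keeps the first row for each distinct notice URL. The column names `Subject Name` and `Notice Url` and the example URLs are assumptions, since the real sheet structure is not shown in the thread.

```python
def dedupe_by_url(rows, url_key="Notice Url"):
    """Keep the first row seen for each distinct URL, preserving row order."""
    first_by_url = {}
    for row in rows:
        # setdefault stores the row only if this URL has not been seen yet
        first_by_url.setdefault(row[url_key], row)
    return list(first_by_url.values())


# Hypothetical sample data: same subject name, partly repeated URLs
rows = [
    {"Subject Name": "Notice A", "Notice Url": "https://example.com/notice/1"},
    {"Subject Name": "Notice A", "Notice Url": "https://example.com/notice/1"},
    {"Subject Name": "Notice A", "Notice Url": "https://example.com/notice/2"},
    {"Subject Name": "Notice B", "Notice Url": "https://example.com/notice/3"},
]

unique_rows = dedupe_by_url(rows)
```

Rows that share a subject name but have different URLs are all kept; only rows repeating the same URL are dropped.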

@muthuerd @ppr the second subject name is not a duplicate. I want to download that file as well; only the name is the same.

Maybe you can illustrate this in detail with some data samples. This would clarify things and speed up finding a solution approach. Thanks for your support.

Please check the file downloading logic if the same file is being downloaded 4 times.

@jai_kumar2 @ppr



As you can see in the image above, I have kept the file names which I want to download in an array. Note that in this xlsx file there is no duplicate subject name.

We cannot, as the screenshots are not showing complete structure samples.

However, we can:

  • deduplicate the URLs
  • defensively check which file name would be retrieved when calling a URL, and then decide whether we want to download it or not
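The defensive check from the second bullet could look roughly like the Python sketch below. It assumes the server announces the file name in a `Content-Disposition` response header and otherwise falls back to the last segment of the URL path; whether the notice site actually behaves this way is not known from the thread. The function names `expected_filename` and `should_download` are hypothetical.

```python
from email.message import Message
from posixpath import basename
from urllib.parse import unquote, urlparse


def expected_filename(url, headers):
    """Guess which file name a URL would deliver, without saving anything."""
    disposition = headers.get("Content-Disposition")
    if disposition:
        # Reuse the standard library header parser to read the filename parameter
        msg = Message()
        msg["Content-Disposition"] = disposition
        name = msg.get_filename()
        if name:
            return name
    # Fallback (assumption): last path segment of the URL
    return unquote(basename(urlparse(url).path))


def should_download(filename, already_downloaded):
    """Download only if this file name has not been retrieved before."""
    return filename not in already_downloaded
```

The `headers` argument is a plain dictionary here so that the check can be tried out without a network call; in a real workflow it would come from the response of the download request.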

@ppr how can I implement this logic? If the bot is trying to download the same notice URL of the same subject name, it should skip it and go on to the next file to download.

So is the URL info distinct or duplicated within the Excel/CSV data?

@ppr yes, the URL is distinct. Next to every subject name there is a URL to download the file for that subject name.
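Given that the URL is the distinct value and the subject name can repeat, one possible skip logic is to drive the loop by URL instead of by subject name. The Python sketch below is only an illustration under assumptions: the column names, the `download` callback and the `_2`, `_3` suffix scheme are all hypothetical and not taken from the actual workflow.

```python
def download_all(rows, download, url_key="Notice Url", name_key="Subject Name"):
    """Download each distinct URL once; give repeated subject names a suffix."""
    seen_urls = set()
    name_counts = {}
    saved = []
    for row in rows:
        url = row[url_key]
        if url in seen_urls:
            # Same notice URL was already handled: skip and go to the next row
            continue
        seen_urls.add(url)

        subject = row[name_key]
        name_counts[subject] = name_counts.get(subject, 0) + 1
        count = name_counts[subject]
        # Second and later files with the same subject get "_2", "_3", ...
        target = subject if count == 1 else f"{subject}_{count}"

        download(url, target)
        saved.append((url, target))
    return saved
```

Because each target name is derived from a counter, a second file with the same subject name is saved next to the first one instead of replacing it or triggering a repeat download of the first file.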