Hi, I have a task to remove duplicate rows from a DataTable.
If I do it using brute force by comparing one row by every other row it results in a very slow algorithm especially with large data sets.
How do I use self-join on the DataTable to remove duplicate rows very quickly by processing them in bulk?
Input: raw file, contains duplicate rows
Output: Only unique rows