Remove duplicates in two columns and keep rest of all the column values

prabhu_ponnusamy · August 27, 2021, 6:33am

Hi all,

need to remove duplicates in two columns (Customer Code,Jurisdiction Name) the attached sheet and also keep rest of all the column values.

if I use dt_Inputsheet.DefaultView.ToTable(True,“Customer Code”,“Jurisdiction Name”) syntax I am getting these two columns alone in the output dt . but i need rest of all the columns and corresponding values.

Yoichi · August 27, 2021, 6:44am

Hi,

Perhaps you should use Linq - GroupBY method. The following topic might help you.

If you can share your excel file, we might write expression for it.

Regards,

prabhu_ponnusamy · August 27, 2021, 7:02am

Sample.xlsx (6.8 MB)

remove duplicates for “Customer Code”, “Jurisdiction Name” columns and keep all the columns.

Yoichi · August 27, 2021, 7:31am

Hi,

How about the following sample?

img20210827-2

dt.AsEnumerable.GroupBy(Function(r) Tuple.Create(r("Customer Code").ToString,r("Jurisdiction Name").ToString)).Select(Function(g) g.First).CopyToDataTable

Sample20210827-1.zip (2.3 KB)

Note: More specifically, you need to decide which duplicated rows to keep.

Regards,

Jeroen · August 27, 2021, 8:11am

@Yoichi nice solution! I’ve been seeing a lot of these Linq statements in the forum, and I’d like to learn more about it. Is there a resource you can recommend?

Yoichi · August 27, 2021, 8:29am

Hi @Jeroen ,

Unfortunately, I’m not very familiar with good LINQ resources in English because my native language is non-English (Japanese). Now @ppr is working on making documents for LINQ (as the following, for example), and it will help us better understanding LINQ, I think.

Regards,

ppr · August 27, 2021, 8:33am

Answered with alternate on the forked / duplicated topic thread:

Topic		Replies	Views
Remove duplicates in DT for specific columns and keep all column and values Studio datatable , excel , studio , question , activities_panel	5	3447	August 27, 2021
Keep structure of table as it is and take distinct row Help excel , activities	10	964	March 11, 2020
Remove Duplicate rows based on single column and that too if cell starts with specific values Activities excel , activities , question	22	923	October 12, 2023
Keep unique of each Duplicate Studio studio , question , activities_panel	4	167	March 10, 2024
How to delete dublicate values in excel. based on 2 different columns dats Activities excel , activities , question	20	1163	June 2, 2022

Remove duplicates in two columns and keep rest of all the column values

Related topics