Remove Duplicate Rows based on Column

Yudhisteer_Chintaram1 · March 24, 2021, 7:55am

Hi,

I have this excel sheet where I want to:

Remove all duplicate rows based on the column named “PO”.
Keep all the duplicate rows ONLY based on the column named “PO”.

I tried the first part (removing duplicate rows), but it keeps one row from each set of duplicates and deletes the rest.
I used this:
dt.AsEnumerable().GroupBy(Function(x) Convert.ToString(x.Field(Of Object)("PO"))).SelectMany(Function(gp) gp.ToArray().Take(1))

Modified_ASOS_June.xlsx (21.8 KB)

Can you help me please?
RemoveDuplicateRows (1).xaml (6.4 KB)

ManiPrajwal_K · March 24, 2021, 1:55pm

You can use LINQ to remove dups.
For example:
dt.AsEnumerable().GroupBy(Function(r) r.Field(Of String)("<Col Name>")).Select(Function(s) s.First()).CopyToDataTable - This keeps the first row for each distinct value in the specified column and drops the remaining duplicates.

dt.DefaultView.ToTable(True) - It will create a new Datatable with unique rows with respect to every column.

Both return a new DataTable with no dups.
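
For the two cases in the original question (drop every row whose "PO" is repeated, or keep only the rows whose "PO" is repeated), one possible approach is to filter the groups by their size instead of taking the first row. The following is only a sketch: the module name, the function names, and the sample data are made up for illustration, and it assumes the column is called "PO" as in the question.

```vb
Imports System
Imports System.Data
Imports System.Linq

Module DuplicateDemo
    ' Hypothetical sample data standing in for the Excel sheet.
    Function BuildSample() As DataTable
        Dim dt As New DataTable()
        dt.Columns.Add("PO", GetType(String))
        dt.Columns.Add("Item", GetType(String))
        dt.Rows.Add("A1", "pen")
        dt.Rows.Add("A1", "ink")
        dt.Rows.Add("B2", "cup")
        Return dt
    End Function

    ' Rows whose PO value occurs exactly once (all duplicated POs are dropped).
    Function RowsWithUniquePO(dt As DataTable) As DataTable
        Dim rows = dt.AsEnumerable().
            GroupBy(Function(r) Convert.ToString(r("PO"))).
            Where(Function(g) g.Count() = 1).
            SelectMany(Function(g) g).ToList()
        ' CopyToDataTable throws on an empty sequence, so fall back to an empty clone.
        Return If(rows.Any(), rows.CopyToDataTable(), dt.Clone())
    End Function

    ' Rows whose PO value occurs more than once (only the duplicates are kept).
    Function RowsWithDuplicatedPO(dt As DataTable) As DataTable
        Dim rows = dt.AsEnumerable().
            GroupBy(Function(r) Convert.ToString(r("PO"))).
            Where(Function(g) g.Count() > 1).
            SelectMany(Function(g) g).ToList()
        Return If(rows.Any(), rows.CopyToDataTable(), dt.Clone())
    End Function

    Sub Main()
        Dim dt = BuildSample()
        Console.WriteLine(RowsWithUniquePO(dt).Rows.Count)     ' prints 1 (the B2 row)
        Console.WriteLine(RowsWithDuplicatedPO(dt).Rows.Count) ' prints 2 (both A1 rows)
    End Sub
End Module
```

In Studio, the body of either function could probably be used as the right-hand side of an Assign, with the empty-result check handled separately.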

Yudhisteer_Chintaram1 · March 24, 2021, 7:26pm

Thanks for the help!

dt.DefaultView.ToTable(True) - It will create a new Datatable with unique rows with respect to every column.

How can I get unique rows with respect to a SPECIFIC column then?

prasath17 · March 24, 2021, 7:29pm

Hi…you can try like this…

InputDT.DefaultView.ToTable(True, "Column Name")
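
One thing to be aware of with this overload: the returned table contains only the columns you name, so the other columns are dropped. A small sketch of that behaviour, using made-up sample data (the helper name DistinctOn and the columns "PO" and "Item" are assumptions for illustration):

```vb
Imports System
Imports System.Data

Module ToTableDemo
    ' Hypothetical sample data.
    Function BuildSample() As DataTable
        Dim inputDT As New DataTable()
        inputDT.Columns.Add("PO", GetType(String))
        inputDT.Columns.Add("Item", GetType(String))
        inputDT.Rows.Add("A1", "pen")
        inputDT.Rows.Add("A1", "ink")
        inputDT.Rows.Add("B2", "cup")
        Return inputDT
    End Function

    ' Thin wrapper around DataView.ToTable(distinct, columnNames).
    Function DistinctOn(dt As DataTable, ParamArray columnNames As String()) As DataTable
        Return dt.DefaultView.ToTable(True, columnNames)
    End Function

    Sub Main()
        Dim inputDT = BuildSample()

        ' Distinct on "PO" only: the result has a single column.
        Dim onlyPO = DistinctOn(inputDT, "PO")
        Console.WriteLine(onlyPO.Columns.Count) ' prints 1
        Console.WriteLine(onlyPO.Rows.Count)    ' prints 2

        ' Naming both columns makes the distinct check apply to the pair.
        Dim pairs = DistinctOn(inputDT, "PO", "Item")
        Console.WriteLine(pairs.Rows.Count)     ' prints 3
    End Sub
End Module
```

If you need the other columns kept while deduplicating on one column only, the GroupBy approach is likely the better fit.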

ManiPrajwal_K · March 25, 2021, 6:45am

Please try the first LINQ statement from my earlier reply, @Yudhisteer_Chintaram1

PASB_BOT · February 3, 2023, 8:28am

How do we get distinct rows based on one column while keeping another column intact?

pchaconsantana · March 14, 2024, 1:05pm

Hey, I have another solution:

You can group the unique values of each column (which removes the duplicates) and then write them into a new table.
Example:

Build a DataTable with duplicates: vDt1

Note: vDt2 is an empty copy of vDt1
For Each:
List of items = vDt1.AsEnumerable().GroupBy(Function(x) x("Valor1").ToString)
Item: groupby_result

Body:
First, check whether the table already has a row at the index of the current group item:
If: vDt1.Rows.Count <= vIntRowValue
Then: Add Data Row with empty values
Finally, Assign:
vDtDatosOCR.Rows(vIntRowValue)("Value1") = groupby_result.Key.ToString

At the end, we obtain the table without duplicates.
vDt1 - No duplicates in column "Value1"

Note: You have to do the same for each column.
Greetings
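
The workflow above could be sketched in plain VB.NET roughly as follows. This is one reading of the steps, not the exact workflow: the names DistinctPerColumn, source and result are made up, the row-count check is done against the destination table, and the sample columns are hypothetical.

```vb
Imports System
Imports System.Data
Imports System.Linq

Module PerColumnDistinct
    ' Builds a new table where each column holds the distinct values
    ' of the same column in the source table.
    Function DistinctPerColumn(source As DataTable) As DataTable
        Dim result As DataTable = source.Clone() ' same schema, no rows
        For Each col As DataColumn In source.Columns
            Dim name As String = col.ColumnName
            Dim rowIndex As Integer = 0
            For Each grp In source.AsEnumerable().GroupBy(Function(r) r(name).ToString())
                ' Add an empty row when the destination has none at this index yet.
                If result.Rows.Count <= rowIndex Then
                    result.Rows.Add(result.NewRow())
                End If
                result.Rows(rowIndex)(name) = grp.Key
                rowIndex += 1
            Next
        Next
        Return result
    End Function

    Sub Main()
        Dim dt As New DataTable()
        dt.Columns.Add("Value1", GetType(String))
        dt.Columns.Add("Value2", GetType(String))
        dt.Rows.Add("a", "x")
        dt.Rows.Add("a", "y")
        dt.Rows.Add("b", "y")

        Dim result = DistinctPerColumn(dt)
        Console.WriteLine(result.Rows.Count) ' prints 2
    End Sub
End Module
```

Note that each column is deduplicated independently here, so the values on one output row no longer necessarily come from the same input row.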
