Delete duplicate and keep uniques

Robert_Wennberg · April 2, 2020, 3:15am

This problem is pretty similar to removing duplicates, except that I want to delete everything except for the unique strings that remain.
For example, I have two Excel files containing several URLs, and when I remove the duplicate rows the result looks like this:

Excel1:
www.123.com
www.321.com
www.cba.com

Excel2:
www.123.com
www.321.com
www.cba.com
www.helloworld.com

Result:
www.123.com
www.321.com
www.cba.com
www.helloworld.com

What I aim to achieve:
www.helloworld.com

So I want to get rid of all the rows and data except for the entries that are unique, meaning they appear only once.
Is there any way to achieve that through a DataTable filter or something similar?

Best regards
Robert

supermanPunch · April 2, 2020, 3:52am

@Robert_Wennberg So you want to get the unique items from both Excel files?

Robert_Wennberg · April 2, 2020, 3:57am

Sorry, poorly explained.
I want to check each URL in Excel2 against Excel1, and if it is a duplicate, delete it from Excel2 and keep only the unique items.
Excel1 is used as a reference and has more URLs than Excel2, if that makes any more sense.

supermanPunch · April 2, 2020, 4:00am

@Robert_Wennberg I think you want to find the items from Excel2 which are not present in Excel1. Is that right?

Robert_Wennberg · April 2, 2020, 4:02am

Yeah that’s exactly what I am trying to do! Sorry for explaining it so badly

supermanPunch · April 2, 2020, 4:23am

@Robert_Wennberg Can you follow these steps:

1. Read the Excel1 file using a Read Range activity and store the output as a DataTable, say DT1.
2. Read the Excel2 file using a Read Range activity and store the output as a DataTable, say DT2.
3. Use an Assign activity with the expression below:
DT1 = DT2.AsEnumerable().Where(Function(row) Not DT1.AsEnumerable().Select(Function(r) r("ColName").ToString).Any(Function(x) x = row("ColName").ToString)).CopyToDataTable()
4. Write the DT1 DataTable to an Excel file using Write Range and check whether that is the output you need.
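
For anyone who wants to try the expression outside of Studio, here is a minimal sketch of the same filter as a standalone VB.NET console program. The BuildTable helper, the module name, and the sample URLs (taken from the first post) are only there for illustration and are assumptions, not part of the workflow. The column is assumed to be called "URL".

```vb
Imports System
Imports System.Data
Imports System.Linq

' Sketch only: on .NET Framework, AsEnumerable and CopyToDataTable
' need a reference to System.Data.DataSetExtensions.
Module AntiJoinSketch

    ' Hypothetical helper: builds a one-column table of URLs for the demo.
    Function BuildTable(ParamArray urls As String()) As DataTable
        Dim dt As New DataTable()
        dt.Columns.Add("URL", GetType(String))
        For Each u As String In urls
            dt.Rows.Add(u)
        Next
        Return dt
    End Function

    Sub Main()
        ' DT1 plays the role of Excel1 (reference), DT2 the role of Excel2.
        Dim DT1 As DataTable = BuildTable("www.123.com", "www.321.com", "www.cba.com")
        Dim DT2 As DataTable = BuildTable("www.123.com", "www.321.com", "www.cba.com", "www.helloworld.com")

        ' Keep only the DT2 rows whose URL does not occur anywhere in DT1.
        Dim result As DataTable = DT2.AsEnumerable().Where(Function(row) Not DT1.AsEnumerable().Select(Function(r) r("URL").ToString).Any(Function(x) x = row("URL").ToString)).CopyToDataTable()

        For Each row As DataRow In result.Rows
            Console.WriteLine(row("URL")) ' prints www.helloworld.com
        Next
    End Sub
End Module
```

Note that the inner DT1 query runs once per DT2 row. That should be fine for a few hundred rows, but for large sheets it may be worth collecting the DT1 URLs into a list first.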

Robert_Wennberg · April 2, 2020, 4:31am

I will see if I can make it work and get back to you with my results!
Thank you

Robert_Wennberg · April 2, 2020, 4:36am

The assign activity is giving me an error saying that ‘(’ is missing. However, I am not sure where…
This is what the activity I am pasting is looking like so far:

DT2.AsEnumerable().Where(function(row) Not DT1.AsEnumerable().Select(function) r(“URL”).ToString).Any(function(x) x = row(“URL”).ToString)).CopyToDataTable()

supermanPunch · April 2, 2020, 4:40am

@Robert_Wennberg Sorry, my bad, I missed a bracket. Can you try this:
DT2.AsEnumerable.Where(Function(row) Not DT1.AsEnumerable.Select(Function(r) r("URL").ToString).Any(Function(x) x = row("URL").ToString)).CopyToDataTable()

Robert_Wennberg · April 2, 2020, 4:43am

Yeah I couldn’t see where it was missing haha!
After pasting it, I get an error once again which I have seen before but not sure how to fix or what it means.
AsEnumerable is not a member of ‘System.Data.Datatable’
How would you fix that?

supermanPunch · April 2, 2020, 4:47am

@Robert_Wennberg Can you check this post for that? I don't really know when exactly that error occurs, but I suggested a solution in this post and it should work:

Robert_Wennberg · April 2, 2020, 4:57am

I managed to solve the issue and make the automation work. However, the result is not really what I wanted.
I did get fewer results, like I wanted, but the results I got are duplicates from the other Excel…

supermanPunch · April 2, 2020, 5:01am

@Robert_Wennberg Can you provide the Excel files?

Robert_Wennberg · April 2, 2020, 5:03am

I was just about to do that.
20200402.xlsx (11.6 KB) RemoveDupes.xaml (9.1 KB) SearchedApartments.xlsx (73.3 KB)

The expected result would be the row with the url:

That is the only row that I would like to be transferred to the new workbook/DT, seeing as all the other rows are duplicates.

supermanPunch · April 2, 2020, 5:16am

@Robert_Wennberg Correct me if I'm wrong: you want to remove all the rows in the 20200402 Excel file whose URLs are present in the SearchedApartments Excel file. Am I right?

Robert_Wennberg · April 2, 2020, 5:17am

That is correct

Robert_Wennberg · April 2, 2020, 5:48am

Sorry, I was being stupid and compared the wrong files… I do believe the solution you sent me is working as intended. Thank you so much for helping and I’m sorry for wasting your time!

Robert_Wennberg · April 2, 2020, 6:12am

I am trying out the solution you gave me and it works fine when the filter returns DataRows, but when it returns nothing, the Assign activity throws an error saying "The source contains no DataRows". I tried using an If statement to bypass it, but with no luck. Any ideas on how to fix it?

supermanPunch · April 2, 2020, 6:31am

@Robert_Wennberg You can wrap it in a Try Catch, or there is a different method that uses an array-of-DataRow variable. With Try Catch, just put this Assign statement in the Catch section:

DT = DT1.Clone
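
The array-of-DataRow method could look roughly like the sketch below. CopyToDataTable throws "The source contains no DataRows" when it is given an empty sequence, so the idea is to collect the matching rows into an array first and only copy them when there is at least one, falling back to an empty table with the same columns otherwise. The BuildTable helper, the variable names, and the sample data are assumptions for illustration. In a workflow, the two Dim lines for rows and result would presumably become two Assign activities.

```vb
Imports System
Imports System.Data
Imports System.Linq

' Sketch only: on .NET Framework, AsEnumerable and CopyToDataTable
' need a reference to System.Data.DataSetExtensions.
Module EmptyResultSketch

    ' Hypothetical helper: builds a one-column table of URLs for the demo.
    Function BuildTable(ParamArray urls As String()) As DataTable
        Dim dt As New DataTable()
        dt.Columns.Add("URL", GetType(String))
        For Each u As String In urls
            dt.Rows.Add(u)
        Next
        Return dt
    End Function

    Sub Main()
        ' Every URL in DT2 already exists in DT1, so the filter matches nothing.
        Dim DT1 As DataTable = BuildTable("www.123.com", "www.321.com")
        Dim DT2 As DataTable = BuildTable("www.123.com", "www.321.com")

        ' Collect the reference URLs once, then filter DT2 into a DataRow array.
        Dim knownUrls As String() = DT1.AsEnumerable().Select(Function(r) r("URL").ToString()).ToArray()
        Dim rows As DataRow() = DT2.AsEnumerable().Where(Function(row) Not knownUrls.Contains(row("URL").ToString())).ToArray()

        ' Only call CopyToDataTable when there is something to copy;
        ' otherwise use an empty table that keeps the column layout.
        Dim result As DataTable = If(rows.Length > 0, rows.CopyToDataTable(), DT2.Clone())

        Console.WriteLine(result.Rows.Count)            ' prints 0
        Console.WriteLine(result.Columns(0).ColumnName) ' prints URL
    End Sub
End Module
```

Because the fallback is DT2.Clone(), a later Write Range still gets a table with the right headers even when no rows are left.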

Robert_Wennberg · April 2, 2020, 6:37am

I tried what you asked but it is still throwing the same error when it gets to the assign activity you taught me.
