I need to remove rows from a DT when they have “C” in column B. I also need to remove every other row that shares the same column A value as such a row. For example, if a row with “4” in column A has “C” in column B, all rows with “4” in column A must be removed.
As you can see in the picture below, column A has some identical values across rows. I need to remove all of the rows sharing a value, even if only one of them has “C” in column B. If none of them has “C”, I keep them.
Whether a column A value appears on multiple rows or on a single row, if it has “C” in column B, it should be removed.
I thought about using something like this; the idea is close:
Sorry for the late response. Thank you for the solution, it’s exactly what I’m looking for, but there’s something I don’t understand. The first part, the filter by “Currently”, works fine, but once I use the group function it doesn’t work unless I’m using your example. I must have missed something. Can you please clarify how the group part works? Thank you in advance.
The above expression groups the rows by column A and filters out the groups whose number of rows is not 1. I might have misunderstood your requirement. Can you share more of your input and expected output?
I don’t understand what you mean by “the number of rows is not 1”.
Example :
Column A Column B
222222224 Currently
222222224 NOT Currently
222222224 NOT Currently
222222224 Currently
222222224 NOT Currently
555555552 NOT Currently
555555552 NOT Currently
666666631 Currently
666666631 Currently
666666631 Currently
666666631 Currently
666666647 NOT Currently
666666647 NOT Currently
666666647 NOT Currently
666666647 Currently
666666647 NOT Currently
666666671 NOT Currently
As you can see here, the same number “222222224” appears on multiple rows in column A. Normally each row would be processed on its own, but since at least one of those rows has the value “Currently” in column B (no matter whether it’s one row or several), none of the rows with that number “222222224” should be processed.
So only the numbers with no row containing the value “Currently” should be taken. As a result, I should have something like this as an OUTPUT:
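For readers following along, the rule described above can be sketched in Python. This is only an illustration of the logic, not the expression used in this thread: the function name `drop_flagged_groups` and the tuple layout are assumptions, and the rows are copied from the example above.

```python
# Rows copied from the example: (Column A, Column B).
rows = [
    ("222222224", "Currently"),
    ("222222224", "NOT Currently"),
    ("222222224", "NOT Currently"),
    ("222222224", "Currently"),
    ("222222224", "NOT Currently"),
    ("555555552", "NOT Currently"),
    ("555555552", "NOT Currently"),
    ("666666631", "Currently"),
    ("666666631", "Currently"),
    ("666666631", "Currently"),
    ("666666631", "Currently"),
    ("666666647", "NOT Currently"),
    ("666666647", "NOT Currently"),
    ("666666647", "NOT Currently"),
    ("666666647", "Currently"),
    ("666666647", "NOT Currently"),
    ("666666671", "NOT Currently"),
]


def drop_flagged_groups(rows, flag="Currently"):
    """Drop every row whose Column A value has at least one flagged row.

    Hypothetical helper: a Column A value is "flagged" as soon as any of
    its rows has Column B exactly equal to `flag`.
    """
    # Pass 1: collect the Column A values that must be excluded entirely.
    flagged = {a for a, b in rows if b == flag}
    # Pass 2: keep only rows whose Column A value was never flagged.
    return [(a, b) for a, b in rows if a not in flagged]


result = drop_flagged_groups(rows)
for a, b in result:
    print(a, b)
# 555555552 NOT Currently
# 555555552 NOT Currently
# 666666671 NOT Currently
```

The two-pass approach avoids building explicit groups: it only needs to know which Column A values are "tainted", so the original row order is preserved.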
Thank you for your response. It worked fine for the first few thousand rows, then at some point it started taking “Currently” rows instead. Would you by any chance know what went wrong? I mean, it should work even if the file contains more than 5k rows?
There might be extra whitespace in the remaining data, because the above expression strictly checks for “Currently”. Can you check this? If so, the following will work.
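As a language-neutral illustration of the whitespace issue (a Python sketch, not the expression used in this thread), trimming both columns before comparing makes a value such as “Currently ” with a trailing space match as well. The helper name `drop_flagged_groups_trimmed` and the sample values `111` and `222` are made up for the example.

```python
def drop_flagged_groups_trimmed(rows, flag="Currently"):
    """Same rule as before, but ignoring leading/trailing whitespace.

    Hypothetical helper: both columns are stripped before comparison, so
    "Currently " and " 111" are treated like "Currently" and "111".
    """
    # Collect trimmed Column A values that have at least one flagged row.
    flagged = {a.strip() for a, b in rows if b.strip() == flag}
    # Keep the original (untrimmed) rows whose trimmed key is not flagged.
    return [(a, b) for a, b in rows if a.strip() not in flagged]


# Made-up sample: the first row has a trailing space after "Currently".
sample = [
    ("111", "Currently "),
    ("111", "NOT Currently"),
    ("222", " NOT Currently"),
]

trimmed_result = drop_flagged_groups_trimmed(sample)
print(trimmed_result)  # [('222', ' NOT Currently')]
```

With a strict comparison, the row `("111", "Currently ")` would not be recognised as flagged, and both `111` rows would wrongly survive, which matches the symptom of “Currently” rows appearing in the output.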