在c#中从List<T>中删除重复项

谁有一个快速的方法去重复在c#的泛型列表?

当前回答

使用HashSet可以很容易地做到这一点。

List<int> listWithDuplicates = new List<int> { 1, 2, 1, 2, 3, 4, 5 };
HashSet<int> hashWithoutDuplicates = new HashSet<int> ( listWithDuplicates );
List<int> listWithoutDuplicates = hashWithoutDuplicates.ToList();

2021-06-20 15:09:22

其他回答

这里有一个扩展的方法来删除相邻的副本原位。首先调用Sort()并传入相同的ic比较器。这应该比Lasse V. Karlsen的版本更有效，后者重复调用RemoveAt(导致多次块内存移动)。

public static void RemoveAdjacentDuplicates<T>(this List<T> List, IComparer<T> Comparer)
{
    int NumUnique = 0;
    for (int i = 0; i < List.Count; i++)
        if ((i == 0) || (Comparer.Compare(List[NumUnique - 1], List[i]) != 0))
            List[NumUnique++] = List[i];
    List.RemoveRange(NumUnique, List.Count - NumUnique);
}

2011-02-25 06:15:44

如果你不关心顺序，你可以把这些项推到HashSet中，如果你想保持顺序，你可以这样做:

var unique = new List<T>();
var hs = new HashSet<T>();
foreach (T t in list)
    if (hs.Add(t))
        unique.Add(t);

或者用Linq的方式:

var hs = new HashSet<T>();
list.All( x =>  hs.Add(x) );

编辑:HashSet方法是O(N)时间和O(N)空间，而排序，然后使唯一(由@lassevk和其他人建议)是O(N*lgN)时间和O(1)空间，所以我不太清楚(因为它是第一眼)，排序方式是较差的

2008-09-06 19:32:48

使用Linq的Union方法。

注意:这个解决方案不需要了解Linq，只需要知道它存在。

Code

首先将以下内容添加到类文件的顶部:

using System.Linq;

现在，你可以使用下面的方法从一个名为obj1的对象中删除重复项:

obj1 = obj1.Union(obj1).ToList();

注意:将obj1重命名为对象的名称。

它是如何工作的

Union命令列出两个源对象的每个条目中的一个。由于obj1都是源对象，这将把obj1减少为每个条目中的一个。 ToList()返回一个新的List。这是必要的，因为像Union这样的Linq命令将结果返回为IEnumerable结果，而不是修改原来的List或返回一个新的List。

2018-02-13 12:56:58

通过Nuget安装MoreLINQ包，你可以很容易地通过属性区分对象列表

IEnumerable<Catalogue> distinctCatalogues = catalogues.DistinctBy(c => c.CatalogueCode);

2017-03-15 14:51:58

所有的答案要么复制列表，要么创建一个新列表，要么使用慢函数，要么就是慢得令人痛苦。

据我所知，这是我所知道的最快和最便宜的方法(同时，还得到了一个非常有经验的实时物理优化程序员的支持)。

// Duplicates will be noticed after a sort O(nLogn)
list.Sort();

// Store the current and last items. Current item declaration is not really needed, and probably optimized by the compiler, but in case it's not...
int lastItem = -1;
int currItem = -1;

int size = list.Count;

// Store the index pointing to the last item we want to keep in the list
int last = size - 1;

// Travel the items from last to first O(n)
for (int i = last; i >= 0; --i)
{
    currItem = list[i];

    // If this item was the same as the previous one, we don't want it
    if (currItem == lastItem)
    {
        // Overwrite last in current place. It is a swap but we don't need the last
       list[i] = list[last];

        // Reduce the last index, we don't want that one anymore
        last--;
    }

    // A new item, we store it and continue
    else
        lastItem = currItem;
}

// We now have an unsorted list with the duplicates at the end.

// Remove the last items just once
list.RemoveRange(last + 1, size - last - 1);

// Sort again O(n logn)
list.Sort();

最终成本为:

nlogn + n + nlogn = n + 2nlogn = O(nlogn)非常漂亮。

关于RemoveRange注意事项: 由于我们不能设置列表的计数并避免使用Remove函数，我不知道这个操作的确切速度，但我猜这是最快的方法。

2019-05-28 14:55:51

在c#中从List<T>中删除重复项

推荐文章

最新文章

标签