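How do I check whether a flat list contains duplicates?

For example, given the list ['one', 'two', 'one'], the algorithm should return True, whereas given ['one', 'two', 'three'] it should return False.

Current answer

I don't really know what set does behind the scenes, so I just like to keep it simple.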

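def dupes(num_list):
    # Track the values seen so far and any value that shows up again.
    unique = []
    dupes = []
    for i in num_list:
        if i not in unique:
            unique.append(i)
        else:
            dupes.append(i)
    # True when at least one duplicate was found, as the question asks.
    return len(dupes) != 0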

2019-02-05 04:57:27

Other answers

Another concise approach is to use Counter.

To determine whether the original list contains any duplicates:

from collections import Counter

def has_dupes(l):
    # most_common(1) returns [(item, count)] for the most frequent item,
    # or [] for an empty list, so guard before indexing
    top = Counter(l).most_common(1)
    return bool(top) and top[0][1] > 1

Or, to get the list of duplicated items:

def get_dupes(l):
    return [k for k, v in Counter(l).items() if v > 1]
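
As a quick illustration of what these two helpers rely on, here is a minimal sketch of what Counter produces for the question's sample list. The variable names (counts, top_item, top_count, duplicates) are only illustrative and do not come from the answer.

```python
from collections import Counter

# Count how often each item occurs in the sample list from the question.
counts = Counter(['one', 'two', 'one'])

# most_common(1) yields a one-element list holding the (item, count) pair
# with the highest count.
top_item, top_count = counts.most_common(1)[0]

# Items whose count exceeds 1 are the duplicated ones.
duplicates = [item for item, n in counts.items() if n > 1]

# For an empty input, most_common(1) is an empty list, so indexing [0]
# on it would raise IndexError.
empty_top = Counter([]).most_common(1)
```

Building the Counter visits every item, so this does not stop early the way a loop over a set of seen items can.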

2018-02-05 03:56:05

Recommended for short lists only:

any(thelist.count(x) > 1 for x in thelist)

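Do not use this on a long list: the time it takes is proportional to the square of the number of items in the list!

For longer lists with hashable items (strings, numbers, etc.):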

def anydup(thelist):
  seen = set()
  for x in thelist:
    if x in seen: return True
    seen.add(x)
  return False

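If your items are not hashable (sublists, dicts, etc.), it gets hairier, though it may still be possible to get O(N log N) if they are at least comparable. But you need to know or test the characteristics of the items (hashable or not, comparable or not) to get the best performance you can: O(N) for hashable items, O(N log N) for non-hashable but comparable items, and otherwise it is down to O(N squared), and there is nothing anyone can do about it :-(.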
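
The O(N log N) case for comparable but non-hashable items could be sketched roughly as follows. The function name anydup_sorted is hypothetical, and the sketch assumes the items can be ordered against each other in a way that is consistent with ==, so that equal items end up adjacent after sorting.

```python
def anydup_sorted(thelist):
    """Return True if two items compare equal.

    Assumes the items are mutually orderable (e.g. lists of numbers);
    sorted() raises TypeError otherwise.
    """
    ordered = sorted(thelist)  # O(N log N) comparisons
    # After sorting, equal items sit next to each other, so a single
    # pass over adjacent pairs is enough.
    return any(a == b for a, b in zip(ordered, ordered[1:]))


print(anydup_sorted([[1, 2], [3], [1, 2]]))
print(anydup_sorted([[1, 2], [3], [4]]))
```

Items that are neither hashable nor orderable (dicts in Python 3, for instance) would still need the pairwise, quadratic comparison.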

2009-10-09 04:36:37

If all values are hashable, use set() to remove the duplicates and compare lengths:

>>> your_list = ['one', 'two', 'one']
>>> len(your_list) != len(set(your_list))
True
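
Because set() raises TypeError for unhashable items such as sublists, the check above could be wrapped along these lines. This is only a sketch: the name has_duplicates and the quadratic fallback are assumptions added for illustration, not part of the answer.

```python
def has_duplicates(your_list):
    """Return True if any value occurs more than once."""
    items = list(your_list)
    try:
        # Fast path: a set drops duplicates, so a shorter set means
        # at least one value was repeated.
        return len(items) != len(set(items))
    except TypeError:
        # Unhashable items (e.g. sublists): fall back to comparing each
        # item with the ones before it, which is quadratic.
        return any(x in items[:i] for i, x in enumerate(items))


print(has_duplicates(['one', 'two', 'one']))
print(has_duplicates([[1], [2], [1]]))
```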

2009-10-09 04:38:45

A simpler solution is as follows. Just use the pandas .duplicated() method to flag each item as True/False, then take the sum. See the pandas.Series.duplicated page in the pandas 0.24.1 documentation.

import pandas as pd

def has_duplicated(l):
    return pd.Series(l).duplicated().sum() > 0

print(has_duplicated(['one', 'two', 'one']))
# True
print(has_duplicated(['one', 'two', 'three']))
# False

2019-02-25 08:18:53
