不区分大小写的字符串作为HashMap键

出于以下原因，我想使用不区分大小写的字符串作为HashMap键。

在初始化过程中，我的程序用用户定义的字符串创建HashMap 在处理事件(在我的情况下是网络流量)时，我可能会在不同的情况下收到字符串，但我应该能够定位<键，值>从HashMap忽略我从流量收到的情况。

我采用了这种方法

CaseInsensitiveString.java

    public final class CaseInsensitiveString {
            private String s;

            public CaseInsensitiveString(String s) {
                            if (s == null)
                            throw new NullPointerException();
                            this.s = s;
            }

            public boolean equals(Object o) {
                            return o instanceof CaseInsensitiveString &&
                            ((CaseInsensitiveString)o).s.equalsIgnoreCase(s);
            }

            private volatile int hashCode = 0;

            public int hashCode() {
                            if (hashCode == 0)
                            hashCode = s.toUpperCase().hashCode();

                            return hashCode;
            }

            public String toString() {
                            return s;
            }
    }

LookupCode.java

    node = nodeMap.get(new CaseInsensitiveString(stringFromEvent.toString()));

因此，我为每个事件创建了CaseInsensitiveString的新对象。因此，它可能会影响性能。

有没有其他办法解决这个问题?

当前回答

Map<String, String> nodeMap = 
    new TreeMap<>(String.CASE_INSENSITIVE_ORDER);

这就是你所需要的。

2014-03-11 21:04:18

其他回答

继承HashMap的子类，并创建一个在put和get(可能还有其他面向键的方法)时小写键的版本。

或者将HashMap合成到新类中，并将所有内容委托给映射，但要转换键。

如果需要保留原始键，可以维护双映射，或者将原始键与值一起存储。

2011-11-23 03:26:40

因此，我为每个事件创建了CaseInsensitiveString的新对象。因此，它可能会影响性能。

创建包装器或在查找前将键转换为小写都会创建新对象。编写自己的java.util.Map实现是避免这种情况的唯一方法。这并不难，而且在我看来是值得的。我发现下面的哈希函数工作得很好，最多几百个键。

static int ciHashCode(String string)
{
    // length and the low 5 bits of hashCode() are case insensitive
    return (string.hashCode() & 0x1f)*33 + string.length();
}

2016-05-23 11:39:35

您可以使用来自Eclipse Collections的基于HashingStrategy的映射

HashingStrategy<String> hashingStrategy =
    HashingStrategies.fromFunction(String::toUpperCase);
MutableMap<String, String> node = HashingStrategyMaps.mutable.of(hashingStrategy);

注意:我是Eclipse Collections的贡献者。

2016-03-25 21:31:08

你可以使用CollationKey对象来代替字符串:

Locale locale = ...;
Collator collator = Collator.getInstance(locale);
collator.setStrength(Collator.SECONDARY); // Case-insensitive.
collator.setDecomposition(Collator.FULL_DECOMPOSITION);

CollationKey collationKey = collator.getCollationKey(stringKey);
hashMap.put(collationKey, value);
hashMap.get(collationKey);

使用排序器。忽略口音差异。

CollationKey API并不保证实现hashCode()和equals()，但实际上您将使用RuleBasedCollationKey，它实现了这些。如果你是偏执的，你可以使用TreeMap，它保证以O(log n)时间而不是O(1)的成本工作。

2021-01-22 23:09:07

为了记住hashCode，“包装”字符串不是更好吗?在普通的String类中，hashCode()第一次是O(N)，然后是O(1)，因为它是为将来使用而保留的。

public class HashWrap {
    private final String value;
    private final int hash;

    public String get() {
        return value;
    }

    public HashWrap(String value) {
        this.value = value;
        String lc = value.toLowerCase();
        this.hash = lc.hashCode();
    }

    @Override
    public boolean equals(Object o) {
        if (this == o) return true;
        if (o instanceof HashWrap) {
            HashWrap that = (HashWrap) o;
            return value.equalsIgnoreCase(that.value);
        } else {
            return false;
        }
    }

    @Override
    public int hashCode() {
        return this.hash;
    }

    //might want to implement compare too if you want to use with SortedMaps/Sets.
}

这将允许您在java中使用哈希表的任何实现，并具有O(1) hasCode()。

2013-08-21 13:43:15

不区分大小写的字符串作为HashMap键

推荐文章

最新文章

标签