源码分析HashMap

2023年12月18日 371次阅读来源: thriller_zw

1、数据结构：数组+链表

//成员变量，数组 table[bucketIndex] = new Entry<K,V>(hash, key, value, e);
  transient Entry[] table;
 
 //内部类，链表
 static class Entry<K,V> implements Map.Entry<K,V> {
        final K key;
        V value;
        //链表指向下个元素
        Entry<K,V> next;
        final int hash;

        /**
         * Creates new entry.
         */
        Entry(int h, K k, V v, Entry<K,V> n) {
            value = v;
            next = n;
            key = k;
            hash = h;
        }

        public final K getKey() {
            return key;
        }

        public final V getValue() {
            return value;
        }

        public final V setValue(V newValue) {
        V oldValue = value;
            value = newValue;
            return oldValue;
        }

        public final boolean equals(Object o) {
            if (!(o instanceof Map.Entry))
                return false;
            Map.Entry e = (Map.Entry)o;
            Object k1 = getKey();
            Object k2 = e.getKey();
            if (k1 == k2 || (k1 != null && k1.equals(k2))) {
                Object v1 = getValue();
                Object v2 = e.getValue();
                if (v1 == v2 || (v1 != null && v1.equals(v2)))
                    return true;
            }
            return false;
        }

        public final int hashCode() {
            return (key==null   ? 0 : key.hashCode()) ^
                   (value==null ? 0 : value.hashCode());
        }

        public final String toString() {
            return getKey() + "=" + getValue();
        }

        /**
         * This method is invoked whenever the value in an entry is
         * overwritten by an invocation of put(k,v) for a key k that's already
         * in the HashMap.
         */
        void recordAccess(HashMap<K,V> m) {
        }

        /**
         * This method is invoked whenever the entry is
         * removed from the table.
         */
        void recordRemoval(HashMap<K,V> m) {
        }
    }

2、put存值

向链表头部插入元素,数组元素即为头。

  public V put(K key, V value) {
        if (key == null)
            return putForNullKey(value);
        //计算hash值
        int hash = hash(key.hashCode());
        //计算数组下标
        int i = indexFor(hash, table.length);
        for (Entry<K,V> e = table[i]; e != null; e = e.next) {
            Object k;
            if (e.hash == hash && ((k = e.key) == key || key.equals(k))) {
                V oldValue = e.value;
                e.value = value;
                e.recordAccess(this);
                return oldValue;
            }
        }

        modCount++;
        //添加元素
        addEntry(hash, key, value, i);
        return null;
    }
    //计算hash值
    static int hash(int h) {
        h ^= (h >>> 20) ^ (h >>> 12);
        return h ^ (h >>> 7) ^ (h >>> 4);
    }
    //计算数组下标
    static int indexFor(int h, int length) {
        //保证结果的最大值是length-1，不会产生数组越界问题。
        return h & (length-1);
    }
  //向链表头部插入元素
  void addEntry(int hash, K key, V value, int bucketIndex) {
  			//获取table[i]的对象e
                Entry<K,V> e = table[bucketIndex];
                //将table[i]的对象修改为新增对象，让新增对象的next指向e。
        table[bucketIndex] = new Entry<K,V>(hash, key, value, e);
        if (size++ >= threshold)
            resize(2 * table.length);
   }        
   //超过阀值，生成新的Entry[]数组。
    void resize(int newCapacity) {
        Entry[] oldTable = table;
        int oldCapacity = oldTable.length;
        //最大容量：static final int MAXIMUM_CAPACITY = 1 << 30;
        if (oldCapacity == MAXIMUM_CAPACITY) {
            threshold = Integer.MAX_VALUE;
            return;
        }
 
        Entry[] newTable = new Entry[newCapacity];
        transfer(newTable);
        //生成新的数组
        table = newTable;
        //阀值等于新的容量乘以加载因子（默认0.75）
        threshold = (int)(newCapacity * loadFactor);
    }

3、get取值

      public V get(Object key) {
        if (key == null)
            return getForNullKey();
        int hash = hash(key.hashCode());
        for (Entry<K,V> e = table[indexFor(hash, table.length)];
             e != null;
             e = e.next) {
            Object k;
            //hash和key值都相等
            if (e.hash == hash && ((k = e.key) == key || key.equals(k)))
                return e.value;
        }
        return null;
    }

4、相关知识

Hash，一般翻译做“散列”，也有直接音译为”哈希”的，就是把任意长度的输入（又叫做预映射， pre-image），通过散列算法，变换成固定长度的输出，该输出就是散列值。这种转换是一种压缩映射，也就是，散列值的空间通常远小于输入的空间，不同的输入可能会散列成相同的输出，而不可能从散列值来唯一的确定输入值。

java两个对象值相同(x.equals(y) == true)，则一定有相同的hash code。当equals方法被重写时，通常有必要重写 hashCode 方法，以维护 hashCode 方法的常规协定，该协定声明相等对象equals必须具有相等的哈希码。hashCode相等对象未必相等。

数组的特点是空间连续（大小固定）、寻址迅速，但是插入和删除时需要移动元素，所以查询快，增加删除慢。链表恰好相反，可动态增加或减少空间以适应新增和删除元素，但查找时只能顺着一个个节点查找，所以增加删除快，查找慢。哈希表综合2者优点，但实际上查找肯定没有数组快，插入删除没有链表快，一种折中的方式。

参考：http://www.cnblogs.com/hzmark/archive/2012/12/24/HashMap.html

    原文作者：thriller_zw
    原文地址: https://blog.csdn.net/thriller_zw/article/details/84595891
    本文转自网络文章，转载此文章仅为分享知识，如有侵权，请联系博主进行删除。