HashMap源码分析_JDK1.8版本

2024年1月3日 400次阅读来源: 算法小白

HashMap源码分析_JDK1.8版本

声明

文章均为本人技术笔记，转载请注明出处
[1] https://segmentfault.com/u/yzwall
[2] blog.csdn.net/j_dark/

HashMap声明

public class HashMap<K,V> extends AbstractMap<K,V> implements Map<K,V>, Cloneable, Serializable

HashMap结构图示

HashMap基本数据结构

HashMap本质是一个散列表，存储元素为键值对；
HashMap继承AbstractMap，实现了Map、Cloneable、java.io.Serializable接口；
HashMap的是线程不安全的，它的key、value都可以为null；

final int loadFacotr
- static final float DEFAULT_LOAD_FACTOR: 默认装填因子0.75，如果当前键值对个数 >= HashMap最大容量*装填因子，进行rehash操作；
int threshold
- static final int TREEIFY_THRESHOLD： JDK1.8 新加，Entry链表最大长度，当桶中节点数目大于该长度时，将链表转成红黑树存储；
- static final int UNTREEIFY_THRESHOLD：JDK1.8 新加，当桶中节点数小于该长度，将红黑树转为链表存储；
- static final int DEFAULT_INITIAL_CAPACITY: 默认键值队个数为16；
transient Node<K, V>[] table：键值对数组，长度动态增加，但是总为2的幂；用transient修饰表示对象序列化时该字段不可被序列化；

HashMap键值对描述：Node<K,V>

JDK1.6用Entry描述键值对，JDK1.8中用Node代替Entry；

static class Node<K,V> implements Map.Entry<K,V> {
    // hash存储key的hashCode
    final int hash;
    // final:一个键值对的key不可改变
    final K key;
    V value;
    // 一个桶中的键值对用链表组织
    Node<K, V> next;
    ...
}

HashMap中键值对的存储形式为链表节点，hashCode相同的节点（位于同一个桶）用链表组织；

public final int hashCode() {
    return Objects.hashCode(key) ^ Objects.hashCode(value);
}

键值对元素hashCode = key的hashCode与value的hashCode，高16位与低16位按位异或运算；

红黑树：TreeNode

static final class TreeNode<K,V> extends LinkedHashMap.Entry<K,V>

JDK1.8新增，用来支持桶的红黑树结构实现

HashMap重要方法分析

HashMap添加/更新键值对：put/putVal方法

public V put(K key, V value)内部调用putVal方法实现；

public V put(K key, V value) {
    // 倒数第二个参数false：表示允许旧值替换
    // 最后一个参数true：表示HashMap不处于创建模式
    return putVal(hash(key), key, value, false, true);
}

putVal方法分析：

final V putVal(int hash, K key, V value, boolean onlyIfAbsent, boolean evict) {
    Node<K,V>[] tab; Node<K,V> p; int n, i;
    // 槽数组未初始化或者未扩容，先调用resize()扩容
    if ((tab = table) == null || (n = tab.length) == 0)
        n = (tab = resize()).length;
    /**
     * Hash函数，(n - 1) & hash 计算key将被放置的槽位；
     * (n - 1) & hash 本质上是hash % n，位运算更快
     */
    if ((p = tab[i = (n - 1) & hash]) == null)
        // 空桶，创建新的键值对节点，放入槽数组中；
        tab[i] = newNode(hash, key, value, null);
    // 键值对已在对应桶中
    else {
        Node<K,V> e; K k;
        // 与桶中首元素比较，如果key不同发生Hash冲突，在桶中添加新元素
        if (p.hash == hash &&
            ((k = p.key) == key || (key != null && key.equals(k))))
            e = p;
        // 当前桶中无该键值对，且桶是红黑树结构，按照红黑树结构插入
        else if (p instanceof TreeNode)
            e = ((TreeNode<K,V>)p).putTreeVal(this, tab, hash, key, value);
        // 当前桶中无该键值对，且桶是链表结构，按照链表结构插入到尾部
        else {
            for (int binCount = 0; ; ++binCount) {
                // 遍历到链表尾部
                if ((e = p.next) == null) {
                    // 创建链表节点并插入尾部
                    p.next = newNode(hash, key, value, null);
                    // 检查链表长度是否达到阈值，达到将该槽位节点组织形式转为红黑树
                    if (binCount >= TREEIFY_THRESHOLD - 1) // -1 for 1st
                        treeifyBin(tab, hash);
                    break;
                }
                // 链表节点的<keym, value>与put操作<key, value>相同时，不做重复操作，跳出循环
                if (e.hash == hash &&
                    ((k = e.key) == key || (key != null && key.equals(k))))
                    break;
                p = e;
            }
        }
        // 找到 or 新建一个key和hashCode与插入元素相等的键值对，进行put操作
        if (e != null) { // existing mapping for key
            V oldValue = e.value;
            /**
             * onlyIfAbsent为false或旧值为null时，允许替换旧值
             * 否则无需替换
             */
            if (!onlyIfAbsent || oldValue == null)
                e.value = value;
            afterNodeAccess(e);
            return oldValue;
        }
    }
    // 更新结构化修改信息
    ++modCount;
    // 键值对数目超过阈值时，进行rehash
    if (++size > threshold)
        resize();
    afterNodeInsertion(evict);
    return null;
}

散列分布策略

键值对<key ,value>的槽位 = (容量 - 1) & hash(key)

键值对槽位是键值对在tab数组的索引，本质上 = hash(key) % 容量，位运算速度更快；
本质上是除数取余法，尽可能的散列均匀；

Hash函数

// in HashMap
static final int hash(Object key) {
    int h;
    return (key == null) ? 0 : (h = key.hashCode()) ^ (h >>> 16);
}

计算key的hashCode, h = Objects.hashCode(key);
h >>> 16表示对h无符号右移16位，高位补0；然后h与h >>> 16按位异或；

HashMap更新旧键值对 or 添加新键值对的核心思想：

根据键值对key的hashCode计算键值对的在HashMap中槽位，
判断是否空桶 Or 是否发生Hash冲突（与桶中首元素不同）
解决Hash冲突：根据桶组织形式是红黑树 Or 链表进行对应插入操作；
- 链表形式完成插入后，检查是否超过链表阈值，超过将链表->红黑树；
最后检查键值对总数是否超过阈值，超过调用resize()进行rehash操作；

HashMap删除键值对：remove/removeNode方法

remove方法分析

public V remove(Object key) {
    Node<K,V> e;
    return (e = removeNode(hash(key), key, null, false, true)) == null ?
        null : e.value;
}

remove()方法内部调用removeNode()方法实现

removeNode方法分析：

final Node<K,V> removeNode(int hash, Object key, Object value,
                           boolean matchValue, boolean movable) {
    Node<K,V>[] tab; Node<K,V> p; int n, index;
    if ((tab = table) != null && (n = tab.length) > 0 &&
        (p = tab[index = (n - 1) & hash]) != null) {
        Node<K,V> node = null, e; K k; V v;
        if (p.hash == hash &&
            ((k = p.key) == key || (key != null && key.equals(k))))
            node = p;
        // 待删除元素在桶中，但不是桶中首元素
        else if ((e = p.next) != null) {
            // 待删除元素在红黑树结构的桶中
            if (p instanceof TreeNode)
                // 查找红黑树
                node = ((TreeNode<K,V>)p).getTreeNode(hash, key);
            else {
                // 遍历链表，查找待删除元素
                do {
                    if (e.hash == hash &&
                        ((k = e.key) == key ||
                         (key != null && key.equals(k)))) {
                        node = e;
                        break;
                    }
                    // p保存待删除节点的前一个节点，用于链表删除操作
                    p = e;
                } while ((e = e.next) != null);
            }
        }
        /**
         * matchValue为true：表示必须value相等才进行删除操作
         * matchValue为false：表示无须判断value，直接根据key进行删除操作
         */
        if (node != null && (!matchValue || (v = node.value) == value ||
                             (value != null && value.equals(v)))) {
            // 桶为红黑数结构，删除节点
            if (node instanceof TreeNode)
                // movable参数用于红黑树操作
                ((TreeNode<K,V>)node).removeTreeNode(this, tab, movable);
            // 待删除节点是桶链表表头，将子节点放进桶位
            else if (node == p)
                tab[index] = node.next;
            // 待删除节点在桶链表中间
            else
                p.next = node.next;
            ++modCount;
            --size;
            afterNodeRemoval(node);
            return node;
        }
    }
    // 待删除元素不存在，返回null
    return null;
}

HashMap访问键值对：get/getNode方法

get方法

public V get(Object key) {
    Node<K,V> e;
    return (e = getNode(hash(key), key)) == null ? null : e.value;
}

get方法通过指定key获得对应键值对，内部调用getNode方法实现；

getNode方法

final Node<K,V> getNode(int hash, Object key) {
    Node<K,V>[] tab; Node<K,V> first, e; int n; K k;
    if ((tab = table) != null && (n = tab.length) > 0 &&
        (first = tab[(n - 1) & hash]) != null) {
        if (first.hash == hash && // always check first node
            ((k = first.key) == key || (key != null && key.equals(k))))
            return first;
        if ((e = first.next) != null) {
            // 红黑树查找
            if (first instanceof TreeNode)
                return ((TreeNode<K,V>)first).getTreeNode(hash, key);
            // 链表查找
            do {
                if (e.hash == hash &&
                    ((k = e.key) == key || (key != null && key.equals(k))))
                    return e;
            } while ((e = e.next) != null);
        }
    }
    return null;
}

getNode方法查找过程和putVal一样，先查找对应桶的首元素，然后根据红黑树结构 Or 链表结构对应查找；

HashMap重散列操作：resize方法

final Node<K,V>[] resize() {
    // 保存旧table，容量，阈值
    Node<K,V>[] oldTab = table;
    int oldCap = (oldTab == null) ? 0 : oldTab.length;
    int oldThr = threshold;
    int newCap, newThr = 0;
    if (oldCap > 0) {
        // 旧table容量已超过最大容量，更新阈值为Integer.MAX_VALUE
        if (oldCap >= MAXIMUM_CAPACITY) {
            threshold = Integer.MAX_VALUE;
            return oldTab;
        }
        // 调整新容量为旧容量2倍，左移一位实现
        else if ((newCap = oldCap << 1) < MAXIMUM_CAPACITY &&
                 oldCap >= DEFAULT_INITIAL_CAPACITY)
            newThr = oldThr << 1; // double threshold
    }
    // oldCap == 0 && oldThr > 0
    else if (oldThr > 0) // initial capacity was placed in threshold
        newCap = oldThr;
    // oldCap == 0 && oldThr == 0
    else {               // zero initial threshold signifies using defaults
        newCap = DEFAULT_INITIAL_CAPACITY;
        newThr = (int)(DEFAULT_LOAD_FACTOR * DEFAULT_INITIAL_CAPACITY);
    }
    if (newThr == 0) {
        float ft = (float)newCap * loadFactor;
        newThr = (newCap < MAXIMUM_CAPACITY && ft < (float)MAXIMUM_CAPACITY ?
                  (int)ft : Integer.MAX_VALUE);
    }
    threshold = newThr;
    @SuppressWarnings({"rawtypes","unchecked"})
        Node<K,V>[] newTab = (Node<K,V>[])new Node[newCap];
    table = newTab;
    if (oldTab != null) {
        for (int j = 0; j < oldCap; ++j) {
            Node<K,V> e;
            if ((e = oldTab[j]) != null) {
                oldTab[j] = null;
                if (e.next == null)
                    newTab[e.hash & (newCap - 1)] = e;
                // 红黑树桶进行rehash操作
                else if (e instanceof TreeNode)
                    ((TreeNode<K,V>)e).split(this, newTab, j, oldCap);
                
                // 链表桶进行rehash操作
                // 根据e.hash & oldCap)是否为0把链表分成两个链表，进行rehash
                else { // preserve order
                    Node<K,V> loHead = null, loTail = null;
                    Node<K,V> hiHead = null, hiTail = null;
                    Node<K,V> next;
                    do {
                        next = e.next;
                        if ((e.hash & oldCap) == 0) {
                            if (loTail == null)
                                loHead = e;
                            else
                                loTail.next = e;
                            loTail = e;
                        }
                        else {
                            if (hiTail == null)
                                hiHead = e;
                            else
                                hiTail.next = e;
                            hiTail = e;
                        }
                    } while ((e = next) != null);
                    if (loTail != null) {
                        loTail.next = null;
                        newTab[j] = loHead;
                    }
                    if (hiTail != null) {
                        hiTail.next = null;
                        newTab[j + oldCap] = hiHead;
                    }
                }
            }
        }
    }
    return newTab;
}

当键值对总数超过阈值threshold, HashMap通过resize方法实现重散列/rehash

HashMap调整容量：tableSizeFor()

static final int tableSizeFor(int cap)：得到>=cap的2的最小幂值；
由指定容量参数的构造器调用，计算rehash阈值threshold；

参考

[1] http://www.cnblogs.com/leesf456/p/5242233.html
[2] http://www.cnblogs.com/ToBeAProgrammer

    原文作者：算法小白
    原文地址: https://segmentfault.com/a/1190000008994736
    本文转自网络文章，转载此文章仅为分享知识，如有侵权，请联系博主进行删除。