🔥 Internal Working of HashMap in Java — Deep Dive (Source-Level + Real-World)

Most developers know that HashMap gives O(1) performance. Very few know why it sometimes suddenly doesn’t.

And that gap is exactly where production outages happen.

Let’s break HashMap down, from memory layout → hashing → collisions → resizing → Java 8 changes → concurrency issues → GC impact.

1️⃣ What HashMap Really Is (Not the textbook version)

At runtime, a HashMap is:

An array of references (Node[])
→ each index is a bucket
→ each bucket holds:
   - null
   - single Node
   - linked list of Nodes
   - red-black tree of Nodes (Java 8+)
        

It is not just a map. It is a carefully optimized memory + CPU trade-off.

2️⃣ Internal Node Structure (JDK Source Reality)

static class Node<K,V> implements Map.Entry<K,V> {
    final int hash;
    final K key;
    V value;
    Node<K,V> next;
}
        

Important details:

  • hash is cached → hashCode not recalculated
  • next enables chaining
  • TreeNode extends Node (Java 8+)

👉 Every entry is an object allocation → memory & GC matter.


3️⃣ Hashing: Why hashCode() Is Not Enough

Step 1: hashCode() is called

int h = key.hashCode();
        

Step 2: Bit spreading (CRITICAL)

h ^ (h >>> 16)
        

Why?

  • Many hashCode() results differ mainly in their upper bits
  • HashMap uses only the lower bits to compute the index
  • The XOR folds upper-bit entropy into the lower bits

👉 This single line prevents massive collision disasters
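The spread step can be seen in isolation. Below is a sketch mirroring the JDK's internal hash() helper (HashSpread is an illustrative name, not a JDK class): two hashCodes that differ only in their upper 16 bits would collide without spreading.

```java
// Mirrors the bit-spreading step of java.util.HashMap.hash() (OpenJDK 8+).
public class HashSpread {
    static int spread(Object key) {
        int h;
        return (key == null) ? 0 : (h = key.hashCode()) ^ (h >>> 16);
    }

    public static void main(String[] args) {
        // These two hashes differ only in the upper 16 bits, so the raw
        // index (n - 1) & h is identical for both -> guaranteed collision.
        int a = 0x0001_0000, b = 0x0002_0000;
        int n = 16;
        System.out.println(((n - 1) & a) + " vs " + ((n - 1) & b));      // 0 vs 0
        System.out.println(((n - 1) & spreadInt(a)) + " vs "
                         + ((n - 1) & spreadInt(b)));                    // 1 vs 2
    }

    static int spreadInt(int h) { return h ^ (h >>> 16); }
}
```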


4️⃣ Index Calculation (Why power of 2?)

index = (n - 1) & hash
        

Why NOT %?

  • % costs an integer division (and can go negative for negative hashes)
  • & is a single CPU instruction
  • (n - 1) & hash matches the remainder only when n is a power of 2

This is why:

16, 32, 64, 128 ...
        

are used as capacities.
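A quick demonstration of the mask (IndexCalc is a hypothetical helper, not JDK code). Note the bonus: unlike %, the mask also maps negative hashCodes to a valid non-negative index.

```java
public class IndexCalc {
    // Same index computation HashMap uses; n must be a power of two.
    static int index(int hash, int n) { return (n - 1) & hash; }

    public static void main(String[] args) {
        int n = 16;
        for (int h : new int[]{0, 5, 17, -3}) {
            // For powers of two, (n - 1) & h equals the non-negative
            // remainder; Java's % would return -3 for h = -3.
            System.out.println(h + " -> " + index(h, n));
        }
    }
}
```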

5️⃣ put() — Complete Internal Flow

map.put(key, value);
        

What actually happens:

  1. Compute hash (with bit spreading)
  2. Calculate bucket index
  3. Check bucket: empty → place node; same key → replace value; collision → append to list (or tree)
  4. Increment size
  5. Resize if size exceeds the threshold

⚠️ Every step has performance implications
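The five steps above can be sketched as a toy map (MiniMap is illustrative only; it skips treeification and real resizing, so treat it as a sketch of the control flow, not the JDK source):

```java
import java.util.Objects;

// Minimal sketch of HashMap.putVal()'s flow: hash -> index -> bucket
// check -> insert/replace -> size & resize check.
public class MiniMap<K, V> {
    static class Node<K, V> {
        final int hash; final K key; V value; Node<K, V> next;
        Node(int hash, K key, V value) { this.hash = hash; this.key = key; this.value = value; }
    }

    @SuppressWarnings("unchecked")
    Node<K, V>[] table = (Node<K, V>[]) new Node[16];
    int size;

    static int hash(Object key) {
        int h;
        return (key == null) ? 0 : (h = key.hashCode()) ^ (h >>> 16);
    }

    public V put(K key, V value) {
        int h = hash(key);                          // 1. compute hash
        int i = (table.length - 1) & h;             // 2. bucket index
        for (Node<K, V> e = table[i]; e != null; e = e.next) {
            if (e.hash == h && Objects.equals(e.key, key)) {
                V old = e.value; e.value = value;   // 3a. key exists -> replace
                return old;
            }
        }
        Node<K, V> n = new Node<>(h, key, value);   // 3b. new key -> chain
        n.next = table[i];
        table[i] = n;
        if (++size > table.length * 0.75) {         // 4-5. size & resize check
            // the real HashMap doubles the table and redistributes nodes here
        }
        return null;
    }

    public V get(K key) {
        int h = hash(key);
        for (Node<K, V> e = table[(table.length - 1) & h]; e != null; e = e.next)
            if (e.hash == h && Objects.equals(e.key, key)) return e.value;
        return null;
    }

    public static void main(String[] args) {
        MiniMap<String, Integer> m = new MiniMap<>();
        m.put("alpha", 1);
        m.put("alpha", 2);   // same key -> value replaced, size unchanged
        System.out.println(m.get("alpha") + " size=" + m.size);  // 2 size=1
    }
}
```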

6️⃣ Collision Handling — Where Things Go Wrong


Before Java 8

Bucket → Linked List
Worst case: O(n)
        

Java 8+

Bucket → Linked List → Red-Black Tree
Worst case: O(log n)
        

Treeification Conditions (Very Specific)

  • Bucket size ≥ 8
  • Table size ≥ 64

If table size < 64: 👉 HashMap resizes instead of treeifying

Why?

  • Tree overhead is expensive for small maps
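A worst-case demo, assuming a deliberately broken hashCode (BadKey is a made-up class, not from any library): even with every key in one bucket, lookups stay correct; Java 8+ simply degrades to O(log n) once the thresholds are crossed. Implementing Comparable helps the tree order keys with identical hashes.

```java
import java.util.HashMap;
import java.util.Map;

public class CollisionDemo {
    static final class BadKey implements Comparable<BadKey> {
        final int id;
        BadKey(int id) { this.id = id; }
        @Override public int hashCode() { return 42; }  // every key collides
        @Override public boolean equals(Object o) {
            return o instanceof BadKey && ((BadKey) o).id == id;
        }
        @Override public int compareTo(BadKey o) { return Integer.compare(id, o.id); }
    }

    public static void main(String[] args) {
        Map<BadKey, Integer> map = new HashMap<>();
        for (int i = 0; i < 1000; i++) map.put(new BadKey(i), i);
        // All 1000 entries share one bucket, yet the map stays correct.
        System.out.println(map.size() + " " + map.get(new BadKey(500)));  // 1000 500
    }
}
```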


7️⃣ get() — Fast or Slow Depends on You

map.get(key);
        

Internal flow:

  1. Compute hash
  2. Calculate bucket index
  3. Traverse the bucket: compare the cached hash first, then == / equals()

💥 Bad equals() = slow map
💥 Mutable key = broken map
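Here is the mutable-key failure in miniature (Key is a hypothetical class): the Node caches the hash computed at put() time, so after the key mutates, get() computes a different hash and the entry becomes unreachable.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Objects;

public class MutableKeyDemo {
    static class Key {
        String id;
        Key(String id) { this.id = id; }
        @Override public int hashCode() { return Objects.hash(id); }
        @Override public boolean equals(Object o) {
            return o instanceof Key && Objects.equals(((Key) o).id, id);
        }
    }

    public static void main(String[] args) {
        Map<Key, String> map = new HashMap<>();
        Key k = new Key("A");
        map.put(k, "value");
        k.id = "B";                      // mutate the key AFTER insertion
        System.out.println(map.get(k));  // null: hash no longer matches
        System.out.println(map.size());  // 1: the entry still occupies memory
    }
}
```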

8️⃣ equals() + hashCode(): The Silent Killers

Production Bug Example

class Employee {
    String empId;
}
        

Used as HashMap key.

Problems:

  • hashCode not overridden → identity hash → two Employee objects with the same empId hash differently
  • equals not overridden → identity comparison → a lookup with a new instance always misses

Result:

  • get(new Employee("E42")) returns null even though that empId is "in" the map
  • Duplicate entries accumulate for the same logical key
  • And with a low-entropy hashCode the opposite failure appears: all entries land in one bucket, the linked list grows, CPU spikes, latency explodes

Correct Implementation

@Override
public int hashCode() {
    return Objects.hash(empId);
}

@Override
public boolean equals(Object o) {
    if (this == o) return true;
    if (!(o instanceof Employee)) return false;
    Employee e = (Employee) o;
    return Objects.equals(empId, e.empId);
}
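A quick check that the fix behaves as intended (EmployeeKeyDemo is an illustrative harness): two distinct instances carrying the same empId now act as the same logical key.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Objects;

public class EmployeeKeyDemo {
    static final class Employee {
        final String empId;   // final keeps the key effectively immutable
        Employee(String empId) { this.empId = empId; }
        @Override public int hashCode() { return Objects.hash(empId); }
        @Override public boolean equals(Object o) {
            if (this == o) return true;
            if (!(o instanceof Employee)) return false;
            return Objects.equals(empId, ((Employee) o).empId);
        }
    }

    public static void main(String[] args) {
        Map<Employee, String> salaries = new HashMap<>();
        salaries.put(new Employee("E42"), "90k");
        salaries.put(new Employee("E42"), "95k");  // replaces, not duplicates
        System.out.println(salaries.size());                    // 1
        System.out.println(salaries.get(new Employee("E42")));  // 95k
    }
}
```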
        

9️⃣ Resizing (Rehashing) — The Hidden Performance Bomb

Defaults:

initialCapacity = 16
loadFactor = 0.75
        

Resize happens when:

size > capacity × loadFactor
        

What happens internally:

  • New array (double size)
  • Every node is repositioned
  • hashCode() is not re-invoked (the cached hash is reused), but every index is recalculated
  • CPU intensive
  • GC pressure increases

⚠️ In high-QPS systems, resize = latency spike
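The arithmetic, plus the standard pre-sizing idiom (a sketch; the exact number of resizes you save depends on your data):

```java
import java.util.HashMap;
import java.util.Map;

public class ResizeMath {
    public static void main(String[] args) {
        int capacity = 16;
        float loadFactor = 0.75f;
        int threshold = (int) (capacity * loadFactor);
        // A default HashMap resizes on the put that pushes size past 12.
        System.out.println("resize after " + threshold + " entries");  // 12

        // Growing from 16 buckets to ~1M entries costs over a dozen
        // resizes, each touching every node. Pre-sizing avoids all of them:
        int expected = 1_000_000;
        Map<String, String> big = new HashMap<>((int) (expected / loadFactor) + 1);
        System.out.println("pre-sized map ready: " + big.isEmpty());   // true
    }
}
```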

🔟 Why HashMap Is NOT Thread-Safe

Concurrent put operations can cause:

  • Lost updates
  • Infinite loops (JDK 7)
  • Data corruption

Famous JDK 7 Bug

Concurrent resize → linked list cycle → CPU stuck at 100%

👉 This is why ConcurrentHashMap exists
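A minimal sketch of the safe alternative: eight writer threads on one map. The threads use disjoint key ranges, so the final size is deterministic with ConcurrentHashMap; a plain HashMap under the same load can lose updates or corrupt buckets.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

public class ConcurrentDemo {
    public static void main(String[] args) throws InterruptedException {
        Map<Integer, Integer> map = new ConcurrentHashMap<>();
        Thread[] threads = new Thread[8];
        for (int t = 0; t < 8; t++) {
            final int offset = t * 10_000;   // disjoint key range per thread
            threads[t] = new Thread(() -> {
                for (int i = 0; i < 10_000; i++) map.put(offset + i, i);
            });
            threads[t].start();
        }
        for (Thread th : threads) th.join();
        System.out.println(map.size());  // 80000, every time
    }
}
```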

1️⃣1️⃣ Memory & GC Impact (Rarely Discussed)

Each HashMap entry:

  • Node object
  • Key object
  • Value object
  • Next pointer

Large maps:

  • Increase old-gen usage
  • Increase GC pause time
  • Affect latency SLAs

💡 Solution:

  • Use primitive maps (FastUtil, HPPC) when needed
  • Pre-size HashMap
  • Avoid long-lived giant maps

1️⃣2️⃣ Best Practices (Battle-Tested)

✔ Use immutable keys
✔ Override equals & hashCode
✔ Pre-size HashMap
✔ Avoid custom mutable objects as keys
✔ Never share HashMap across threads
✔ Use ConcurrentHashMap for concurrency
✔ Monitor resize & GC in production

1️⃣3️⃣ Interview Questions That Actually Matter

Why bucket size 8? → Tree cost vs list traversal trade-off; with a good hash, chains of 8 are vanishingly rare

Why capacity ≥ 64 for treeification? → Avoid tree overhead for small maps

Why does HashMap allow a null key? → hash(null) is defined as 0, so it always lands in bucket 0

Why does iteration order change? → Resize redistributes nodes across buckets
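The null key in action, so the last two answers are easy to verify:

```java
import java.util.HashMap;
import java.util.Map;

public class NullKeyDemo {
    public static void main(String[] args) {
        Map<String, String> map = new HashMap<>();
        // hash(null) is hard-wired to 0, so null always lands in bucket 0.
        map.put(null, "stored under hash 0");
        System.out.println(map.get(null));          // stored under hash 0
        System.out.println(map.containsKey(null));  // true
    }
}
```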

🔚 Final Reality Check

HashMap is:

  • Simple to use
  • Dangerous to misuse
  • Extremely optimized internally

HashMap does not fail. Developers fail to understand HashMap.

