Concurrent Collections

Thread-safe collections for concurrent applications.

Synchronized vs Concurrent Collections

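Synchronized collections guard the whole structure with a single lock, so only one thread can use the collection at a time. Concurrent collections use finer-grained locking or lock-free techniques, so threads working on different parts of the structure rarely block each other. Understanding this difference is crucial for performance.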

Visual: Lock Granularity

Java Concurrent Collections

ConcurrentHashMap

ConcurrentHashMap provides thread-safe hash map operations with high concurrency.

How It Works

Java 7: Segment locking (16 segments by default)
Java 8+: CAS operations + synchronized blocks for individual buckets
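The Java 7 segment idea is a form of lock striping: keys are spread over several independently locked partitions, so threads touching different partitions do not contend. The sketch below illustrates that idea in Python rather than Java. `StripedDict`, its method names, and the default of 16 stripes (chosen to mirror the Java 7 default) are hypothetical, and this is not how `ConcurrentHashMap` is actually implemented.

```python
import threading


class StripedDict:
    """Illustrative dict guarded by several locks; each key belongs to one stripe."""

    def __init__(self, stripes=16):
        # One lock and one plain dict per stripe (hypothetical layout).
        self._locks = [threading.Lock() for _ in range(stripes)]
        self._buckets = [{} for _ in range(stripes)]

    def _index(self, key):
        # The key's hash decides which stripe (and therefore which lock) is used.
        return hash(key) % len(self._locks)

    def put(self, key, value):
        i = self._index(key)
        with self._locks[i]:
            self._buckets[i][key] = value

    def get(self, key, default=None):
        i = self._index(key)
        with self._locks[i]:
            return self._buckets[i].get(key, default)

    def increment(self, key, delta=1):
        # Read-modify-write done while holding the stripe lock, so it is atomic per key.
        i = self._index(key)
        with self._locks[i]:
            new_value = self._buckets[i].get(key, 0) + delta
            self._buckets[i][key] = new_value
            return new_value

    def __len__(self):
        # Sums stripe sizes without a global lock: only approximate during concurrent writes.
        return sum(len(bucket) for bucket in self._buckets)
```

Threads that update keys in different stripes take different locks, which is the property the segment design was after. A single global lock would serialize all of them.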

Visual: ConcurrentHashMap Architecture

Example: Thread-Safe Cache

Java

import java.util.concurrent.ConcurrentHashMap;
import java.util.function.Function;

public class ConcurrentHashMapCache<K, V> {
    private final ConcurrentHashMap<K, V> cache = new ConcurrentHashMap<>();

    public V get(K key) {
        return cache.get(key);  // Thread-safe read
    }

    public void put(K key, V value) {
        cache.put(key, value);  // Thread-safe write
    }

    // Atomic operations
    public V putIfAbsent(K key, V value) {
        return cache.putIfAbsent(key, value);  // Atomic!
    }

    public boolean remove(K key, V value) {
        return cache.remove(key, value);  // Atomic!
    }

    public V computeIfAbsent(K key, Function<K, V> mappingFunction) {
        return cache.computeIfAbsent(key, mappingFunction);  // Atomic!
    }
}

CopyOnWriteArrayList

CopyOnWriteArrayList copies its underlying array on every write, so reads need no locking and iterators work on a stable snapshot.
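The copy-on-write idea can be sketched in a few lines of Python. This is an illustration of the technique only, not the Java class: `CopyOnWriteList` and its methods are hypothetical names, and the sketch assumes CPython, where rebinding an attribute to a new tuple is a single reference swap.

```python
import threading


class CopyOnWriteList:
    """Illustrative copy-on-write list: writers copy, readers use a snapshot."""

    def __init__(self, items=()):
        self._items = tuple(items)           # immutable snapshot shared by readers
        self._write_lock = threading.Lock()  # serializes writers only

    def append(self, item):
        with self._write_lock:
            # Build a new tuple, then swap the reference; old snapshots stay valid.
            self._items = self._items + (item,)

    def remove(self, item):
        with self._write_lock:
            current = list(self._items)
            current.remove(item)             # raises ValueError if item is missing
            self._items = tuple(current)

    def snapshot(self):
        return self._items                   # no lock needed: tuples never change

    def __iter__(self):
        return iter(self._items)             # iterates over the snapshot taken now

    def __len__(self):
        return len(self._items)


listeners = CopyOnWriteList(["on_start", "on_stop"])
it = iter(listeners)           # reader starts iterating
listeners.append("on_error")   # writer publishes a new copy
print(list(it))                # ['on_start', 'on_stop']
print(list(listeners))         # ['on_start', 'on_stop', 'on_error']
```

Every write costs a full copy, which is why the pattern only pays off when reads dominate.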

Visual: Copy-On-Write

Example: CopyOnWriteArrayList

Java

import java.util.List;
import java.util.concurrent.CopyOnWriteArrayList;

public class CopyOnWriteExample {
    public static void main(String[] args) {
        List<String> list = new CopyOnWriteArrayList<>();

        // Multiple readers (no locking needed!)
        for (int i = 0; i < 10; i++) {
            final int readerId = i;
            new Thread(() -> {
                for (int j = 0; j < 1000; j++) {
                    list.size();  // Lock-free read
                }
                System.out.println("Reader " + readerId + " finished");
            }).start();
        }

        // Occasional writer
        new Thread(() -> {
            for (int i = 0; i < 10; i++) {
                list.add("Item " + i);  // Creates copy
                try {
                    Thread.sleep(100);
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                }
            }
        }).start();
    }
}

BlockingQueue Implementations

We covered these in Producer-Consumer, but here’s a quick reference:

Queue Type	Characteristics	Use Case
`ArrayBlockingQueue`	Array-backed, always bounded	Fixed-size queues
`LinkedBlockingQueue`	Node-based, optionally bounded (unbounded by default)	Higher throughput under contention
`PriorityBlockingQueue`	Unbounded, priority ordering	Priority-based processing
`DelayQueue`	Unbounded, elements become available only after their delay expires	Scheduled tasks
`SynchronousQueue`	Zero capacity, each put waits for a matching take	Direct handoff

Python Concurrent Collections

Thread-Safety of Built-in Types

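In CPython, the GIL makes many single operations on built-in types effectively atomic, but it does not make compound operations (read-modify-write, check-then-act) thread-safe.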

Visual: Python Thread-Safety

Example: Thread-Safe vs Unsafe Operations

Python

import threading

# ❌ NOT thread-safe: Compound operation
counter = 0

def unsafe_increment():
    global counter
    counter += 1  # NOT atomic: read-modify-write

threads = [threading.Thread(target=unsafe_increment) for _ in range(10)]
for t in threads:
    t.start()
for t in threads:
    t.join()

print(f"Unsafe result: {counter}")  # May not be 10!

# ✅ Thread-safe: Single atomic operation
d = {}

def safe_operation():
    d['key'] = 'value'  # Atomic operation

threads = [threading.Thread(target=safe_operation) for _ in range(10)]
for t in threads:
    t.start()
for t in threads:
    t.join()

print(f"Safe result: {len(d)}")  # Always 1

# ✅ Thread-safe: Using locks
counter_safe = 0
lock = threading.Lock()

def safe_increment():
    global counter_safe
    with lock:
        counter_safe += 1

threads = [threading.Thread(target=safe_increment) for _ in range(10)]
for t in threads:
    t.start()
for t in threads:
    t.join()

print(f"Safe with lock: {counter_safe}")  # Always 10
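One way to see why `counter += 1` is a read-modify-write is to look at the bytecode CPython generates for it. The sketch below uses the standard `dis` module. The exact instruction names are a CPython implementation detail and may differ between versions, so treat the output as illustrative.

```python
import dis

counter = 0


def unsafe_increment():
    global counter
    counter += 1


# Keep only the instructions that refer to the global name `counter`.
ops = [
    ins.opname
    for ins in dis.get_instructions(unsafe_increment)
    if ins.argval == "counter"
]
print(ops)  # on CPython, expect a separate load and a separate store of `counter`
```

Because the load and the store are distinct instructions, another thread can run between them and its update can be lost. Holding a lock around the increment closes that window.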

queue Module

Python’s queue module provides thread-safe queue implementations.

Python

import queue
import threading

# FIFO Queue
fifo_queue = queue.Queue(maxsize=10)

# LIFO Queue (Stack)
lifo_queue = queue.LifoQueue(maxsize=10)

# Priority Queue
priority_queue = queue.PriorityQueue(maxsize=10)

def producer(q):
    for i in range(5):
        q.put(i)
        print(f"Produced: {i}")

def consumer(q):
    while True:
        try:
            item = q.get(timeout=1)
            print(f"Consumed: {item}")
            q.task_done()
        except queue.Empty:
            break

# Thread-safe operations
threading.Thread(target=producer, args=(fifo_queue,)).start()
threading.Thread(target=consumer, args=(fifo_queue,)).start()
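The consumer above stops after a one-second timeout, which is simple but guesses when the work is finished. A common alternative, sketched here under assumed names (`SENTINEL`, `worker`, and `run` are hypothetical), is to send one sentinel per worker and wait with `Queue.join()`, which returns once every item has been matched by a `task_done()` call.

```python
import queue
import threading

SENTINEL = object()  # hypothetical marker meaning "no more work"


def worker(q, results):
    while True:
        item = q.get()                   # blocks until an item is available
        try:
            if item is SENTINEL:
                return                   # the finally clause still runs task_done()
            results.append(item * item)  # a single list.append call per result
        finally:
            q.task_done()


def run(items, num_workers=3):
    q = queue.Queue(maxsize=10)          # bounded: put() blocks while the queue is full
    results = []
    workers = [
        threading.Thread(target=worker, args=(q, results))
        for _ in range(num_workers)
    ]
    for w in workers:
        w.start()
    for item in items:
        q.put(item)
    for _ in workers:
        q.put(SENTINEL)                  # one sentinel per worker
    q.join()                             # wait until every item was marked done
    for w in workers:
        w.join()
    return sorted(results)               # completion order is not deterministic


print(run(range(5)))  # [0, 1, 4, 9, 16]
```

The bounded queue also gives backpressure for free: a fast producer is paused by `put()` until the workers catch up.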

multiprocessing.Manager

For shared state across processes (not threads):

Python

import multiprocessing

def worker(shared_dict, shared_list, lock):
    # Reading and then writing 'count' takes several proxy calls, so the
    # update is only safe across processes while holding a shared lock.
    with lock:
        shared_dict['count'] = shared_dict.get('count', 0) + 1
        shared_list.append(shared_dict['count'])

if __name__ == '__main__':
    manager = multiprocessing.Manager()
    shared_dict = manager.dict()
    shared_list = manager.list()
    lock = manager.Lock()  # lock proxy that can be passed to child processes

    processes = []
    for _ in range(5):
        p = multiprocessing.Process(target=worker, args=(shared_dict, shared_list, lock))
        processes.append(p)
        p.start()

    for p in processes:
        p.join()

    print(f"Dict: {shared_dict}")
    print(f"List: {shared_list}")

Comparison Table

Collection Type	Java	Python	Thread-Safety
HashMap/Dict	`ConcurrentHashMap`	`dict` + locks	Java: Full, Python: Limited
List	`CopyOnWriteArrayList`	`list` + locks	Java: Full, Python: Limited
Queue	`BlockingQueue` variants	`queue.Queue`	Both: Full
Set	`ConcurrentSkipListSet`	`set` + locks	Java: Full, Python: Limited
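For the `set` + locks row, a minimal sketch could look like the following. `LockedSet` and its method names are hypothetical. The point is that the membership check and the insert happen under one lock, so exactly one caller wins for a given item.

```python
import threading


class LockedSet:
    """Illustrative set wrapper whose compound operations hold a lock."""

    def __init__(self):
        self._items = set()
        self._lock = threading.Lock()

    def add_if_absent(self, item):
        """Return True only for the caller that actually inserted the item."""
        with self._lock:
            if item in self._items:
                return False
            self._items.add(item)
            return True

    def discard(self, item):
        with self._lock:
            self._items.discard(item)

    def __contains__(self, item):
        with self._lock:
            return item in self._items

    def snapshot(self):
        # Copy under the lock so callers can iterate without holding it.
        with self._lock:
            return frozenset(self._items)
```

A typical use is claiming work: several threads call `add_if_absent(job_id)` and only the one that receives `True` processes the job.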

Practice Problems

Easy: Thread-Safe Cache

Design a thread-safe cache using concurrent collections.

Solution

Java

import java.util.concurrent.ConcurrentHashMap;
import java.util.function.Function;

public class ThreadSafeCache<K, V> {
    private final ConcurrentHashMap<K, V> cache = new ConcurrentHashMap<>();

    public V get(K key) {
        return cache.get(key);
    }

    public void put(K key, V value) {
        cache.put(key, value);
    }

    public V computeIfAbsent(K key, Function<K, V> mappingFunction) {
        return cache.computeIfAbsent(key, mappingFunction);
    }
}

Python

import threading

class ThreadSafeCache:
    def __init__(self):
        self._cache = {}
        self._lock = threading.RLock()

    def get(self, key):
        with self._lock:
            return self._cache.get(key)

    def put(self, key, value):
        with self._lock:
            self._cache[key] = value

    def compute_if_absent(self, key, mapping_function):
        with self._lock:
            if key not in self._cache:
                self._cache[key] = mapping_function(key)
            return self._cache[key]

Interview Questions

Q1: “What’s the difference between ConcurrentHashMap and synchronized HashMap?”

Answer:

synchronized HashMap: Locks entire map for any operation (low concurrency)
ConcurrentHashMap: Fine-grained locking or CAS (high concurrency)
Performance: ConcurrentHashMap usually scales far better under concurrent access, because threads working on different buckets rarely block each other
Use ConcurrentHashMap: When you need thread-safe map with high concurrency

Q2: “When would you use CopyOnWriteArrayList?”

Answer:

Use when: Reads vastly outnumber writes (e.g., 100:1 ratio)
Perfect for: Event listeners, configuration, read-heavy scenarios
Don’t use when: Frequent writes (too expensive - creates copy each time)
Trade-off: Expensive writes for lock-free reads

Q3: “Are Python’s built-in dict and list thread-safe?”

Answer:

Single operations: Effectively atomic in CPython (e.g., dict[key] = value, list.append(item))
Compound operations: No, NOT thread-safe (e.g., if key in dict: dict[key] = value)
Solution: Use locks for compound operations or thread-safe collections
GIL: Provides some protection but doesn’t guarantee thread-safety for compound ops

Q4: “What’s the difference between ArrayBlockingQueue and LinkedBlockingQueue?”

Answer:

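ArrayBlockingQueue: Array-backed, always bounded, memory allocated up front, one lock shared by producers and consumers
LinkedBlockingQueue: Node-based, optionally bounded, allocates a node per element, separate locks for put and take, so typically higher throughput but less predictable performance
Choose: ArrayBlockingQueue for fixed-size needs and predictable memory use, LinkedBlockingQueue when producers and consumers contend heavily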

Key Takeaways

Next Steps

Continue learning concurrency:

Asynchronous Patterns - Futures and async/await
Lock-Free Programming - CAS and atomic operations

Mastering concurrent collections is essential for building thread-safe systems! 📦
