Fabulous Adventures in Data Structures and Algorithms

Eric Lippert

MEAP began October 2025
Last updated March 2026
Publication in Summer 2026 (estimated)

ISBN 9781633435032
400 pages (estimated)

printed in black & white

Table of contents

1 Starting a fabulous adventure

1.1 Defining data structures, algorithms and complexity

1.2 An immutable linked list

1.2.1 Performance so far

1.2.2 Reversing an immutable linked list

1.3 The challenges ahead

1.4 Summary

PART 1: EXTENDING THE BASICS

2 Immutable stacks and queues

2.1 Why immutability?

2.1.1 Correctness

2.1.2 Historical preservation

2.1.3 Security: TOCTOU me about it

2.1.4 Safer multithreading

2.1.5 Memoization for time performance

2.1.6 Persistence for space performance

2.1.7 The functional programming attitude

2.2 An immutable stack

2.2.1 A covariant immutable stack

2.3 A queue, a queue, an immutable queue

2.4 Mutable wrappers

2.5 Undo and redo

2.5.1 Create an army of clones

2.5.2 Use a different data structure and the command pattern

2.5.3 Undo and redo with mutable-over-immutable data structures

2.6 The Hughes List: build it for cheap, pay for it later

2.6.1 Reverse redux

2.6.2 Currying and partial application

2.6.3 Implementing the Hughes list

2.6.4 Complexity of the Hughes list

2.6.5 Where is the data in this data structure?

2.6.6 A couple more performance considerations

2.6.7 Reverse a linked list

2.7 Summary

3 An immutable deque

3.1 An immutable deque ADT

3.2 A bad naïve implementation

3.3 A finger tree

3.3.1 Step one: the mini-deque

3.3.2 A new definition of a deque

3.4 Visualizing the data structure

3.5 Amortized performance of the deque

3.6 Are we abusing the type system?

3.7 Concatenation of deques

3.7.1 Performance after adding concatenation

3.8 Why is this called a “finger tree”?

3.9 Summary

4 Memoizing immutable quadtrees to make a better Life

4.1 The rules of Life

4.2 A typical first attempt

4.2.1 Performance of the naïve implementation

4.2.2 Same algorithm, better constant factor

4.2.3 Change tracking improves the algorithm

4.3 An immutable quadtree

4.3.1 The IQuad interface

4.3.2 Implementing the 0-quad leaf cells

4.3.3 A strategy for compressing space

4.3.4 A general-purpose memoizer

4.3.5 A memoized quadtree implementation

4.3.6 Indexing an immutable quadtree like an array

4.3.7 A few more helpful extension methods

4.4 Gosper’s algorithm

4.4.1 The base case: stepping a 2-quad produces a 1-quad

4.4.2 A first attempt at a recursive algorithm

4.4.3 The grid always shrinks

4.4.4 Is this algorithm inefficient?

4.4.5 The big insight of Gosper’s Algorithm

4.4.6 Putting it all together

4.5 Summing up

5 What’s up with you, Directed Acyclic Word Graph?

5.1 Two problems, two weak solutions

5.1.1 Hash sets are a non-solution

5.1.2 Sorted lists are… fine?

5.2 The prefix tree

5.2.1 Nodes and edges

5.2.2 The trie building algorithm

5.2.3 Using a trie as a word list

5.2.4 How much memory are we saving with a trie?

5.3 Tries are not-so-secretly finite state automata

5.4 Building a DAWG

5.4.1 What problems must we solve to build a DAWG?

5.4.2 Equivalence of nodes

5.4.3 The DAWG building algorithm.

5.4.4 How much expense does optimizing the graph as you go add?

5.5 DAWG vs trie for ENABLE

5.6 Summing Up

6 Combinatorial algorithms

6.1 The Cartesian product

6.1.1 The Cartesian product of a few sets or sequences

6.1.2 The Cartesian product of arbitrarily many sequences

6.1.3 The connection to the integers

6.2 Permutations

6.2.1 Lexicographic permutations with repetitions

6.2.2 The factorial base representation of permutation numbers

6.2.3 The Fisher-Yates shuffling algorithm

6.2.4 A recursive “change ringing” algorithm

6.2.5 Even’s “change ringing” algorithm

6.3 Combinations

6.3.1 Lexicographic combinations, with a twist

6.3.2 Counting combinations

6.3.3 The combinatorial base representation of combinations

6.4 Summary

7 Interlude one: basic category theory for programmers

7.1 Covariant endofunctors in the category of types

7.2 Contravariant endofunctors in the category of types

7.3 Summary

PART 2: SEARCHING, SOLVING, INFERRING

8 Coloring graphs with backtracking search

8.1 Coloring South America

8.2 An immutable multi-dictionary

8.3 An immutable undirected graph

8.4 Coloring simple graphs

8.5 Solving sudokus with backtracking search

8.5.1 Graph coloring is NP-complete

8.5.2 Implementing a general backtracker

8.5.3 How could we improve?

8.5.4 Backtracking and the Cartesian product

8.6 Scheduling problems are graph coloring problems

8.7 Summary

9 Greedy iterative pretty printing

9.1 The pretty printing problem

9.2 Greedy algorithms, and making change

9.3 A greedy pretty printing algorithm

9.3.1 The doc data structure

9.3.2 Visualizing a doc; how deep does it get?

9.3.3 Phase two: implementing Fits() without recursion

9.3.4 Phase two continued: implementing Pretty() without recursion

9.4 Phase one: transforming parse trees with the visitor pattern

9.4.1 Implementing the visitor pattern

9.4.2 From parse tree to doc

9.5 Summary

10 Unification of binary terms

10.1 Unifying binary terms

10.1.1 Unifying binary terms, attempt one

10.1.2 Unifying binary terms, this time with an occurs check

10.2 The performance of binary term unification

10.3 Type inference and logic programming

10.4 Summary

11 Anti-unification of binary terms

11.1 Anti-unifying binary terms

11.2 The first-order binary term anti-unification algorithm

11.2.1 Performance of Reynolds’ anti-unification algorithm

11.3 Clone detection and fix deduction

11.4 Summary

12 Interlude two: monads

12.1 What’s the value for OO programmers?

12.2 How hard can it be to divide by two?

12.3 Generalizing to arbitrary functions with Map and Bind

12.4 The sequence monad is an additive monad

12.5 Summary

PART 3: PROBABILITIES

13 A better abstraction for randomness

14 The alias algorithm for the categorical distribution

15 Conditional probability with Bayes’ Theorem

16 Interlude three: the probability monad

17 Markov processes

18 Continuous distributions with the Metropolis algorithm

Overview

7 Basic category theory for programmers

Programming is fundamentally about building layers of abstraction, and category theory offers a high-level, unifying way to reason about those abstractions. Rather than studying numbers, it studies relationships and composition, earning the tongue-in-cheek label “generalized abstract nonsense.” The chapter motivates its use both practically (as compositional design patterns) and theoretically (as a way to see deeper structure in everyday programming), then sets out to connect category-theoretic ideas—categories, objects, morphisms, and functors—to real language design questions like whether a sequence of giraffes can be treated as a sequence of animals.

A category consists of objects and morphisms (arrows) with two key properties: each object has an identity arrow to itself, and arrows compose (if you can go from A to B and from B to C, you can go from A to C). Simple “posetal” categories model familiar partial orders, like numbers ordered by ≥ or people ordered by ancestry. Building on ordinary functions, the chapter introduces functors: mappings between categories that preserve morphisms. When such a mapping goes from a category to itself it’s an endofunctor; when it preserves both the existence and direction of arrows, it’s covariant.
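
The two rules can be checked mechanically for a small posetal category. The following is a minimal sketch in Python, not code from the book: the names `posetal_arrows` and `is_category` are hypothetical, and the model assumes at most one arrow between any two objects, which is exactly the posetal case.

```python
from itertools import product


def posetal_arrows(objects, related):
    """Arrows (source, target) of the thin category induced by `related`."""
    return {(x, y) for x, y in product(objects, repeat=2) if related(x, y)}


def is_category(objects, arrows):
    """Check the two rules: identity arrows exist and arrows compose."""
    has_identities = all((x, x) in arrows for x in objects)
    composes = all(
        (a, d) in arrows
        for (a, b), (c, d) in product(arrows, repeat=2)
        if b == c
    )
    return has_identities and composes


numbers = {0, 1, 2}
geq = posetal_arrows(numbers, lambda x, y: x >= y)

# Dropping the composite arrow 2 -> 0 breaks the composition rule.
broken = geq - {(2, 0)}

print(len(geq))                      # 6
print(is_category(numbers, geq))     # True
print(is_category(numbers, broken))  # False
```

Three numbers ordered by ≥ give three identity arrows plus three arrows between distinct numbers, six in total.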

Viewing types as objects and “assignment compatibility” as morphisms links category theory directly to variance in type systems. A covariant interface such as a sequence corresponds to a covariant endofunctor T ↦ IE<T>, where IE<T> is a simplified IEnumerable<T>, provided that all nested generic constructions are included in the category; this justifies why IEnumerable<Giraffe> can be used where IEnumerable<Animal> is expected, while mutable lists are not safely covariant. Contravariant interfaces such as comparers (written IC<T>) reverse arrow direction: if Giraffe → Animal, then IC<Animal> → IC<Giraffe>, which is where the name “contravariant” comes from. The chapter closes by noting how this lens clarifies safety-driven language rules and hints at lattice-theoretic structure in the space of types.

A small category with three objects, represented by the nodes, and six morphisms, represented by the arrows.

A strangely familiar small category with four objects and ten morphisms.

The function F is represented as dashed arrows mapping from one category to another.

The category of four types with the assignment compatibility morphism; the identity morphisms are elided for clarity. Note that both Giraffe and Tiger have morphisms to Animal but not to each other. A Giraffe can’t be assigned to a variable of type Tiger and a Tiger can’t be assigned to a variable of type Giraffe, but both can be assigned to a variable of type Animal or Object.

Adding four constructed generic types to our category. Notice how the structure of the morphisms of the interface types looks very similar to that of the original four types.

The mapping of a function from types to types, shown as dashed arrows.

Part of another infinite category of types, this time with a contravariant interface.

Summary

- Category theory is a modern mathematical discipline that studies relationships between arbitrary objects. It’s cheekily characterized as “generalized abstract nonsense”.
- A category is a collection of objects that have reflexive, transitive morphisms; we can think of a morphism as an arrow going from one object to another.
- A function that maps objects of one category to another such that morphisms are preserved is a covariant functor. One that preserves but reverses morphisms is a contravariant functor. If the two categories are the same, it is an endofunctor.
- The assignment compatibility relation “an expression of type X may be assigned to a variable of type Y” is a common morphism on a category where types are objects.
- Covariant and contravariant interfaces are called that because generic interfaces can be thought of as covariant or contravariant functors: functions from types to types that preserve assignment compatibility morphisms, keeping their direction in the covariant case and reversing it in the contravariant case.
- Posetal categories, which define a partial order on a set of objects, are common and particularly useful when objects are types in a programming language. Semilattices are posets that have the additional nice property that you can always find the most specific type that is assignment compatible with any two types.
- The relationship between category theory and analysis of programming languages is very deep; we’ve just scratched the surface. We’ll go a little deeper in subsequent interludes.

FAQ

What is category theory, and why do some call it “generalized abstract nonsense”?

Category theory studies relationships between things, regardless of what the “things” are. Because it abstracts away the nature of the objects and focuses purely on structure and relationships, practitioners jokingly call it “generalized abstract nonsense.”

What are objects and morphisms in a category?

Objects are the “things” in a category (they can be anything). Morphisms are directed arrows between objects that capture “you can get from here to there.” Two rules apply: every object has an identity morphism to itself (reflexive), and if there’s A→B and B→C then there’s A→C (transitive).

How do functions, morphisms, and functors differ?

- A function maps an object to a single related object (possibly across categories) without necessarily respecting the category’s arrows.
- A morphism is an arrow inside a category that represents a permissible move from one object to another.
- A functor maps objects (and arrows) from one category to another, preserving the existence of morphisms. Covariant functors preserve arrow direction; contravariant functors reverse it.

What is a posetal category?

A posetal category arises from a partial order: there’s a morphism X→Y exactly when X ≤ Y. Examples include numbers ordered by “≥” and a family tree ordered by “is an ancestor of (or the same person).”

What is a covariant functor? Can you give an intuitive example?

A covariant functor maps objects between categories and preserves arrow direction: if X→Y then F(X)→F(Y). Example: map 0, 1, 2 to family members Bob, Bruce, Eric in a way that preserves “≤” arrows as “is ancestor of” arrows. Every arrow in the source is mirrored in the target with the same direction.
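
The numbers-to-family example can be sketched as a small check in Python. This is an illustration under stated assumptions, not the book's code: the text does not say who descends from whom, so the sketch assumes Bob is an ancestor of Bruce, who is an ancestor of Eric, and the helper names `is_covariant` and `is_contravariant` are hypothetical.

```python
# Source category: 0, 1, 2 with an arrow x -> y whenever x <= y.
leq = {(x, y) for x in range(3) for y in range(3) if x <= y}

# Target category: family members, ASSUMED to be listed oldest first,
# with an arrow p -> q whenever p is an ancestor of q or the same person.
family = ["Bob", "Bruce", "Eric"]
ancestor_or_same = {
    (family[i], family[j]) for i in range(3) for j in range(3) if i <= j
}


def is_covariant(mapping, source_arrows, target_arrows):
    """Every arrow x -> y must appear as mapping[x] -> mapping[y]."""
    return all((mapping[x], mapping[y]) in target_arrows
               for x, y in source_arrows)


def is_contravariant(mapping, source_arrows, target_arrows):
    """Every arrow x -> y must appear reversed, as mapping[y] -> mapping[x]."""
    return all((mapping[y], mapping[x]) in target_arrows
               for x, y in source_arrows)


oldest_first = {0: "Bob", 1: "Bruce", 2: "Eric"}
youngest_first = {0: "Eric", 1: "Bruce", 2: "Bob"}

print(is_covariant(oldest_first, leq, ancestor_or_same))        # True
print(is_covariant(youngest_first, leq, ancestor_or_same))      # False
print(is_contravariant(youngest_first, leq, ancestor_or_same))  # True
```

Flipping the mapping keeps every arrow but reverses its direction, which is the contravariant case.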

What is an endofunctor, and why does mapping T ↦ IE<T> require infinitely many types?

An endofunctor maps a category to itself while preserving morphisms. The mapping T ↦ IE<T> (a simplified enumerable) is an endofunctor only if it’s defined for every object in the category, including nested types like IE<IE<T>>, IE<IE<IE<T>>>, and so on—hence an infinite collection of constructed types.

How does category theory explain covariance and contravariance in generic interfaces?

- Covariance (e.g., IE<out T>) preserves direction: if Giraffe → Animal, then IE<Giraffe> → IE<Animal> is allowed.
- Contravariance (e.g., IC<in T>) reverses direction: if Giraffe → Animal, then IC<Animal> → IC<Giraffe> is allowed. A comparer that can handle animals can also handle giraffes.
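
The two rules above can be modeled at run time. The sketch below is written in Python rather than the book's language and rests on assumptions: constructed types are represented as hypothetical `("IE", T)` and `("IC", T)` tuples, plain classes stand in for the animal types, and `assignable` is an illustrative function that plays the role of the assignment compatibility morphism.

```python
class Animal: pass
class Giraffe(Animal): pass
class Tiger(Animal): pass


def IE(t):
    """Hypothetical stand-in for the constructed type IE<t>."""
    return ("IE", t)


def IC(t):
    """Hypothetical stand-in for the constructed type IC<t>."""
    return ("IC", t)


def assignable(source, target):
    """Is there a morphism source -> target in this model?"""
    if isinstance(source, tuple) and isinstance(target, tuple):
        (s_name, s_arg), (t_name, t_arg) = source, target
        if s_name != t_name:
            return False
        if s_name == "IE":  # covariant: arrow keeps its direction
            return assignable(s_arg, t_arg)
        if s_name == "IC":  # contravariant: arrow is reversed
            return assignable(t_arg, s_arg)
        return False
    if isinstance(source, tuple):
        return target is object  # everything is assignable to object
    if isinstance(target, tuple):
        return False
    return issubclass(source, target)


print(assignable(IE(Giraffe), IE(Animal)))  # True
print(assignable(IE(Animal), IE(Giraffe)))  # False
print(assignable(IC(Animal), IC(Giraffe)))  # True
print(assignable(IC(Giraffe), IC(Animal)))  # False
```

Because `assignable` recurses into type arguments, the same rules extend to nested constructions such as IE<IE<Giraffe>>.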

Why is IEnumerable<T> covariant but IList<T> is not?

IEnumerable<T> only produces T (T appears in output positions), so treating IEnumerable<Giraffe> as IEnumerable<Animal> is safe. IList<T> both produces and consumes T (you can insert), so allowing IList<Giraffe> where IList<Animal> is expected would permit inserting a Tiger into a list of giraffes, which is unsafe.
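
The failure mode can be illustrated with a run-time model. Python has no compile-time variance check, so this sketch assumes a hypothetical `TypedList` class that enforces its element type on insertion; it shows what would go wrong if a list of giraffes were handed to code written against a list of animals.

```python
class Animal: pass
class Giraffe(Animal): pass
class Tiger(Animal): pass


class TypedList:
    """Hypothetical mutable list that checks element types on insert."""

    def __init__(self, element_type):
        self.element_type = element_type
        self._items = []

    def insert(self, item):
        if not isinstance(item, self.element_type):
            raise TypeError(
                f"{type(item).__name__} is not a "
                f"{self.element_type.__name__}"
            )
        self._items.append(item)

    def __iter__(self):
        return iter(self._items)


def count_animals(animals):
    """Only reads from the sequence, so any sequence of animals is fine."""
    return sum(1 for a in animals if isinstance(a, Animal))


def add_tiger(animals):
    """Written against a list of Animal, where inserting a Tiger is legal."""
    animals.insert(Tiger())


giraffes = TypedList(Giraffe)
giraffes.insert(Giraffe())

print(count_animals(giraffes))  # 1, reading is safe

try:
    add_tiger(giraffes)  # writing is where covariance breaks down
except TypeError as error:
    print("rejected:", error)
```

A statically typed language with safe variance rules rejects the second call before the program runs, instead of failing at run time as this model does.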

Why do arrows reverse for contravariant interfaces?

Under T ↦ IC<T> (a comparer), if there’s a morphism X→Y (e.g., Giraffe → Animal), the induced arrow goes IC<Y> → IC<X>. Intuition: something able to consume broader inputs (animals) can stand in where narrower inputs (giraffes) are required, hence the reversal.

What do “small category” and “semilattice” mean here, and why do they matter?

“Small” means the objects/morphisms form a set (possibly infinite). The category of types with assignment compatibility behaves like a semilattice, which lets us compute “the most specific common supertype” (e.g., Giraffe and Tiger both fit into Animal). Compilers use such lattice structures in type inference and analysis.
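
Finding the most specific common supertype can be sketched in a few lines. This is an illustration using Python classes, not the book's code: the function name is hypothetical, and it assumes single inheritance, so that each class's method resolution order is a simple chain from the class up to `object`.

```python
class Animal: pass
class Giraffe(Animal): pass
class Tiger(Animal): pass


def most_specific_common_supertype(a, b):
    """Walk a's supertypes from most to least specific and return the
    first one that is also a supertype of b.

    Assumes single inheritance; `object` is always a common supertype,
    so the loop always returns.
    """
    b_supertypes = set(b.__mro__)
    for candidate in a.__mro__:
        if candidate in b_supertypes:
            return candidate


print(most_specific_common_supertype(Giraffe, Tiger).__name__)   # Animal
print(most_specific_common_supertype(Giraffe, Animal).__name__)  # Animal
print(most_specific_common_supertype(Giraffe, str).__name__)     # object
```

With multiple inheritance or interfaces, two types can share several unrelated supertypes, so a unique answer is no longer guaranteed by this simple walk.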

