AI Agents in Action, Second Edition you own this product

Intelligent workflows with LLMs, MCP, A2A, and more

Micheal Lanham

MEAP began November 2025
Last updated March 2026
Publication in Summer 2026 (estimated)

ISBN 9781633434530
325 pages (estimated)

Included with a Manning Online subscription

printed in black & white

available in Complex Chinese

catalog / Data Science / AI / AI Agents

resources: Source code Book forum Source code on Github

table of content

1 The rise of AI agents

1.1 Defining agents and agentic thinking

1.1.1 Understanding agent/assistant and LLM patterns

1.1.2 Thinking like agents

1.1.3 Agents act with tools

1.2 Introducing the Model Context Protocol (MCP)

1.3 Understanding the functional layers of an agent

1.3.1 The Agent Persona

1.3.2 Agent Actions & Tools

1.3.3 Agent Reasoning & Planning

1.3.4 Agent Knowledge & Memory

1.3.5 Agent Evaluation & Feedback

1.4 Advancing onto multi-agent systems

1.4.1 The agent-flow assembly line

1.4.2 Agent orchestrations (hub-and-spoke)

1.4.3 Agent collaboration (teams of agents)

1.5 Summary

2 Core components: Large Language Models, prompting, and agents

2.1 Understanding Large Language Models

2.1.1 LLMs: Probabilistic Token Machines

2.1.2 What is a token?

2.1.3 Tuning Temperature, Top P, and more

2.2 Controlling LLMs with prompt engineering (Agent Persona)

2.2.1 Applying core prompt techniques

2.2.2 Thinking like an LLM

2.2.3 Avoiding common prompt pitfalls

2.3 Building an agent with OpenAI Agents

2.3.1 Building a minimal agent

2.3.2 Setting the Agent Model and other parameters

2.3.3 Controlling inputs and typed outputs

2.3.4 Tracing agents

2.4 Enhancing agents through tool integration

2.4.1 Providing agents with tools

2.4.2 Tracing agentic tool use

2.5 Exercises

2.6 Summary

3 Actions with Model Context Protocol for AI agents

3.1 Understanding MCP fundamentals for agent development

3.1.1 The standardization problem MCP solves

3.1.2 MCP architecture: Clients, servers, and services

3.1.3 Core components: Tools, resources, and prompts

3.1.4 MCP deployment patterns for agents

3.1.5 MCP powers the functional agent layers

3.2 Getting started with MCP Servers

3.2.1 Coding up an MCP Server for Claude

3.2.2 Using the MCP inspector

3.2.3 Understanding MCP transport types

3.2.4 From desktop to agents: the key differences

3.3 Actioning MCP servers for Agents

3.3.1 Actioning local MCP servers over STDIO with agents

3.3.2 Actioning local MCP servers over SSE with agents

3.3.3 Connecting to the standard MCP servers

3.4 Building MCP servers for agents

3.4.1 Converting tools to an MCP server

3.4.2 Consuming MCP servers locally or remotely

3.5 Exercises

3.6 Summary

4 Architecting and building multi-agent systems

4.1 Architecting multi-agent systems

4.1.1 Decision-making for agent systems

4.1.2 Communicating with shared-memory, message-passing, and MCP

4.1.3 Channeling multi-agent coordination strategies

4.2 Balancing agents with agentic flows

4.2.1 Transforming agents to agent flows

4.2.2 Building an Agent-to-Agent flow

4.2.3 Agency and decision making in agent flows

4.3 Understanding handoffs in aAgent flows

4.3.1 Agent-to-agent flow with handoffs

4.3.2 Visualizing agent flows

4.3.3 Monitoring the handoff

4.4 Validating agent flows with guardrails

4.4.1 Implementing input and output guardrails

4.4.2 Using agents as guardrails

4.4.3 Adding guardrails to pass off agent flows

4.5 Exercises

4.6 Summary

5 Agent Reasoning and Planning

5.1 Understanding LLM Reasoning and Planning

5.1.1 Chain of Thought Reasoning

5.1.2 ReAct Paradigm (Reasoning + Acting + Observing)

5.1.3 Planning with LLMs

5.2 Instructing agents to reason and plan

5.2.1 Applying CoT to an Agent

5.2.2 Implementing ReAct with Agents

5.3 Advanced reasoning with agents

5.3.1 Tree of Thought

5.3.2 Reflexion

5.3.3 Selecting the right pattern for your agents

5.4 Utilizing the Sequential Thinking MCP Server

5.4.1 Unchaining the Sequential Thinking Server

5.4.2 Revisiting time travel problems with Sequential Thinking

5.4.3 Advanced reasoning with sequential thinking

5.5 Exercises

5.6 Summary

6 Working with memory and knowledge RAG for agents

6.1 Understanding retrieval in AI applications

6.1.1 The basics of retrieval augmented generation (RAG)

6.1.2 Delving into semantic search and document indexing

6.1.3 Applying vector similarity search

6.2 Vector databases and similarity search

6.2.1 Demystifying document embeddings

6.2.2 Querying document embeddings from Chroma

6.3 Building practical RAG knowledge agents

6.3.1 Everything begins with search and relevance

6.3.2 Building a vector search RAG agent

6.3.3 Building a hybrid search RAG agent

6.4 Adding memory to agents with MCP

6.4.1 Understanding memory form and agent function

6.4.2 Implementing a graph database for memory using MCP

6.4.3 Creating hybrid memory systems with MCP

6.4.4 Semantic augmented memory and applications to semantic, episodic, and procedural memory

6.4.5 Uncluttering memory with compression and forgetting

6.5 Exercises

6.6 Summary

7 Building robust agents with evaluation and feedback

7.1 Introducing agent evaluation and feedback

7.2 Implementing test-driven agent development

7.2.1 Exploring TDAD in practice

7.2.2 Coding and testing the RAG agent

7.2.3 Refactoring the agent

7.2.4 Extending evaluation with an agent evaluator

7.3 Employing grounding, critic, and evaluation agents

7.3.1 Reviewing the grounding agent

7.3.2 Grounding the RAG agent

7.3.3 Implementing grounding agents as guardrails

7.3.4 Understanding the role of rubrics in evaluation

7.3.5 Building a rubric critic agent

7.4 Phoenix for evaluation and feedback

7.4.1 Connecting to Phoenix

7.4.2 Adding metadata and session tracking

7.4.3 Experimenting with evaluators

7.4.4 Providing feedback with Annotations

7.5 Exercises

7.6 Summary

8 Deploying agents and agentic systems

8.1 Strategies for consuming agents

8.1.1 Embedding real-time voice agents into web applications

8.1.2 Hosting agents through an API

8.1.3 Consuming an agent web service in a web application

8.2 Dockerizing agent systems

8.2.1 Containerizing an agent microservice

8.2.2 Orchestrating agentic systems with Docker Compose

8.2.3 Externalizing local agent microservices

8.3 Considering advanced deployment strategies

8.3.1 Choosing a runtime: edge, API, or event-driven

8.3.2 The three “wires” of communication

8.3.3 Practical multi-agent topologies that adapt well

8.3.4 State, memory, and idempotency

8.3.5 Release engineering for agents (prompts, tools, models)

8.3.6 Observability matters

8.3.7 Reliability patterns: timeouts, fallbacks, and budgets

8.3.8 Cost control and model routing

8.4 Security, safety, and governance in production

8.4.1 A quick threat model for agentic systems

8.4.2 Identity and access—for people, services, and agents

8.4.3 Secrets and configuration management

8.4.4 Tool safety: sandboxing and egress control

8.4.5 Prompt-injection and data-exfiltration defenses

8.4.6 Safety and policy enforcement

8.5 Exercises

8.6 Summary

9 Engaging GPT Assistants

10 Exploring collaborative agent systems

11 Troubleshooting

Overview

4 Architecting and building multi-agent systems

This chapter explains how to architect and build multi‑agent systems, outlining when to graduate from a single agent to a coordinated team. It compares three foundational patterns—flow (assembly line), orchestrator (hub‑and‑spoke), and collaboration (peer‑to‑peer)—and frames design choices around the core levers of decision‑making, control, and communication, with coordination as a fourth dimension. The discussion also surveys common communication styles (selective message passing, shared conversational context, and standardized agent protocols) and execution strategies (sequential and parallel pipelines, hierarchical delegation, iterative critique and refinement, voting/ensembles, role‑playing, conditional routing, and decentralized peer networks), emphasizing trade‑offs in cost, latency, predictability, and fault tolerance.

Practically, the chapter shows how overloaded single agents are decomposed into focused, specialized agents connected in an agent‑to‑agent flow to improve reliability, cost, and clarity of roles. It recommends structuring interfaces with typed inputs/outputs to make handoffs explicit and easier to validate, and using external code for key branching decisions when determinism is required. Multiple handoff approaches are contrasted: passing outputs explicitly between agents, sharing a common conversation thread, and using framework‑level handoffs that let agents transfer control without bespoke glue code. Developers are encouraged to visualize flows and instrument handoffs to understand what data moves between agents and why.

To keep multi‑agent workflows robust, the chapter introduces guardrails that validate and, when needed, correct inputs, outputs, and inter‑agent transfers—either with straightforward programmatic checks or by delegating the validation itself to specialized “guardrail agents.” It illustrates recovery patterns such as retries and fallbacks, and shows how guardrails integrate with pass‑off flows for fine‑grained control. While orchestration can centralize planning and delegation, the guidance is to start simple with flows, prefer structured data and minimal context, add deterministic checkpoints where variability hurts, and layer guardrails and monitoring as complexity grows—mixing patterns judiciously only after establishing a stable, comprehensible baseline.

The three well-established patterns for building multiple agent systems, from the agent flow (assembly line), orchestrator (hub-and-spoke), to the collaboration (peer to peer).

Decision-making (command) and control demonstrated for more specialized multi-agent architectures, the flow, orchestration and manager architectures.

Comparison of different agent communication patterns represented in a flow.

Various ways agents may coordinate execution sequentially or in parallel.

A single agent is transformed into a multi-agent flow of agents. The agent role is broken down into three well-defined and distinct roles that encapsulate a well-defined set of tools.

shows an agent-to-agent flow with a deterministic (coded) decision point added. Taking the decision away from the agents and making it deterministic keeps the workflow more consistent.

three agent flow communication patterns demonstrating how agents can pass messages from one another and in turn pass command and control from agent to agent.

There are two ways to visualize the agent flow using draw_graph and the Traces page that can be fournd on the OpenAI Dashboard under logs.

Guardrails can be used to validate and control input and output of an agent.

Traces page after executing the Orchestration agent shows X and Y.

Summary

Single-agent designs often hit scalability walls; converting a monolithic agent into an agentic flow (a chain of specialised agents) restores clarity, extensibility, and performance.
Agent-to-agent flows work like prompt-chaining with superpowers: each node can invoke tools and reason independently yet pass concise, typed outputs downstream to keep the context lean.
Insert deterministic decision points (code or schema checks) wherever the flow must repeat reliably; don’t rely on stochastic LLM judgment for pass/fail branches.
The OpenAI Agents SDK supports two hand-off styles: conversational (shared thread) and pass-off (explicit code routing). Choose conversational for speed and pass-off for fine-grained control.
Use the handoffs field of the agent plus clear instructional prompts to enable internal hand-offs that require zero extra orchestration code.
Visualise complex flows early using draw_graph(), and the Dashboard Traces view reveals hidden loops, tool chains, and latency hotspots.
Wrap risky transfers in guardrails—input/output validators that reject, retry, or correct data before it corrupts the flow; tripwires surface as explicit exceptions you can loop on.
Guardrails themselves can be LLM-powered agents, giving you natural-language policies without brittle regex or length checks—remember they, too, need schemas and tests.
Tool limits still matter in multi-agent worlds: every registered tool inflates every call; keep each agent’s tool list tight (< 10) and scoped to its role to avoid token bloat.
Flow, orchestration, and manager-worker are the three canonical decision patterns. Start with plain flows and graduate to orchestrators only when centralised delegation is truly required.
Choose one communication layer (shared memory, message passing, MCP, or emerging A2A protocol) per project; mixing channels multiplies debugging pain.
A production-ready agentic flow blends typed I/O, deterministic checkpoints, visual traces, scoped tool sets, and guardrails—yielding pipelines that can scale, recover, and evolve without surprise.
Agents may be coordinated using multiple different strategies: Sequential Flow, Parallel Delegation, Hierarchical Coordination, Iterative Debate and Refinement, Voting / Best-of-N (Ensemble), Role-Playing Collaboration, Conditional Routing (Branching), and Peer-to-Peer Network

FAQ

When should I move from a single agent to a multi-agent flow?

Move when a single agent becomes overloaded with tools or long prompts, when specialization would improve quality, or to reduce cost/latency by limiting each agent’s tools and context. Splitting into focused agents typically makes behavior easier to reason about and maintain.

What are the three core multi-agent architecture patterns?

- Flow (agent-to-agent pipeline): command, control, and context pass along an assembly line. Simple and reliable starting point.
- Orchestrator (hub-and-spoke): a central agent decides and delegates control to workers. Good for complex decomposition but adds overhead.
- Collaboration (peer-to-peer): agents share a common channel and coordinate together. Flexible but harder to design and debug.

How do decision-making, control, and communication apply to agents?

- Decision-making (command): who decides what to do and when. Limit context to what’s necessary so decisions stay effective.
- Control: who can act (use tools, perform tasks). Often passed between agents in a flow or by an orchestrator.
- Communication: how agents share context (fully shared vs. restricted). Restricting context can reduce confusion and token cost.

What communication strategies can agents use?

- Pass-off messaging: code passes only the necessary inputs/outputs to the next agent (maximum control, more code).
- Shared conversation/memory: all agents see the same thread (easy, but more tokens and possible confusion).
- Protocol/tool calls (e.g., MCP): treat another agent/tool as a function, tightly constraining what’s exchanged.

How can I coordinate agents (sequentially or in parallel)?

- Sequential pipeline: simple, deterministic handoffs; slower, brittle if a step fails.
- Parallel delegation: run independent steps concurrently, then merge results.
- Hierarchical coordination: a manager decomposes tasks and mixes parallel/serial work.
- Iterative debate/refinement: agents critique and improve outputs over rounds for higher quality.

What’s the difference between handoffs and manual pass-offs?

Handoffs are built-in transitions where one agent internally transfers control to the next (less glue code, faster to build). Manual pass-offs are coded transitions that let you validate, filter, or transform data between agents (more control, more engineering effort).

How do I validate agent inputs/outputs and handoffs?

Use guardrails to check or correct data at flow boundaries or between agents. You can implement input and output guardrails with code (e.g., length checks) or with “guardrail agents” that inspect and annotate data. Tripwires let you fail fast, retry, or route to fallback behavior.

How can I make multi-agent flows more deterministic and predictable?

- Add external, coded decision points for critical branches.
- Strongly type inputs/outputs (schemas/models) to reduce parsing ambiguity.
- Limit each agent’s tools and context to only what’s necessary.
- Use guardrails and retries to enforce minimum quality and recover from variability.

How do I monitor and visualize multi-agent systems?

- Use a handoff wrapper/callback to inspect what data triggered a handoff and what was passed to the next agent.
- Visualize the graph of agents and handoffs to understand structure and dependencies.
- Use tracing to inspect calls, tool use, and data flow for debugging.

When should I choose orchestration over flows, and what are the trade-offs?

Pick orchestration when you need a central planner to dynamically decompose and delegate complex tasks. Trade-offs: more complexity, potential planning overhead and failure points, and risk of losing specialized details if instructions are not precise. As a rule, start with flows, gain stability, then consider orchestration if needed.

pro $24.99 per month

access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
choose one free eBook per month to keep
exclusive 50% discount on all purchases
renews monthly, pause or cancel renewal anytime

lite $19.99 per month

access to all Manning books, including MEAPs!

team

5, 10 or 20 seats+ for your team - learn more

eBook

pdf, ePub, online

$47.99 $33.59

you save $14.40 (30%)

pro $24.99 per month

access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
choose one free eBook per month to keep
exclusive 50% discount on all purchases
renews monthly, pause or cancel renewal anytime

lite $19.99 per month

access to all Manning books, including MEAPs!

team

5, 10 or 20 seats+ for your team - learn more

eBook

$47.99 $33.59

you save $14.40 (30%)

eBook

pdf, ePub, online

$47.99 $33.59

you save $14.40 (30%)

pro $24.99 per month

access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
choose one free eBook per month to keep
exclusive 50% discount on all purchases
renews monthly, pause or cancel renewal anytime

lite $19.99 per month

access to all Manning books, including MEAPs!

team

5, 10 or 20 seats+ for your team - learn more