Overview

2 Software Defined Products and Architectures

Platform engineering is positioned as the creation of a durable, self-serve internal product that removes friction for software teams while meeting quality, security, and compliance expectations. Success depends as much on organizational design, decision habits, and engineering culture as on technology. The chapter stresses clear differentiation between customers (the developers who use the platform) and stakeholders (influencers such as security, finance, and legal), urging that stakeholder needs be expressed as service interfaces or outcome standards rather than prescribed implementations. Effective domain boundaries, measured through actual internal product usage, and a relentless focus on the developer experience are the primary yardsticks for platform value.

The recommended operating model is product-led: an empowered technical product owner, end-to-end lifecycle optimization from ideation to operations, “service-interface-first” capabilities, and rapid, feedback-driven increments that reach early adopters quickly. Architectural and tooling choices are validated through real experiments before commitments. The platform must be fully software-defined—everything in version control, delivered by automated pipelines—following a disciplined SDLC (plan, design, code, build/test, release, observe, operate). Observability-driven development expands “definition of done” to include instrumentation, dashboards, alerts, and change events so teams can understand behavior, detect regressions, and verify that delivered features create measurable value.

Architecturally, the chapter advocates designing for evolution: API-first patterns, domain-driven platform design, and automated architectural fitness functions that keep coupling in check, protect nonfunctional requirements, and enable portability. Backlog management is value-centric and incremental, informed by real user behavior, stakeholder input framed as outcomes, and market learning. Cloud-native foundations are preferred to reduce undifferentiated heavy lifting; extensible control planes enable policy enforcement, automation, and governance at scale, while serverless trade-offs are weighed when extensibility is limited. Start with a single team and a unified roadmap, expand to domain-aligned teams as demand grows, and preserve self-serve boundaries and cost efficiency throughout.

A Technical Product Owner (TPO, might be referred to as a Product Manager) serves as the voice of the customer, staying deeply connected to what the developers need and their experience in using the solution provided by the platform, while constantly assessing the measurable value being provided.
Internal stakeholders have legitimate concerns. But these outcomes can be achieved without the stakeholders controlling the implementation.
A Stakeholder Map visualizes a stakeholder’s influence over the backlog against their alignment to platform objectives. High-influence stakeholders must be highly aligned for a platform to succeed.
Look at all the activities, from ideation to operations, and assess engineering quality and impacts from organizational or architectural decisions. Yrjö Engeström’s human activity model inspires this Developer Activity model[2].
shows the Progression of an MVP delivery process. A fully functional product does not need to be delivered at the start, but all phases should deliver some incremental value to the customer.
shows the evolutionary path of platform development teams. Often, while an EP starts as an effort driven by a single team, multiple teams spin up over time to handle specific aspects of platform functions, with all development efforts under a top-level product owner.
The software delivery lifecycle (“SDLC”) covers all stages from feature planning through running in production.
With every change you make to the platform, think about the kind of data you will need in order to understand how the new technology, service, or feature is behaving. Alerting from service failure is not enough. When we deploy automatic configuration behavior or control guardrails, even healthy behavior could be interpreted as failure.
New features or changes should go through a Value flow. There will be routine operations and refactoring going on simultaneously, and using this flow will help us stay clear on how our current work fits into the bigger picture.
When we have a DevOps or traditional operations team, developers will be responsible for submitting requests for infrastructure or changes they need. The requests go into the queue.
Ideas, improvements, and sometimes requirements will come from a variety of sources. These four are the most important. And everything needs to successfully make it through the top row of our Value Flow in Figure 2.9, except for production outages…of course.
When a particular service within the API needs to interact with the API database, it calls a Repository function that provides all the normal database methods. The repository function would call the datastore functions for the type of database the API needs to use (relational or unstructured in our case).
Primary domains in an engineering platform. (From chapter 1)
Start by defining the primary boundary experience, which is our platform customers. Then, figure out where the underlying changes or capabilities lay, and when more than one team is involved, does an effective boundary experience exist between them?
The components of a Kubernetes Cluster. Courtesy: https://kubernetes.io/docs/concepts/overview/components/
shows an example of a pod sidecar. This container runs a process alongside the application container to enhance functionality. In this example, a sidecar continuously loads a new secret value from a vault so that the application will always get the latest updates when secrets are requested.

Summary

  • Successful delivery of a self-serve engineering platform happens through a Product Delivery Model.
  • Stakeholders have significantly different success criteria than do customers.
  • Shape the experience of using an engineering platform for customers of the platform.
  • Stakeholder needs can be met without destroying the customer experience.
  • Don’t underestimate the importance of tracking stakeholder impact on platform delivery. It can mean the difference between success and failure for the platform to provide a return on investment.
  • Optimize the product team's engineering practices and the organizational support for delivering a product.
  • Optimize the engineering process from ideation to platform operation, not around individual teams, technologies, or departments.
  • Features and capabilities should be delivered as service interfaces first.
  • Make final architecture and technologies decision only after realistic experimentation that measures the expected value.
  • Test experiences and implementation with alpha-users early
  • Find ways to introduce the fundamental aspects of new features before more complex implementations, using the feedback to influence next steps.
  • Try to start with a single engineering platform delivery team, with the initial squad setting the architectural foundations.
  • Grow the number of delivery teams within the platform product based on the most needed capabilities.
  • Adjacent teams within the organization can provide capabilities to the engineering platform only if they can also provide their service as-a-product and support the product roadmap of the platform.
  • The idea of a software-defined platform introduces more rigor into the SDLC process, using software engineering principles to increase the product's stability, scalability, extensibility, and maintainability.
  • Observability is an integral part of each story in the product delivery backlog, not something added on afterward.
  • Both the healthy and unhealthy behavior of each feature, capability, or technology of the platform should be easily observable.
  • Plan for evolutionary change, even in the way you manage the platform product backlog
  • Team engineering standards, such as test-driven development, include architecture fitness testing to support easier evolution.

FAQ

What is an Engineering Platform, and why is a self-serve experience essential?An Engineering Platform is an internal product that enables development teams to create, test, deploy, and operate software continuously while meeting quality, security, and compliance. Self-serve is essential because it eliminates ticket-driven delays and lets teams get what they need on demand through APIs, making developer experience the core measure of success.
How does a Product Delivery Model differ from traditional IT project approaches?Unlike one-off, tool-by-tool projects planned as capital purchases, a product delivery model treats capabilities as long-lived, continuously evolving products measured by user experience. Work ships in small increments, is optimized for change, and is prioritized by real usage and value—never by big-bang, Gantt-style rollouts.
What is Technical Product Ownership (TPO), and why is it critical for platform success?The TPO is the empowered, accountable product leader who represents developers, prioritizes the backlog, tracks measurable value, and adapts the roadmap based on actual usage. Without true product ownership, platforms drift into low-value, compliance-driven tools that miss developer needs.
How should we handle customers vs. stakeholders in platform engineering?Customers are the people who use the platform (primarily developers); stakeholders influence outcomes (e.g., security, finance, legal) but shouldn’t dictate implementations. Ask stakeholders to provide either self-serve APIs to their capabilities or clear, testable outcome standards. Use a stakeholder map (influence vs. alignment) to focus alignment work where it matters most.
What does “service interface first” mean, and why does it matter?Every capability is exposed as an API first; CLIs and UIs are layered on top. This enables automation, multi-tenant usability, and long-term flexibility. Cloud providers excel at API-first services, and when a stakeholder can’t offer an API, outcome-based standards let platform teams build one without breaking the product experience.
How should we start delivering the platform—what does MVP and early adopters look like?Ship the smallest valuable capability early, treat it as a hypothesis, and observe real usage to guide the next increment. Begin with a single, unified team and TPO to prove value, then scale to multiple teams under one product vision. Avoid “do-everything” first releases; most value comes from a few core paths.
What does it mean for a platform to be software-defined, and what SDLC practices are expected?“Software-defined” means everything—infra, policies, pipelines, configs, observability—is versioned and deployed via automation. A healthy SDLC includes: plan (with measurable outcomes), design (experiment first), code (tests first), build→test (pipeline through consistent commits), release (preview→non-prod→prod), observe, operate, repeat. Dogfood the same tools you provide to users.
What is Observability-Driven Development (ODD), and how does it complement TDD?ODD adds observability to acceptance criteria: define and automate metrics, logs, traces, events, and alerts that show both failures and healthy behavior. Instrument change events, make monitors part of “definition of done,” and use this data to diagnose faster and prove value—extending TDD from correctness to operability.
How do evolutionary architecture and fitness functions reduce migration and change risk?Make technology choices only after real-world experiments and design for changeability (e.g., abstract datastore access to swap databases or clouds). Fitness functions are automated tests that assert architectural rules (layer boundaries, required tests/monitors, cloud-agnostic interfaces), continuously guarding changeability and readiness.
Why do cloud-native standards like Kubernetes matter, and when should we build vs. buy?Kubernetes provides standardized APIs and extensibility (admission controllers, sidecars, meshes) to enforce governance with low friction—capabilities that serverless platforms often can’t extend. Prefer managed/SaaS for non-differentiating tech to minimize total cost of ownership and opportunity cost, and focus engineering time on strategic product value.

pro $24.99 per month

  • access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
  • choose one free eBook per month to keep
  • exclusive 50% discount on all purchases
  • renews monthly, pause or cancel renewal anytime

lite $19.99 per month

  • access to all Manning books, including MEAPs!

team

5, 10 or 20 seats+ for your team - learn more


choose your plan

team

monthly
annual
$49.99
$499.99
only $41.67 per month
  • five seats for your team
  • access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
  • choose another free product every time you renew
  • choose twelve free products per year
  • exclusive 50% discount on all purchases
  • renews monthly, pause or cancel renewal anytime
  • renews annually, pause or cancel renewal anytime
  • Effective Platform Engineering ebook for free
choose your plan

team

monthly
annual
$49.99
$499.99
only $41.67 per month
  • five seats for your team
  • access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
  • choose another free product every time you renew
  • choose twelve free products per year
  • exclusive 50% discount on all purchases
  • renews monthly, pause or cancel renewal anytime
  • renews annually, pause or cancel renewal anytime
  • Effective Platform Engineering ebook for free
choose your plan

team

monthly
annual
$49.99
$499.99
only $41.67 per month
  • five seats for your team
  • access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
  • choose another free product every time you renew
  • choose twelve free products per year
  • exclusive 50% discount on all purchases
  • renews monthly, pause or cancel renewal anytime
  • renews annually, pause or cancel renewal anytime
  • Effective Platform Engineering ebook for free