Build AI into Your Web Apps you own this product

Classic methods with modern AI tools

Theo Despoudis

MEAP began December 2024
Last updated December 2025
Publication in March 2026 (estimated)

ISBN 9781633436084
375 pages (estimated)

Included with a Manning Online subscription

printed in black & white

catalog / Data Science / AI

resources: Source code Book forum Source code on GitHub

table of content

Part 1: Building basic generative AI web apps

1 Using generative AI in web apps

1.1 What generative AI can do for web applications

1.1.1 Gen AI capabilities

1.1.2 Real-world uses of generative AI

1.2 How a generative AI web app works

1.2.1 Core components

1.2.2 The flow of user interactions

1.3 AI tools and ecosystem

1.3.1 Key technologies creating AI-powered apps

1.3.2 Choosing the right model

1.4 Generative vs traditional AI

1.5 Handling the concerns and implications of generative AI

1.5.1 What are the limitations of generative AI?

1.5.2 Will developers lose jobs because of AI?

1.5.3 Are generative AI outputs reliable?

1.6 References

1.7 Summary

2 Building your first generative AI web application

2.1 Project goal and requirements

2.1.1 Goal: Building a simple interactive AI chat interface

2.1.2 Project requirements and needed technologies

2.1.3 Setting up

2.1.4 Running the project

2.2 Under the hood: The generative AI lifecycle

2.2.1 Designing for a better user experience (UX)

2.3 Building the major components

2.3.1 Frontend

2.3.2 Autoscroll

2.3.3 ChatPage

2.3.4 ChatList

2.3.5 The backend: handling API communication

2.3.6 Tests

2.3.7 Common challenges and solutions

2.4 Assessing the app’s first iteration

2.4.1 Memory and context

2.4.2 UI performance

2.4.3 Security

2.4.4 Improved tooling

2.5 Migrating the app to Next.js

2.5.1 Requirements, set up, and running the project with Next.js

2.6 Routing and configuration on Next.js

2.6.1 Environment variables in next.js

2.6.2 Route groups

2.6.3 Layout components

2.6.4 Route API handlers

2.6.5 Going deeper with Next.js

2.7 Summary

3 Connecting AI models with the Vercel AI SDK

3.1 Introduction to the Vercel AI SDK

3.1.1 Key features and benefits

3.1.2 A strategic approach to integration

3.1.3 Practical Integration: Vercel AI SDK with Astra AI

3.2 Handling streaming responses with the Vercel AI SDK

3.2.1 Challenges and how the SDK solves streaming in web applications

3.2.2 Implementing streaming with Vercel AI SDK

3.2.3 Integrating streaming into Astra AI

3.3 Working with multiple AI providers

3.3.1 Handling different AI providers and models

3.3.2 Leveraging the Vercel AI SDK’s interoperability

3.3.3 Astra AI project: Integrating multiple AI providers and models

3.4 Enhancing conversational UIs with multimedia content

3.4.1 Introduction to OpenAI’s vision capabilities

3.4.2 Astra AI project: Integrating Gemini vision queries

3.5 Summary

4 Managing conversation and state in your application

4.1 AI SDK React Server Components

4.1.1 Overview of React Server Components

4.1.2 Using server actions for AI-powered RSC

4.1.3 Updating the UI to leverage server actions

4.1.4 Techniques for generating and streaming UI components

4.1.5 Create streamable UI components from LLM providers with streamUI

4.1.6 Streaming React components with createStreamableUI

4.2 Managing UI state in AI-powered applications

4.2.1 Separating AI and UI state in React/Next.js applications

4.2.2 Key components for UI state management

4.2.3 Implementing UI state management patterns

4.3 Structured data generation using Vercel AI SDK

4.3.1 How structured data generation works

4.3.2 Techniques for generating structured data from AI responses

4.3.3 Tools for implementing type-safe AI-generated content

4.3.4 Integrating structured data generation into our web application

4.4 Tool and function calling with AI models

4.4.1 Understanding tool calling and function calling in AI models

4.4.2 Implementing custom tools and functions with Vercel AI SDK

4.5 Summary

Part 2: Advanced generative AI techniques for web apps

5 Prompt engineering in web applications

5.1 Introduction to prompt engineering

5.1.1 What exactly are prompts?

5.1.2 Prompt types

5.1.3 Organizing your prompts: versioning, testing, and optimization

5.2 Few-shot learning

5.2.1 Examples of few-shot learning

5.2.2 General methodology for creating few-shot learning prompts

5.3 Chain of thought prompting: A deeper dive into reasoning

5.3.1 Example of chain-of-thought prompting

5.3.2 General methodology for creating chain-of-thought prompts

5.4 Embeddings: giving AI a sense of meaning

5.4.1 The restaurant menu analogy: A taste of embeddings

5.4.2 Using embeddings in practice: Vercel AI SDK

5.4.3 Use case: IT Support Knowledge Base

5.5 Going deeper into LLM techniques

5.5.1 Tree of Thoughts (ToT)

5.5.2 Self-Refine

5.5.3 LLM-as-a-Judge

5.6 Summary

6 Building AI workflows with Langchain.js

6.1 Introduction to LangChain

6.1.1 Chaining calls with LangChain

6.1.2 Integration with Vercel AI SDK

6.2 Preparing and storing documents for retrieval using LangChain

6.2.1 Document ingestion using text splitters

6.2.2 Introduction to vector stores

6.2.3 Document retrieval

6.2.4 Full example of preparing and storing documents with LangChain

6.3 Leveraging memory components in LangChain to remember conversation history

6.4 Utilizing agents in LangChain.js

6.4.1 How LangChain agents work

6.4.2 Creating an agent using LangChain.js

6.4.3 Agent integration with Vercel AI SDK

6.4.4 Overview of LangChain.js modules

6.5 Going Deeper with LangChain.js

6.5.1 LangChain Expression Language (LCEL)

6.5.2 LangGraph

6.6 Summary

7 Document summarization and RAG with Langchain.js

7.1 Building a document summarization web application with LangChain.js

7.1.1 Summarization app project requirements

7.1.2 Architecture and workflow

7.1.3 Building the document summarization web application

7.1.4 Caveats and limitations of document summarization

7.1.5 Demonstration of the app

7.1.6 Additional considerations for summarizing documents

7.2 Building a RAG web application with LangChain.js

7.2.1 RAG app project requirements

7.2.2 Key architectural components of RAG

7.2.3 Technical architecture overview

7.2.4 RAG system components

7.2.5 Web app demonstration

7.2.6 Adding grounding support

7.3 Summary

8 Testing and debugging techniques

8.1 Debugging Next.js AI applications

8.1.1 Debugging common Next.js rendering Issues

8.1.2 Debugging client-server problems

8.1.3 Handling state management

8.1.4 Performance monitoring

8.2 Vercel AI SDK troubleshooting

8.2.1 Handling error states in AI-generated Content

8.2.2 Managing token limits and rate limiting

8.3 Troubleshooting LangChain.js

8.3.1 Chain execution errors

8.3.2 Troubleshooting model integration issues

8.4 Testing strategies for AI applications

8.4.1 Unit and integration testing in React + Next.js

8.4.2 Mocking LLM responses

8.4.3 Testing Vercel AI SDK responses

8.4.4 Testing Langchain.js

8.5 Summary

9 Deployment and security

9.1 Building a secure foundation with input validation, rate limits, and middleware

9.1.1 Input validation

9.1.2 Security middleware layer

9.2 Building a core security & data protection pipeline

9.3 Setting up authentication and authorization

9.3.1 Simple authentication with Clerk.js and Next.js

9.3.2 Practical security control: Rate limiting

9.4 API key and secrets management

9.4.1 Understanding Next.js environment variables

9.5 Data protection and compliance

9.6 Deployment considerations for AI web applications

9.6.1 Deployment options

9.6.2 Production deployment checklist

9.6.3 Example deployment to Vercel

9.6.4 Alternative deployments: Netlify

9.6.5 Alternative deployments: Hugging Face Spaces

9.6.6 Next steps

9.7 Summary

Part 3: Hands-on projects

10 Build an AI Interview assistant: Project walkthrough

10.1 Overview of the application

10.1.1 Key features

10.1.2 Technical implementation

10.1.3 Technology stack overview

10.2 Security measures implemented

10.3 Challenges during development

10.3.1 State management considerations

10.3.2 Text-to-Speech integration

10.3.3 Generating feedback

10.4 Additional considerations and improvements

10.5 Summary

11 Build an AI RAG Agent: Project walkthrough

11.1 Overview of the application

11.1.1 Key features

11.1.2 Technical implementation

11.1.3 Technology stack overview

11.2 Challenges during development

11.2.1 Shared vs. dedicated user data in vector stores

11.2.2 Security considerations around document management and heavy workloads

11.2.3 API Design and URL structure to minimize information exposure

11.3 Additional thoughts on AI and the future of web development

11.4 Summary

Part 4: Advanced integrations and the future of AI

== 12 Integrating web apps with the Model Context Protocol

12.1 Why the model context protocol matters for AI integration

12.2 MCP architecture

12.3 Connecting Next.js and the Vercel AI SDK with MCP

12.3.1 Architecture overview

12.3.2 Building an end-to-end integration with MCP in Next.js

12.3.3 Benefits of using MCP for web applications with LLMs

12.4 Inside an MCP Server: Extending web applications

12.4.1 MCP server structure

12.4.2 Additional considerations for MCP servers

12.5 Integrating MCP Servers with Langchain.js

12.5.1 Architecture overview

12.5.2 Building an end-to-end integration with LangChain.js

12.6 The future of MCP: Gateways, directories, and MCP-as-a-service

12.6.1 MCP Gateways

12.6.2 MCP-as-a-service

12.6.3 MCP directories and registries

12.7 Your next steps with MCP servers

12.8 Summary

Appendix

Appendix A: Running the examples

A.1 Running Examples

A.2 Accessing OpenAI APIs

A.3 Accessing Google AI APIs

A.4 Accessing Upstash Redis Database

A.5 Integrate Clerk.js authentication

Overview

1 Using generative AI in web apps

Generative AI web apps weave advanced models—especially large language models—into the browser experience to produce text, images, audio, and video on demand. This unlocks conversational interfaces, adaptive workflows, and personalized content that go beyond static logic. The chapter frames the book’s goal: help developers with basic JavaScript and React skills build production-grade AI features, integrating leading providers and patterns while staying focused on practical, real-world outcomes.

It explains how these apps operate end to end: users interact through UI components; backends clean and route inputs; the system selects models; content is generated and returned, often with a feedback loop. Core pieces include model access, conversational UIs, resilient backend infrastructure (caching, serverless, containers, and model serving), data pipelines, API integrations, and deployment for scale. The stack centers on React and Next.js with the Vercel AI SDK, plus LangChain.js for RAG and agentic patterns, and popular models like Gemini and OpenAI. The chapter also guides model selection—covering types such as transformers, GANs, and autoregressive models—alongside trade-offs between pre-trained services and self-hosting, and the performance considerations that shape UX.

Capabilities span text, image, multimedia, and code generation, with concrete use cases like marketing content toolboxes, customer support chatbots with dynamic responses, and mock interview agents with speech interfaces and adaptive coaching. The chapter balances enthusiasm with responsibility: it highlights quality control, security, cost, and regulatory compliance; techniques to validate outputs and reduce hallucinations; bias assessment and mitigation; and UX practices such as streaming and multimodal inputs to keep interactions fast and clear. It closes by noting the workforce impact of automation while emphasizing that developers who embrace these tools can focus on higher-level, creative work—and build reliable, user-centered AI applications.

The flow of information and interactions between the key components of a generative AI web application.

How an AI web app works: users input data, the app processes it, selects a model, generates content, delivers it, and optionally collects feedback.

Simplified architecture diagram of a web application ecosystem. Clients, including web browsers and mobile devices, interact with the core application service, which handles user requests and business logic. The service interacts with a database to store and manage application data. Additionally, the service communicates with external APIs to access additional functionality and interacts with external services utilized by the application.

Leveraging key technologies to create generative AI web applications

How AI can be used to detect whether a picture of a cat is a cat or not. It accepts an image as input and responds with yes or no (or 0 and 1).

Summary

Generative AI can generate not only text, but all sorts of media resources like images, video clips and audio. This greatly enhances their potential usage in web applications, and real-world uses of generative AI in web applications range from digital marketing and customer experience management to mock interview applications.
Generative AI web apps center on powerful models like large language models (LLMs) to create content from user input. The apps require a full supporting ecosystem to integrate with the model, including UI and conversational AI components, backend infrastructure, data processing pipelines, API integration, and deployment and scaling mechanisms.
The apps we build in this book will use JavaScript and React to display the UI interface components, along with Next.js and the Vercel AI SDK to manage the backend and interact with external AI service providers.
Choosing the right model for an app is a key architectural decision and depends on the task required. Different model types ( such as LLMs, GANs, autoregressive, transformers, VAE, and RNNs) excel at different kinds of problems. But the model architecture is just one consideration; developers also need to consider the quality and type of data it was trained on.
Software engineers have been using AI long before generative AI came into existence. Common applications include machine learning, search recommendations, chatbots and computer vision.
Foundational research like Google's "Attention is All You Need" laid the groundwork for transformative technologies such as transformers, which simplified natural language processing tasks by leveraging attention mechanisms. Transformers revolutionized language modeling by improving efficiency and accuracy in understanding textual data, addressing long-standing challenges faced by traditional AI models.
Limitations of generative AI include quality control issues, resource intensiveness, security concerns, and regulatory compliance. Concerns include its potential impact on jobs, the reliability of outputs, handling bias, and enhancing the user experience.

FAQ

What are generative AI web applications and what can they do?

They are web apps that integrate advanced AI models—most commonly large language models (LLMs)—to generate text, images, audio, or video on the fly. This enables conversational interfaces, intelligent automation, personalization, and entirely new interactive experiences beyond predefined logic or static content.

How do generative AI apps compare to traditional AI systems?

Traditional AI often classifies or recognizes patterns (e.g., “cat vs. not-cat”). Generative AI goes further by learning underlying patterns well enough to produce new content that resembles its training data. Modern transformer-based models with self-attention make this possible by capturing long-range context to generate coherent, contextual outputs.

How does a generative AI web app work from user input to response?

Typical flow: 1) User input via the UI (text, voice, images, selections). 2) Backend processing (cleaning, feature extraction, model selection). 3) Content generation by a selected model, possibly enhanced by techniques like RAG. 4) Response delivery to the UI, with an optional feedback loop to refine results and improve future performance.

What are the core components of a generative AI web app?

Key pieces include: LLMs and other AI models; UIs and conversational components (chatbots/agents); backend infrastructure (caching, containers/orchestration, serverless, model serving); data processing (pre/post-processing and formatting); external API integrations; and deployment/scaling for reliability and performance.

Which tools and frameworks does the book use?

The book builds apps with React and Next.js, integrates AI via the Vercel AI SDK, and uses LangChain.js for chains, agents, and RAG. It primarily leverages Google Gemini models, with occasional OpenAI usage, and introduces the Model Context Protocol (MCP) for secure, standardized access to external tools and data sources.

How should I choose the right model and provider?

Decide by: - Model type (LLMs, transformers, GANs, VAEs, etc.) aligned to your task. - Pre-trained APIs vs. self-hosted models (ease of use vs. control/customization and cost). - Performance needs (latency, resource usage, cost). Also consider your UI/input style, pre-processing, and any post-processing required for user-ready results.

What real-world use cases does the chapter highlight?

Examples include: - Digital marketing tools for text-to-image, image-to-image (e.g., DALL‑E, CycleGAN/Pix2Pix). - Customer experience management with generative chatbots and sentiment-aware responses. - A mock interview app with AI interviewer agents, speech-to-text, adaptive difficulty, and personalized feedback.

What are the main limitations and risks to plan for?

Key concerns: - Quality control (accuracy, coherence, relevance; hallucinations). - Resource intensiveness (training/serving costs, GPU needs). - Security and misuse (fake content, impersonation, misinformation). - Regulatory compliance (privacy, IP, PII; GDPR/CCPA obligations for consent, storage, deletion).

Are AI outputs reliable? How do we validate and reduce bias?

Outputs are probabilistic and can be inaccurate or biased. Mitigations include: setting clear objectives and context, tuning model parameters, cross-validating results, constraining knowledge sources (e.g., RAG), and running bias audits with diverse test inputs. The book takes a hands-on approach to validation and bias mitigation.

Will developers lose jobs because of AI—and how should they adapt?

Some tasks (code generation, docs, tests, reviews) will be automated, shifting the role rather than eliminating it. Developers who embrace tools like LLMs, focus on higher-value design and problem-solving, and integrate AI responsibly can stay relevant and amplify their impact.

pro $24.99 per month

access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
choose one free eBook per month to keep
exclusive 50% discount on all purchases
renews monthly, pause or cancel renewal anytime

lite $19.99 per month

access to all Manning books, including MEAPs!

team

5, 10 or 20 seats+ for your team - learn more

eBook

pdf, ePub, online

$47.99 $35.99

you save $12.00 (25%)

pro $24.99 per month

access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
choose one free eBook per month to keep
exclusive 50% discount on all purchases
renews monthly, pause or cancel renewal anytime

lite $19.99 per month

access to all Manning books, including MEAPs!

team

5, 10 or 20 seats+ for your team - learn more

eBook

$47.99 $35.99

you save $12.00 (25%)

eBook

pdf, ePub, online

$47.99 $35.99

you save $12.00 (25%)

pro $24.99 per month

access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
choose one free eBook per month to keep
exclusive 50% discount on all purchases
renews monthly, pause or cancel renewal anytime

lite $19.99 per month

access to all Manning books, including MEAPs!

team

5, 10 or 20 seats+ for your team - learn more