Elliot Arledge

Elliot Arledge created the 12-hour CUDA course and the 6-hour LLM from Scratch course for freeCodeCamp, and he consults on deep learning performance.

Books by Elliot Arledge

CUDA for Deep Learning

  • MEAP began January 2026
  • Last updated May 2026
  • Publication in Fall 2026 (estimated)
  • ISBN 9781633434899
  • 375 pages (estimated)
  • Printed in black & white

CUDA (Compute Unified Device Architecture) provides a parallel programming model that AI engineers can use to tap the massive processing power of NVIDIA GPUs. Working at the GPU level gives you direct control over kernels, visibility for debugging, and speedups that higher-level optimizations can’t match.

CUDA for Deep Learning shows you how to work within the CUDA ecosystem, from your first kernel to implementing advanced LLM features like Flash Attention. You’ll learn to profile with Nsight Compute, identify bottlenecks, and understand why each optimization works. By solving problems at multiple levels of abstraction, you’ll develop a deep understanding of CUDA along with practical kernel-building skills. Written for the latest NVIDIA hardware, the book focuses on CUDA fundamentals that will stay relevant as chips evolve.