MODULE 00
Getting Started
Set up your local environment on any OS to run all code in this course.MODULE 01
Data Infrastructure & Engineering
From raw business events to analytical systems: warehouses, lakehouses, pipelines, streaming, orchestration, quality, and governance.MODULE 02
ML/AI Data Engineering
Feature stores, training data at scale, dataset versioning, LLM data pipelines, vector databases, and production feedback loops.MODULE 03
Linear Algebra & Matrix Analysis
Vector spaces, spectral theory, matrix decompositions, and the geometric machinery behind every ML model — developed from first principles.MODULE 04
Multivariate Calculus & Differential Geometry
Gradients, Hessians, manifolds, and Riemannian geometry — the geometric foundation of learning.MODULE 05
Convex Analysis & Optimization Theory
Duality, KKT conditions, proximal methods, and the geometry of non-convex loss landscapes.MODULE 06
Probability Theory
Measure-theoretic probability, distributions, convergence modes, limit theorems, and concentration inequalities.MODULE 07
Statistical Inference & Learning Theory
MLE, Bayesian inference, PAC learning, VC dimension, and the theory behind why models generalize.MODULE 08
Information Theory
Entropy, mutual information, KL divergence, channel capacity, and the deep connection between compression and intelligence.MODULE 09
Stochastic Processes
Markov chains, martingales, Brownian motion, Itô calculus, and stochastic differential equations.MODULE 10
Numerical Methods & Scientific Computing
Floating point, direct and iterative solvers, automatic differentiation, and numerical stability in practice.MODULE 11
Operations Research
Linear programming, integer programming, network flows, combinatorial optimization, and dynamic programming.MODULE 12
Game Theory & Mechanism Design
Nash equilibria, minimax, mechanism design, and the multi-agent foundations of GANs and RLHF.MODULE 13
Functional Analysis & Operator Theory
Hilbert and Banach spaces, bounded operators, RKHS, and the operator-theoretic view of attention and kernels.MODULE 14
Graph Theory & Combinatorics
Spectral graph theory, random graphs, the probabilistic method, and the combinatorial foundations of GNNs.MODULE 15
Classical ML Foundations
Linear models, trees, and the core concepts every ML engineer must know.MODULE 16
NLP Essentials
Classical NLP from tokenization through n-gram language models, text classification, and sequence labeling — the pre-neural foundations every practitioner needs.MODULE 17
Deep Learning
Neural networks from backprop to Transformers.MODULE 18
Alignment & Safety
RLHF, Constitutional AI, red-teaming, and the alignment tax.MODULE 19
LLMs in Production
Claude API, RAG, agents, evals, and deployment at scale.MODULE 20
Agent Engineering
ReAct, tool use, multi-agent systems, planning, and production reliability for LLM agents.MODULE 21
Vision & Multimodal
From pixels to perception to generation: ViT, CLIP, VLMs, diffusion models, and video architectures.MODULE 22
Specialized Topics
Recommender systems, graph neural networks, time series, and reinforcement learning — specialized architectures powering production systems.MODULE 23
Experimentation
The statistics and design of A/B tests: hypothesis testing, variance reduction, and validity threats.MODULE 24
Causal Inference
When experiments fail: potential outcomes, DAGs, DiD, RDD, IV, and modern observational methods.MODULE 25
Advanced Experiment Design
Geo holdouts, switchback designs, network experiments, always-valid inference, and long-run measurement — for when standard A/B testing breaks down.MODULE 26
Modern Causal Methods
Synthetic control, staggered DiD, Double ML, causal forests, and sensitivity analysis — the state-of-the-art toolkit for rigorous causal estimation at scale.