Neural-Path
Math · ML · AI

The foundations of ML, documented while learning.

From linear algebra and probability theory to production AI — the math that actually underlies the models, the code that ships, and the gaps that textbooks skip. Free to read.

0Notes
0Modules
FreeAlways

164 notes. 26 modules. 5 parts.

From linear algebra and probability theory through production AI — organized as a proper curriculum.

Data Engineering

20 notes
MODULE 01

Data Infrastructure & Engineering

12 notes

  • Data Systems Landscape
  • The Modern Data Warehouse
  • Data Lakes & Lakehouses
  • Data Ingestion & CDC
  • Batch Pipelines & ELT
  • Data Modeling
  • Stream Processing & Real-Time Data
  • Pipeline Orchestration
  • Data Quality & Testing
  • Lineage, Governance & Contracts
  • Data Observability
  • Data Mesh & Platform Thinking
MODULE 02

ML/AI Data Engineering

8 notes

  • Feature Stores
  • Training Data at Scale
  • Dataset Versioning & Reproducibility
  • ML Metadata & Lineage
  • LLM Dataset Construction
  • Vector Databases
  • ML Data Pipelines
  • Data Flywheels & Feedback Loops

Mathematics

68 notes
MODULE 03

Linear Algebra & Matrix Analysis

8 notes

  • Vector Spaces & Linear Maps
  • Inner Products, Norms & Orthogonality
  • Eigenvalues, Eigenvectors & Diagonalization
  • The Spectral Theorem & Symmetric Matrices
  • SVD, QR & LU Decompositions
  • Positive Semidefinite Matrices & Quadratic Forms
  • Linear Systems & Least Squares
  • Matrix Calculus & Differentiation
MODULE 04

Multivariate Calculus & Differential Geometry

6 notes

  • Topology Primer: Metric Spaces, Continuity & Compactness
  • Differentiation in Rⁿ: Jacobians, Hessians & the Chain Rule
  • Integration in Rⁿ: Fubini, Change of Variables & Surface Integrals
  • Smooth Manifolds & Tangent Spaces
  • Riemannian Geometry: Metrics, Geodesics & Curvature
  • Bridge: Loss Landscapes, Natural Gradient & Equivariant Networks
MODULE 05

Convex Analysis & Optimization Theory

6 notes

  • Convex Sets & Functions: Definitions, Examples & Closure Properties
  • Duality: Lagrangians, KKT Conditions & Strong Duality
  • Gradient Methods: Convergence Rates & Information-Theoretic Lower Bounds
  • Proximal Methods, ADMM & Operator Splitting
  • Non-Convex Landscapes: Saddle Points, Spurious Minima & Escape
  • Bridge: Adam, Learning Rate Theory & Neural Loss Landscape Analysis
MODULE 06

Probability Theory

6 notes

  • Measure Theory Primer: σ-Algebras, Measures & Lebesgue Integration
  • Probability Spaces, Random Variables & Distributions
  • Expectation, Moments, Characteristic Functions & Generating Functions
  • Modes of Convergence: Almost Sure, In Probability, Lp & In Distribution
  • Limit Theorems: LLN, CLT & Berry-Esseen
  • Bridge: Concentration Inequalities (Hoeffding, Bernstein, Sub-Gaussian) & Generalization Bounds
MODULE 07

Statistical Inference & Learning Theory

6 notes

  • Estimation Theory: MLE, Sufficiency, Fisher Information & Cramér-Rao
  • Hypothesis Testing: Neyman-Pearson, Likelihood Ratio Tests & Multiple Testing
  • Bayesian Inference: Priors, Posteriors & Conjugacy
  • PAC Learning, VC Dimension & Rademacher Complexity
  • High-Dimensional Statistics: Sparsity, RIP & Compressed Sensing
  • Bridge: Double Descent, Implicit Regularization & Modern Generalization Theory
MODULE 08

Information Theory

5 notes

  • Entropy, Mutual Information & the Information Hierarchy
  • KL Divergence, f-Divergences & Total Variation Distance
  • Data Processing Inequality & Sufficient Statistics
  • Channel Capacity & Shannon's Coding Theorems
  • Bridge: ELBO & VAEs, Contrastive Learning & Rate-Distortion as Compression
MODULE 09

Stochastic Processes

6 notes

  • Discrete-Time Markov Chains: Stationarity, Ergodicity & Mixing Times
  • Continuous-Time Markov Chains & Poisson Processes
  • Martingales: Optional Stopping & Doob's Inequalities
  • Brownian Motion & Gaussian Processes
  • Itô Calculus & Stochastic Differential Equations
  • Bridge: MCMC, Langevin Dynamics & Diffusion Models as SDEs
MODULE 10

Numerical Methods & Scientific Computing

5 notes

  • Floating Point Arithmetic, Numerical Stability & Condition Numbers
  • Direct Linear Solvers: LU, Cholesky & QR Factorizations
  • Iterative Solvers: Conjugate Gradient & Krylov Methods
  • Automatic Differentiation: Forward Mode, Reverse Mode & Computation Graphs
  • Bridge: Autodiff in PyTorch/JAX, Mixed Precision & Numerical Stability in Training
MODULE 11

Operations Research

5 notes

  • Linear Programming: Simplex Method, Geometry & LP Duality
  • Integer Programming: Branch & Bound & Cutting Planes
  • Network Flows: Max-Flow, Min-Cost & Transportation Problems
  • Combinatorial Optimization: Matching, Approximation Algorithms & Complexity
  • Bridge: Dynamic Programming, Bellman Equations & Neural Architecture Search
MODULE 12

Game Theory & Mechanism Design

5 notes

  • Normal Form Games, Nash Equilibria & Mixed Strategies
  • Extensive Form Games, Backward Induction & Subgame Perfection
  • Zero-Sum Games & the Minimax Theorem
  • Mechanism Design: Revelation Principle, VCG & Auction Theory
  • Bridge: GANs as Zero-Sum Games, Multi-Agent RL & RLHF as Mechanism Design
MODULE 13

Functional Analysis & Operator Theory

5 notes

  • Normed & Banach Spaces: Completeness, Compactness & the Hahn-Banach Theorem
  • Hilbert Spaces: Inner Products, Orthonormal Bases & Riesz Representation
  • Bounded Linear Operators: Adjoints, Spectrum & Compact Operators
  • Reproducing Kernel Hilbert Spaces: Mercer's Theorem & Feature Maps
  • Bridge: Kernel Methods, Attention as Operators & Neural Operators (FNO)
MODULE 14

Graph Theory & Combinatorics

5 notes

  • Graph Fundamentals: Connectivity, Trees, Planarity & Colorings
  • Spectral Graph Theory: Laplacians, Eigenvalues & the Cheeger Inequality
  • Random Graphs: Erdős–Rényi, Phase Transitions & Small-World Networks
  • Generating Functions, Combinatorial Enumeration & the Probabilistic Method
  • Bridge: Spectral Clustering, GNN Expressivity & the Weisfeiler-Leman Hierarchy

Machine Learning

22 notes
MODULE 15

Classical ML Foundations

7 notes

  • Linear & Logistic Regression
  • Decision Trees & Ensembles
  • Feature Engineering
  • Train / Validation / Test Splits
  • Bias-Variance Tradeoff
  • Gradient Descent
  • Gradient Boosting & Tabular ML
MODULE 16

NLP Essentials

5 notes

  • Tokenization & BPE
  • Text Preprocessing
  • N-gram Language Models
  • Classical Text Classification
  • Sequence Labeling
MODULE 17

Deep Learning

10 notes

  • Neural Networks & Backprop
  • Regularization Techniques
  • Training Dynamics
  • Convolutional Neural Networks
  • RNNs & LSTMs
  • Embeddings
  • Attention & Transformers
  • BERT & Encoder Models
  • Seq2Seq & T5
  • Mixture of Experts

Modern AI

36 notes
MODULE 18

Alignment & Safety

7 notes

  • SFT: Supervised Fine-Tuning
  • RLHF & PPO
  • DPO: Direct Preference Optimization
  • GRPO: Group Relative Policy Optimization
  • Constitutional AI
  • Red-Teaming & Evaluation
  • The Alignment Tax
MODULE 19REQUIRES ANTHROPIC_API_KEY

LLMs in Production

11 notes

  • Claude API & SDK
  • Prompt Engineering
  • Structured Outputs
  • RAG Systems
  • Advanced RAG
  • Agents & Tool Use
  • Fine-Tuning in Practice
  • Evals Framework
  • LLM-as-Judge & Regression Evals
  • Latency & Cost Optimization
  • Deployment & Serving
MODULE 20

Agent Engineering

6 notes

  • Agent Patterns
  • Tool Use & MCP
  • Multi-Agent Systems
  • Planning & Reasoning
  • Agent Evals & Reliability
  • Agents in Production
MODULE 21

Vision & Multimodal

8 notes

  • Vision Transformers (ViT)
  • CLIP & Contrastive Learning
  • Vision-Language Models
  • Object Detection & Segmentation
  • Diffusion Models
  • Latent Diffusion & Guided Generation
  • Video Understanding
  • Video Generation
MODULE 22

Specialized Topics

4 notes

  • Recommender Systems
  • Graph Neural Networks
  • Time Series & Forecasting
  • Reinforcement Learning Fundamentals

Measurement

18 notes
MODULE 23

Experimentation

4 notes

  • Hypothesis Testing
  • Experimental Design
  • Validity Threats
  • Adaptive Experiments & Bandits
MODULE 24

Causal Inference

4 notes

  • Potential Outcomes & DAGs
  • Quasi-Experimental Methods
  • Propensity Scores & Modern Methods
  • Heterogeneous Treatment Effects
MODULE 25

Advanced Experiment Design

5 notes

  • Geo Testing & Market Holdouts
  • Switchback Experiments
  • Network Experiments & Interference
  • Always-Valid Sequential Testing
  • Long-run Measurement & Holdout Groups
MODULE 26

Modern Causal Methods

5 notes

  • Synthetic Control
  • Staggered DiD & Modern Estimators
  • Double ML & Debiased Estimation
  • Causal Forests & CATE Estimation
  • Sensitivity Analysis & Robustness
About

Written while learning. Shared openly.

Rachel Z.

Rachel Z.

ML Practitioner

These notes started as margin annotations working through ML — the proofs I had to re-derive, the intuitions that finally clicked, the production patterns nobody writes down. They grew into something more systematic: 5 parts covering the math, the models, and the production layer, shared in case they're useful to anyone else.

"I write these for the version of me that hit a wall trying to understand why the math actually matters."

164

Notes

26

Modules

Free

Always

Rachel Z.

Rachel Z.

ML Practitioner

model.fit(X_train, y_train)
accuracy = model.score(X_test)