study guide
How to Work Through the Book
Do not read this course as trivia about AI. Read it as a sequence of
mental models, mathematical objects, and experiments. The goal is not to
remember every term. The goal is to see a new machine-learning system and
know what questions to ask.
Three Reading Tracks
First-year orientation
For readers who want the big picture before the math gets dense.
- Read the hook, idea, demo, takeaways, and coda first.
- Skip advanced GoDeeper boxes on the first pass.
- Return to the math details after you can explain the demo in your own words.
Graduate core
For readers who want the machinery behind the intuition.
- Work every ProblemSet without looking back at the prose.
- Run the browser labs and modify at least one assumption.
- For each phase, write the objective, assumptions, estimator, and failure mode.
Builder track
For readers who want to turn the course into working systems.
- Treat every demo as a specification for a small implementation.
- Replace toy data with a real dataset when the chapter ends.
- Use the projects page as a portfolio spine.
Phase Checks
At the end of each phase, pause and make the promise falsifiable. If you
cannot do the check without rereading the chapter, the concept is still
vocabulary, not skill.
- Phase 1 The Learning Loop
You can describe any supervised learning system using data, model, loss, optimizer, and generalization.
- Phase 2 Generalization and Measurement
You can design train/validation/test splits, choose metrics, and explain why held-out performance matters.
- Phase 3 Linear Models and Optimization
You can derive how a line becomes a classifier and how a local update changes the decision boundary.
- Phase 4 Deep Networks
You can explain how a scalar loss teaches many layers and why training stability is an engineering problem.
- Phase 5 Representations Across Space and Time
You can recognize the inductive bias behind CNNs, RNNs, and attention instead of memorizing architectures.
- Phase 6 Foundation Models
You can trace how next-token training becomes a general interface for language, images, retrieval, and tools.
- Phase 7 Agents, Alignment, and Evaluation
You can reason about policies, preference models, eval suites, reward hacking, and agent reliability.
Notation Ledger
Machine learning notation is overloaded across papers. This course uses
the following ledger unless a chapter says otherwise.
| Symbol | Meaning | Used in |
x | One input example or feature vector. | All supervised learning chapters. |
y | The target label, value, token, reward, or desired output. | Losses, metrics, generation. |
f(x; theta) | A model with parameters theta applied to input x. | Prediction, optimization, neural networks. |
theta | The parameters that training changes. | Gradient descent and model fitting. |
L(y_hat, y) | A loss for one prediction compared with one target. | Learning loop and optimization. |
R(f) | Expected risk, the average loss under the real data distribution. | Generalization. |
R_hat(f) | Empirical risk, the average loss on the sample we have. | Generalization. |
grad_theta L | The direction each parameter should move to change the loss. | Gradient descent and backpropagation. |
W, b | Weights and bias in a linear layer or neuron. | Linear models and neural networks. |
h | Hidden state or hidden representation. | Deep networks, RNNs, transformers. |
q, k, v | Query, key, and value vectors used by attention. | Attention and transformers. |
pi(a | s) | A policy giving action probabilities in a state. | Reinforcement learning and agents. |
Comprehensive Exam Prompts
These prompts are the course's exit criteria. A strong answer should use
concrete examples, the right metric, and at least one failure mode.
- Given a new ML problem, write the data schema, model family, loss, metric, and evaluation split.
- Explain why a lower training loss can be bad news.
- Trace one prediction through a neural network forward pass and one error through the backward pass.
- Compare CNNs, RNNs, and attention by the structure each assumes about the input.
- Design a retrieval or tool-using AI system and name where it can fail.
- Write an eval plan that would catch both capability regressions and unsafe behavior.