course bible
Seven Phases of Machine Learning
A first-year student should stay oriented; a graduate student should see the real machinery and the places where the field is still unsettled.
The course is designed as an interactive book: intuition first, then the
mathematical object, then a lab or demo, then the failure mode. A reader
should be able to move from first principles to modern AI systems without
losing the thread.
Historical Spine
The course treats history as a debugging aid. Each breakthrough changed
what researchers could optimize, measure, or scale.
1950 Alan Turing frames machine intelligence as a behavioral question: can a machine carry on an imitation game well enough to make the boundary hard to police?
1958 Frank Rosenblatt demonstrates the Mark I Perceptron, turning weighted votes from examples into a public symbol of learning machines.
1959 Arthur Samuel uses the phrase machine learning while building a checkers program that improves from play.
1986 Rumelhart, Hinton, and Williams make backpropagation central to neural-network training, showing how one loss can teach hidden layers.
1998 LeNet shows convolutional networks working at production scale for handwritten digit recognition.
2012 AlexNet wins ImageNet by a large margin and makes GPU-trained deep learning impossible to ignore.
2017 Attention Is All You Need introduces the transformer architecture that later becomes the workhorse for language, code, vision, and multimodal systems.
2020 Scaling-law work turns model size, data size, and compute into an empirical design surface rather than guesswork.
2022 Diffusion image models and conversational language models make generative AI visible to the public, raising the stakes for evaluation and alignment.