Stanford CS 229: Machine Learning

CS 229 is Stanford's graduate-level machine learning course, made world-famous by Andrew Ng's recorded lectures. It covers generalized linear models, SVMs and kernels, deep learning foundations, unsupervised learning, and learning theory. It's both a campus rite of passage and one of the most self-studied courses on the internet.

Fennie is independent and not affiliated with Stanford University. This is an unofficial study guide.

What makes it hard

It's a math course wearing an ML costume: problem sets are dominated by derivations in linear algebra, multivariable calculus, and probability, and students expecting an applied modeling class get flattened by pset one. Self-learners hit the same wall in a different form: the lectures are easy to watch, but the understanding comes from doing the problem sets, which most online attempts skip.

What you'll cover

• Linear and logistic regression, GLMs
• Support vector machines and kernels
• Deep learning foundations
• Generalization and regularization
• Unsupervised learning: k-means, EM, PCA
• Learning theory basics

The CS 229 study guide

How to study for Stanford CS 229, step by step.

1. Audit the math prerequisites honestly
Matrix calculus, eigenvectors, multivariate Gaussians, and probability manipulation are used without slowing down. Spend the first two weeks patching whichever of these is rusty. Pset one will audit them for you otherwise.
2. Re-derive lecture results by hand
Following a derivation in lecture and producing it cold are different abilities, and the psets test the second. After each lecture, close the notes and re-derive the key results.
3. Do the problem sets, especially if self-studying
The lectures are famous but the understanding lives in the psets. Watching without deriving produces ML vocabulary, not ML ability. Schedule the problem work like it's the course, because it is.
4. Build a model-assumptions map
For every algorithm, record what it assumes, when it breaks, and its relationship to the others (logistic regression as a GLM, SVMs through the kernel lens). Exam questions live in those connections.
5. Implement the core algorithms once each
A from-scratch gradient descent, logistic regression, and k-means cement what the equations mean. Keep implementations small; the goal is understanding, not a framework.
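
To give a sense of scale for step 5, here is a minimal sketch of what "small" can look like: logistic regression fit by batch gradient descent, plus k-means (Lloyd's algorithm), in plain Python with no libraries. Everything in it is an illustrative assumption rather than course material: the function names (`fit_logistic_regression`, `predict_proba`, `kmeans`), the learning rate, the iteration counts, and the toy data. A real attempt would more likely use NumPy; the standard library is used here only to keep the sketch self-contained.

```python
import math
import random


def sigmoid(z):
    """Logistic function, split by sign to avoid overflow in exp()."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def predict_proba(theta, x):
    """Modeled P(y = 1 | x) for one feature vector x."""
    return sigmoid(sum(t * v for t, v in zip(theta, x)))


def fit_logistic_regression(X, y, lr=0.5, n_iters=2000):
    """Batch gradient descent on the average negative log-likelihood.

    X: list of feature lists (include a constant 1.0 for an intercept).
    y: list of 0/1 labels. lr and n_iters are arbitrary illustrative values.
    """
    n, d = len(X), len(X[0])
    theta = [0.0] * d
    for _ in range(n_iters):
        grad = [0.0] * d
        for xi, yi in zip(X, y):
            err = predict_proba(theta, xi) - yi  # h(x) - y
            for j in range(d):
                grad[j] += err * xi[j] / n
        theta = [t - lr * g for t, g in zip(theta, grad)]
    return theta


def kmeans(points, k, n_iters=100, seed=0):
    """Lloyd's algorithm: alternate assignment and centroid-update steps."""
    rng = random.Random(seed)
    centroids = [list(p) for p in rng.sample(points, k)]

    def nearest(p):
        # Index of the closest current centroid (Euclidean distance).
        return min(range(k), key=lambda j: math.dist(p, centroids[j]))

    for _ in range(n_iters):
        labels = [nearest(p) for p in points]
        new_centroids = []
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                new_centroids.append([sum(c) / len(members) for c in zip(*members)])
            else:
                new_centroids.append(centroids[j])  # empty cluster: keep old centroid
        if new_centroids == centroids:  # assignments stopped changing
            break
        centroids = new_centroids
    return centroids, [nearest(p) for p in points]


if __name__ == "__main__":
    # Toy 1-D classification data with an intercept column (made up for illustration).
    X = [[1.0, -2.0], [1.0, -1.0], [1.0, 1.0], [1.0, 2.0]]
    y = [0, 0, 1, 1]
    theta = fit_logistic_regression(X, y)
    print("theta:", theta)
    print("P(y=1 | x=1.5):", predict_proba(theta, [1.0, 1.5]))

    # Two well-separated blobs (also made up).
    pts = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (10.0, 10.0), (10.0, 11.0), (11.0, 10.0)]
    print(kmeans(pts, k=2))
```

The point of writing it this way is that each line maps back to an equation: the `err * xi[j]` term is the gradient of the log-loss for one example, and the two halves of the k-means loop are the assignment and update steps. If you can't say which equation a line comes from, that's the part to re-derive.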

FAQ

Is CS 229 hard?

Yes. It's a graduate course where problem sets are mostly mathematical derivation, not model-fitting. Students with solid linear algebra and probability who re-derive lecture results consistently manage it; students expecting an applied class get recalibrated by pset one.

Can I self-study CS 229 online?

The lectures and notes are public and famously good. The honest answer: watching them is the easy part. The understanding lives in the problem sets, and self-learners who schedule pset work like coursework get the real value while lecture-only watchers get vocabulary.

What's the difference between CS 229 and Andrew Ng's Coursera course?

The Coursera/DeepLearning.AI courses are applied and gentler, built for practitioners. CS 229 is the theory-first Stanford course: derivations, proofs, and the mathematical foundations underneath the same algorithms. Many people do both, in that order.
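
For a sense of what "the mathematical foundations underneath the same algorithms" looks like in practice, here is a sketch of one standard result: writing the Bernoulli distribution in exponential-family form, from which the sigmoid used in logistic regression falls out as the canonical response function. The notation (φ for the Bernoulli mean, η for the natural parameter, T, a, b for the exponential-family pieces, θ for the model parameters) follows a common convention and is an assumption here, not something taken from this guide.

```latex
\begin{align*}
p(y;\phi) &= \phi^{y}(1-\phi)^{1-y}, \qquad y \in \{0,1\} \\
          &= \exp\!\Big( y \log\frac{\phi}{1-\phi} + \log(1-\phi) \Big).
\end{align*}
% Match against the exponential-family form p(y;\eta) = b(y)\exp(\eta\,T(y) - a(\eta)):
\begin{align*}
\eta = \log\frac{\phi}{1-\phi}, \quad T(y) = y, \quad
a(\eta) = -\log(1-\phi) = \log(1+e^{\eta}), \quad b(y) = 1.
\end{align*}
% Inverting the first identity and setting \eta = \theta^{\top}x gives the logistic hypothesis:
\begin{align*}
h_\theta(x) = \mathbb{E}[y \mid x;\theta] = \phi = \frac{1}{1+e^{-\theta^{\top}x}}.
\end{align*}
```

An applied course typically hands you the sigmoid as a given; the theory-first version asks you to produce steps like these cold.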

More Stanford courses

CS 106A: Programming Methodology

CS 106A is Stanford's famous introduction to programming, taught in Python. It covers control flow, functions, decomposition, lists, dictionaries, and graphics, and assumes zero prior experience. Its lectures and assignments are public, and through Code in Place it has been taught free to hundreds of thousands of people, so it's studied worldwide by enrolled students and self-learners alike.

CS 106B: Programming Abstractions

CS 106B follows 106A with programming abstractions in C++: recursion, ADTs and the standard collections, big-O, linked structures, trees, and hashing. It's the course where Stanford CS gets real, and like 106A its materials are public and heavily used by self-learners.

CS 107: Computer Organization and Systems

CS 107 takes students from C++ down to the machine: C programming, pointers and memory, bit-level representation, x86-64 assembly, and how the heap actually works, culminating in the famous heap allocator assignment. It's the systems gateway of the Stanford CS core.

CS 103: Mathematical Foundations of Computing

CS 103 is Stanford's discrete math and theory gateway: proof techniques, set theory, induction, graph basics, then finite automata, regular languages, and the first look at computability and P vs NP. For most students it's the first course where the deliverable is a proof, not a program.