Stanford Machine Learning
The following notes represent a complete, stand alone interpretation of Stanford's machine learning course presented by Professor Andrew Ng and originally posted on the ml-class.org website during the fall 2011 semester. The topics covered are shown below, although for a more detailed summary see lecture 19. The only content not covered here is the Octave/MATLAB programming.
All diagrams are my own or are directly taken from the lectures, full credit to Professor Ng for a truly exceptional lecture course.
What are these notes?
Originally written as a way for me personally to help solidify and document the concepts, these notes have grown into a reasonably complete block of reference material spanning the course in its entirety in just over 40 000 words and a lot of diagrams! The target audience was originally me, but more broadly, can be someone familiar with programming although no assumption regarding statistics, calculus or linear algebra is made. We go from the very introduction of machine learning to neural networks, recommender systems and even pipeline design. The one thing I will say is that a lot of the later topics build on those of earlier sections, so it's generally advisable to work through in chronological order.
The notes were written in Evernote, and then exported to HTML automatically. As a result I take no credit/blame for the web formatting.
How can you help!?
If you notice errors or typos, inconsistencies or things that are unclear please tell me and I'll update them. It would be hugely appreciated!
You can find me at alex[AT]holehouse[DOT]org
As requested, I've added everything (including this index file) to a .RAR archive, which can be downloaded below. For some reasons linuxboxes seem to have trouble unraring the archive into separate subdirectories, which I think is because they directories are created as html-linked folders. Whatever the case, if you're using Linux and getting a, "Need to override" when extracting error, I'd recommend using this zipped version instead (thanks to Mike for pointing this out). They're identical bar the compression method. [Files updated 9th Jan]. RAR archive - (~20 MB) Zip archive - (~20 MB)
A changelog can be found here - Anything in the log has already been updated in the online content, but the archives may not have been - check the timestamp above.
- 01 and 02: Introduction, Regression Analysis and Gradient Descent
- 03: Linear Algebra - review
- 04: Linear Regression with Multiple Variables
- 05: Octave[incomplete]
- 06: Logistic Regression
- 07: Regularization
- 08: Neural Networks - Representation
- 09: Neural Networks - Learning
- 10: Advice for applying machine learning techniques
- 11: Machine Learning System Design
- 12: Support Vector Machines
- 13: Clustering
- 14: Dimensionality Reduction
- 15: Anomaly Detection
- 16: Recommender Systems
- 17: Large Scale Machine Learning
- 18: Application Example - Photo OCR
- 19: Course Summary