CS480 β Introduction to Machine Learning
Fall 2025, taught by Gautam Kamath at Waterloo.
- Course page: http://www.gautamkamath.com/courses/CS480-fa2025.html
- Sections: TTh 2:30β3:50 PM (DC 1302) / TTh 1:00β2:20 PM
- Final: December 11, 12:30β3:00 PM
Lectures
1. Supervised Learning: Linear Models
- Lec 1β2: Perceptron β convergence theorem, mistake bound, padding trick
- Lec 3: Linear Regression β ERM, squared loss, normal equations, MLE under Gaussian noise, ridge, lasso, cross validation
- Lec 4: k-Nearest Neighbours β Bayes optimal, Cover-Hart bound, curse of dimensionality
- Lec 5: Logistic Regression β logit transform, cross-entropy, softmax multiclass
2. Kernels and Margins
- Lec 6β7: Support Vector Machine β hard-margin, soft-margin, dual, support vectors, hinge loss
- Lec 8: Kernel Method β feature maps, kernel trick, RBF/polynomial, Mercerβs condition
3. Trees and Ensembles
- Lec 9: Decision Tree β entropy, Gini, information gain, pruning, Random Forest
- Lec 10: Bagging & Boosting (AdaBoost) β variance reduction, Hedge algorithm, weighted experts
4. Neural Networks
- Lec 11: Multilayer Perceptrons β XOR via nonlinearities, universal approximation, activations, backprop
- Lec 12: Deep Networks β regularization, weight decay, dropout, batch/layer norm, data augmentation
- Lec 13: Optimization + CNNs β SGD, momentum, AdaGrad, RMSProp, Adam
- Lec 15: Recurrent Neural Networks
5. Unsupervised Learning
- Lec 16: k-Means & Gaussian Mixture Models β fit via Expectation Maximization
6. Generative Models
- Lec 17: Autoencoders & Variational Autoencoders β ELBO, reparameterization trick
- Lec 18: Generative Adversarial Networks
- Lec 24: Normalizing Flows
7. Trustworthy ML
- Lec 19: Adversarial Robustness β FGSM, PGD, adversarial training
- Lec 20: Differential Privacy β -DP, Gaussian mechanism, DP-SGD
8. Sequence Models & LLMs
- Lec 21: Attention β Transformer
- Lec 22β23: Large Language Models β BERT, GPT, scaling laws, chain of thought, RLHF
Readings
Textbook shorthand used in the schedule:
- UML: Understanding Machine Learning (Shalev-Shwartz & Ben-David)
- ESL: Elements of Statistical Learning (Hastie, Tibshirani, Friedman)
- ISL: Introduction to Statistical Learning (James, Witten, Hastie, Tibshirani) β βmost recommended when availableβ
- DL: Deep Learning (Goodfellow, Bengio, Courville)
- D2L: Dive into Deep Learning (Zhang, Lipton, Li, Smola)