Welcome to Theoretical and Empirical Foundations of Modern Machine Learning (15-884), Fall 2022!
Instructor: Aditi Raghunathan (raditi at cmu dot edu)
TA: Christina Baek (kbaek at cs dot cmu dot edu)
Lectures: Tuesday, Thursday 4:40-6:00pm at GHC 4102
Overview:
In this advanced machine learning seminar class, we tackle the typical struggle in using the powerful deep learning machinery: what works and why? We build a conceptual understanding of deep learning through several different angles: standard in-distribution generalization, out-of-distribution generalization, self-supervised learning, scaling laws, memorization etc. We will read papers that contain a mix of theoretical and empirical insights with a focus on making connections to classic ideas, identifying recurring themes, and discussing avenues for future developments. The class aims to equip students with the ability to critically reason about and build a more principled understanding of current advances which will hopefully spark their own research.
Format:
This course combines lectures with paper presentations
by the students, encouraging both fundamental knowledge
acquisition as well as open-ended discussions and new research directions.
The lectures will briefly introduce the main concepts, summarize a few key papers
and connect to classical ideas if applicable.
The paper discussions will involve role-playing student seminars inspired by Alec Jacobson and Colin Raffel. We will be adopting the following roles.
Prerequisites: There are no official prerequisites but a knowledge of probability, linear algebra, machine learning is expected.
Course requirements:
Important dates:
Topics (tentative):
Schedule:
Date | Topic | Content | Presenter |
---|---|---|---|
08/30/2022 | [Lecture 1] Introduction |
|
Aditi Raghunathan |
09/01/2022 | [Lecture 2] The generalization puzzle | Uniform convergence, implicit regularization | Aditi Raghunathan |
09/06/2022 | [Paper discussion 1] Generalization | ||
09/08/2022 | [Paper discussion 2] Generalization | ||
09/13/2022 | [Guest Lecture] | Limitations of uniform convergence | Vaishnavh Nagarajan |
09/15/2022 | [Lecture 3] Phenomena captured by simpler models | Double descent, bias-variance tradeoff, kernel methods | |
09/20/2022 | [Paper discussion 3] | ||
09/22/2022 | [Lecture 4] Robustness of deep networks | Out-of-distribution generalization, adversarial examples, spurious correlations, shortcut learning and simplicity bias | Aditi Raghunathan |
09/27/2022 | [Paper discussion 4] Why are models brittle? (I) | ||
09/29/2022 | [Lecture 5] Robust training of deep networks | Robust optimization, accuracy tradeoff, effect of overparameterization | Aditi Raghunathan |
10/04/2022 | [Paper discussion 5] Why are models brittle? (II) | ||
10/06/2022 | [Lecture 6] Data poisoning, causality | Discussion of data poisoning, intro to causality | Aditi Raghunathan |
10/11/2022 | [Paper discussion 6] Causality | ||
10/13/2022 | [Lecture 7]Unlabeled data-I | A brief history | Aditi Raghunathan |
10/18/2022 | [Fall Break] | ||
10/20/2022 | [Fall Break] | ||
10/25/2022 | [Paper discussion 7] Learning from unlabeled data | ||
10/27/2022 | [Guest Lecture] | Self-supervised learning | Alexei A. Efros |
11/1/2022 | [Paper discussion 8] Learning from unlabeled data | ||
11/3/2022 | [Lecture 8]Unlabeled data-II | Analysis of self-training, self-supervision and domain adaptation methods | |
11/8/2022 | [Paper discussion 9]Distribution shifts with access to unlabeled data | ||
11/10/2022 | [Lecture 9] Foundation models | Transfer learning, analysis of fine-tuning methods, in-context learning | |
11/15/2022 | [Guest lecture] | Graham Neubig, Maarten Sap | |
11/17/2022 | [Paper discussion 10] | ||
11/22/2022 | [Paper discussion 11] | ||
11/24/2022 | [Thanksgiving break] | ||
11/29/2022 | [NeurIPS break] | ||
12/1/2022 | [NeurIPS break] | ||
12/06/2022 | [Guest Lecture] | Privacy and fairness in modern machine learning | Nicholas Carlini |
12/08/2022 | [Guest Lecture] | Benchmarking large language models | Rishi Bommasani |
12/13/2022 | [Paper presentation 12] | ||
12/15/2022 | [Project presentations] | ||
12/20/2022 | [Project presentations] |