Welcome to Theoretical and Empirical Foundations of Modern Machine Learning
15-789, Fall 2024!
Overview
The Fall 2024 iteration of this class will focus on foundation models. Foundation models have heralded a new era in modern machine learning. They are trained on massive raw data at scale and work well on a wide range of tasks with little to no fine-tuning. In this advanced machine learning seminar class, we will build a principled understanding of when and why they work or fail, and avenues for improving their reliability and trustworthiness. The class aims to equip students with the ability to critically reason about and build a more principled understanding of current advances which will hopefully spark their own research.
The Fall 2022 offering can be accessed here.
Format
This course combines lectures with paper presentations
by the students, encouraging both fundamental knowledge
acquisition as well as open-ended discussions and new research directions.
The lectures will briefly introduce the main concepts, summarize a few key papers
and connect to classical ideas if applicable.
The paper discussions will involve role-playing student seminars inspired by Alec Jacobson and Colin Raffel. We will be adopting the following roles.
Course Requirements
Important Dates
Schedule
Date | Topic | Content | Presenter |
---|---|---|---|
08/26/2024 | Lecture 1: Introduction |
|
Aditi Raghunathan |
08/28/2024 | Lecture 2: From ''classical'' ML to foundation models |
|
Aditi Raghunathan |
09/02/2024 | Labor Day (No class) | ||
09/04/2024 | Guest lecture | The pitfalls of next-token prediction | Vaishnavh Nagarajan |
09/09/2024 | Lecture moved to Friday 09/13/2024 | ||
09/11/2024 | Paper discussion 1 | ||
09/13/2024 | Lecture 3: The generalization puzzle |
|
Aditi Raghunathan |
09/16/2024 | Paper discussion 2 | ||
09/18/2024 | Lecture 4: Scaling laws | Aditi Raghunathan | |
09/23/2024 | Paper discussion 3 | ||
09/25/2024 | Lecture 5: Downstream capabilities of pretrained models |
|
Aditi Raghunathan |
09/30/2024 | Paper discussion 4 | ||
10/02/2024 | Lecture 6: A critical look at capabilities |
|
Aditi Raghunathan |
10/07/2024 | Paper discussion 5 | ||
10/09/2024 | Paper discussion 6 | ||
10/14/2024 | Fall break (No class) | ||
10/16/2024 | Fall break (No class) | ||
10/21/2024 | Lecture 7: A critical look at capabilities - part 2 |
|
Aditi Raghunathan |
10/23/2024 | Lecture 8: Post-training |
|
Aditi Raghunathan |
10/28/2024 | Paper discussion 7 | ||
10/30/2024 | Paper discussion 8 | ||
11/04/2024 | Lecture 9 |
|
Aditi Raghunathan |
11/06/2024 | Paper discussion 9 | ||
11/11/2024 | Paper discussion 10 | ||
11/13/2024 | Guest Lecture |
|
Tim Dettmers |
11/18/2024 | Guest Lecture: Inference-time methods |
|
Sean Welleck |
11/20/2024 | Paper discussion 11 | ||
11/25/2024 | Paper discussion 12 | ||
12/2/2024 | Project Presentations |
|
|
12/4/2024 | Project Presentations |
|