Foundations of Autonomous Decision Making under Uncertainty

10-734, Fall 2025

Aarti Singh

Note: this is a tentative lecture schedule that is subject to change.

Date	Topic	Notes	Useful links

Aug 26 Tuesday	Intro to Decision Making	Lec1
Aug 28 Thursday	Experimental Design	Lec2	On Computationally Tractable Selection of Experiments; Near-Optimal Design of Experiments via Regret Minimization; Combinatorial Algorithms for Optimal Design
Sept 2 Tuesday	Experimental Design; Multi-Armed Bandits	Lec3	Coreset selection NNs; Intro to Multi-Armed Bandits, Ch1
Sept 4 Thursday	Adaptive exploration, Martingale bounds	Lec4	Intro to Multi-Armed Bandits, Ch1; RL_theory book AJKS, Lemma 6.2
Sept 9 Tuesday	UCB; lower bound	Lec5	Intro to Multi-Armed Bandits, Ch1,2
Sept 11 Thursday	Nonparametric Bandits	Lec6	Intro to Multi-Armed Bandits, Ch4
Sept 16 Tuesday	Linear Bandits	Lec7	Reinforcement Learning: Theory & Algorithms, Ch6, Dani et al, Abbasi et al
Sept 18 Thursday	Linear Bandits concentration	MAB Motivation; Lec8	Intro to Multi-Armed Bandits, Ch3
Sept 23 Tuesday	Bayesian regret, Thompson Sampling	Lec9	Intro to Multi-Armed Bandits, Ch3; Optimize via posterior sampling, Russo_VanRoy; GP-UCB paper
Sept 25 Thursday	Online learning with experts	Lec10	Intro to Multi-Armed Bandits, Ch5
Sept 30 Tuesday	Adversarial Bandits	Lec11	Intro to Multi-Armed Bandits, Ch6
Oct 2 Thursday	Contextual Bandits	Lec12	Intro to Multi-Armed Bandits, Ch8
Oct 7 Tuesday	Generalized Bandits	Lec13	Square-CB paper
Oct 9 Thursday	Markov Decision Processes	Lec14	RL_theory book AJKS, Ch 1.1,1.2
Oct 14 Tuesday	No Class - FALL BREAK
Oct 16 Thursday	No Class - FALL BREAK
Oct 21 Tuesday	Value and Policy Iteration	Lec15	RL_theory book AJKS, Ch 1.3
Oct 23 Thursday	Tabular MDP	Lec16	RL_theory book AJKS, Ch 7
Oct 28 Tuesday	Tabular MDP	Lec17	RL_theory book AJKS, Ch 7
Oct 30 Thursday	Linear MDP	Lec18	LSVI-UCB paper
Nov 4 Tuesday	No Class - ELECTION DAY
Nov 6 Thursday	General Function Approximation	Lec19	RL_theory book AJKS, Ch 9; General_func_Akshaynotes
Nov 11 Tuesday	General Function Approximation; Policy Gradient	Lec20	RL_theory book AJKS, Ch 9,11
Nov 13 Thursday	Policy Gradient		RL_theory book AJKS, Ch 11
Nov 18 Tuesday	Offline RL	Lec22	RL_theory book AJKS, Ch 4
Nov 20 Thursday	Project Presentations
Nov 25 Tuesday	Project Presentations
Nov 27 Thursday	No Class - THANKSGIVING
Dec 2 Tuesday	Hybrid RL	Lec23	Hybrid_YudaNotes, Paper
Dec 4 Thursday	Reinforcement Learning from Human Feedback	Lec24	RLHF_paper