CMU MLD Logo

10-701: Introduction to Machine Learning

Spring 2026

Key Information and Links

Instructors: Geoff Gordon and Pradeep Ravikumar
Education Associate: Jacob Rast
Announcements/Q&A: We will be using Piazza for making announcements and answering questions.
Lectures: Mondays and Wednesdays from 2:00 PM to 3:20 PM (EST) in BH A36. Lectures will be recorded for students to review after the fact; the recordings will be hosted on Panopto or Canvas.
In-class Polls: Some portion of your grade will be determined based on participation in polls during lecture. These polls will be hosted on Google Forms.
Recitations: Fridays from 2:00 PM to 3:20 PM (EST) in BH A36. Attendance at recitations is optional and therefore, outside of extraordinary circumstances, these will not be recorded. Recitation handouts can be found under the Recitations tab.
Assignments: Homework handouts will be posted to the course website under the Assignments tab. All code and written responses should be submitted via Gradescope.
Grades: Feedback on homeworks and exams will be received via Gradescope. After the grade period ends, you can review your final grades before submission to the registrar on Canvas.
Office Hours: The time and location of office hours can be found on the course calendar.

Syllabus

1. Course Description

Machine learning studies the question: “how can we build adaptive algorithms that automatically improve their performance on a given task as they acquire more experience?” This can cover a wide array of technologies depending on what sort of task we have in mind, and what we take to constitute experience. Through this framing, we might view classical statistics problems, like estimating the likelihood that a coin lands on heads as an ML problem: the task is to produce an estimate, and the experience would consist of observations. But ML can also include robotics challenges, where the experience is acquired dynamically as our artificial agent interacts with the real world. Other grand challenges in machine learning relate to personalized medicine, materials discovery, and most recently generating media artifacts like images and text.

This course is designed to give PhD students a solid foundation in the methods, mathematics, and algorithms of modern machine learning. Students entering the class with a pre-existing working knowledge of probability, statistics and algorithms will be at an advantage, but the class has been designed so that anyone with a strong mathematical and computer science background can catch up and fully participate. If you are interested in this topic, but are not a PhD student, or are a PhD student not specializing in machine learning, you might consider the master's level course on machine learning, 10-601. This class may be appropriate for MS and undergraduate students who are interested in the theory and algorithms behind machine learning.

Learning Outcomes

By the end of the course, students should be able to:

2. Prerequisites

Students entering the class are expected to have pre-existing working knowledge in the following areas:

  1. Significant experience programming in a general programming language. Specifically, you need to have written from scratch programs consisting of several hundred lines of code. For undergraduate students, this will be satisfied for example by having passed 15-122 (Principles of Imperative Computation) with a grade of ‘C’ or higher, or comparable courses or experience elsewhere.

    Note: For the programming components of our homework assignments, you will be required to use Python. You will be expected to know, or be able to quickly pick up, that programming language.

  2. Basic familiarity with probability and statistics, as can be achieved at CMU by having passed 36-217 (Probability Theory and Random Processes), or 36-225 (Introduction to Probability and Statistics I), or 15-259, or 21-325, or comparable courses elsewhere, with a grade of ‘C’ or higher.
  3. College-level maturity in discrete mathematics, as can be achieved at CMU by having passed 21-127 (Concepts of Mathematics), 21-128 (Mathematical Concepts and Proofs), 15-151 (Mathematical Foundations for Computer Science) or comparable courses elsewhere, with a grade of ‘C’ or higher.
  4. College-level maturity in calculus and linear algebra, as can be achieved at CMU by having passed 21-122 (Integration and Approximation), 21-241 (Matrix Algebra), 21-242 (Matrix Theory) or comparable courses elsewhere, with a grade of ‘C’ or higher.

You must strictly adhere to these pre-requisites! Even if CMU’s registration system does not prevent you from registering for this course, it is still your responsibility to make sure you have all of these prerequisites before you register.

3. Recommended Textbooks

This course does not exactly follow any one textbook. However, most lectures will have some optional reading to help you better understand the material or see a different presentation/perspective. We recommend you read these after the corresponding lecture. These readings will typically be drawn from the following texts, many of which are freely available online:

The textbook below is a great resource for those hoping to brush up on the prerequisite mathematics background for this course:

However, if you find yourself regularly and frequently referring to this textbook, we strongly encourage you to consider whether or not you have the necessary prerequisite knowledge to succeed in this course.

4. Assessments

Your grade in this course will consist of participation in lectures, midterm and final exams, homework assignments and a class project. The breakdown is as follows:

We will convert numerical course grades to letter grades based on grade boundaries that are determined at the end of the semester. The following is a list of upper bounds on the grade cutoffs we will use; in all likelihood, these will be adjusted down at the end of the semester:

Participation

We will be using Google Forms for in-class polls.

You will always be allowed to submit multiple times so if there are multiple questions during a lecture, you should submit multiple times. Your participation grade will be based on the percentage of in-class polls answered:

The correctness of your responses will not be taken into account when computing participation grades, all that matters is that you submit something. You must submit your responses to participation polls within 48 hours from the start of lecture. You can submit the participation poll anytime within this 48-hour period, including after the lecture if you choose to watch asynchronously.

Midterm and Final Exams

Unless otherwise noted, all exams for this course will be closed notes; you may bring one sheet of A4 paper as a cheatsheet (both back and front may be used). You are encouraged to handwrite this cheatsheet as a form of preparing for the exam but you may typeset them if you so choose.

The first exam will take place roughly at the midpoint of the semester; see the schedule page for more details. The second exam will take place during the finals period and will be scheduled by the registrar. Please plan your travel accordingly as we will not be able accommodate individual travel needs (e.g., by offering the exam early). If you have an unavoidable conflict with either exam (e.g., an exam in another course), notify us by making a private post on Piazza.

Homework Assignments

The homework assignments will be divided into two components: programming and written; most assignments will have both components but HW5 and HW6 will be written only assignments. The programming portions will ask you to implement machine learning algorithms from scratch; they emphasize understanding of real-world applications of machine learning, building end-to-end systems, and experimental design. The written portions will focus on core concepts, “on-paper” implementations of classic learning algorithms, derivations, and understanding of theory. All programming must be completed in Python.

Homework Quizzes

In order to provide incentive against excessive use of generative AI, all homeworks with programming assignments (Homework 1 - Homework 4) will be accompanied by a corresponding homework quiz. These homework quizzes are intended to be lightweight and relatively low-stakes; they will be just 30 minutes long and should be possible to complete within 15 minutes. The questions asked are designed to determine if you understand the concepts presented in homework assignments. These quizzes are intended to be easy if you have properly worked through the material and difficult or impossible to answer if you have not.

Project

The course project is an opportunity for you to implement, explore and go beyond the methods presented in class applied to a real-world problem/research area. You will be given a small list of topics to choose from and will work in groups of 2 or 3; you may not work by yourself on the project.

There will be three deliverables associated with the project:

5. Office Hours

The schedule of office hours will always appear on the course calendar. In general, we will endeavor to hold a minimum of 6 office hours per week, with more in the week leading up to an exam.

We will make use of the following (informal) rules:

While you're awaiting your turn, we encourage you to listen in to the answers to any publicly answered questions. Please be courteous and allow the student who posed the question to primarily direct the discussion. We also encourage you to collaborate with others (following our collaboration policies below) while waiting.

6. General Policies

Late homework policy

You have a total of 6 grace days for use on any homework assignment. We will automatically keep a tally of these grace days for you; they will be applied greedily. You may not use more than 2 grace days on any single assignment. Submitting a homework late when you are out of grace days will result in the following penalties:

Grace days may only be applied to homeworks. Grace days may not be used on project deliverables. You may not combine grace days and late penalties to submit more than 48 hours late; Any homework submitted more than 48 hours late will receive a score of 0.

Extensions

In general, we do not grant extensions on assignments. There are several exceptions:

For any of the above situations, you may request an extension by emailing Jacob at jrast@andrew.cmu.edudo not email the instructor or TAs. Please be specific about which assessment(s) you are requesting an extension for and the number of hours requested. The email should be sent as soon as you are aware of the conflict and at least 24 hours prior to the deadline. In the case of an emergency, no notice is needed.

If this is a medical emergency or mental health crisis, you must also CC your CMU college liaison and/or your academic advisor. Do not submit any medical documentation to the course staff. If necessary, your college liaison and The Division of Student Affairs (DoSA) will request such documentation and they will view the health documentation and conclude whether a retroactive extension is appropriate. If you haven’t interacted with your college liaison before, they are experienced student affairs staff who work in partnership with students, housefellows, advisors, faculty, and associate deans in each college to assure support for students regarding their overall Carnegie Mellon experience.

Audit Policy

Formal auditing of this course is permitted. You must follow the official procedures for a course audit as outlined by the HUB/registrar. Please do not email the instructor requesting permission to audit. Instead, you should first register for the appropriate section. Next fill out the Course Audit Approval form, obtain your academic advisor's signature and then approach the instructor in-person immediately after lecture to obtain their signature.

Auditors are required to:

Pass/Fail Policy

You are allowed to take this course as Pass/Fail; instructor permission is not required. What letter grade is the cutoff for a Pass will depend on your specific program; we do not specify whether or not you Pass but rather we compute your letter grade the same as everyone else in the class and your program converts that letter grade to a Pass or Fail depending on their cutoff. Be sure to check with your program/department as to whether you can count a Pass/Fail course towards your degree requirements.

Accommodations for Students with Disabilities

If you have a disability and have an accommodations letter from the Disability Resources office, please email Jacob at jrast@andrew.cmu.edu to set up a meeting for the purposes of discussing your accommodations and needs as early in the semester as possible. He will work with you to ensure that accommodations are provided as appropriate. If you suspect that you may have a disability and would benefit from accommodations but are not yet registered with the Office of Disability Resources, I encourage you to contact them at access@andrew.cmu.edu.

7. Technologies

We will use a variety of technologies throughout the course:

8. Collaboration and Academic Integrity

Read this section carefully!

Collaboration among Students

The purpose of student collaboration is to facilitate learning, not to circumvent it. Studying the material in groups is strongly encouraged. You are also allowed to seek help from other students in understanding the material needed to solve a particular homework problem, provided any written notes (including code) are taken on an impermanent surface (e.g., whiteboard, chalkboard), and provided learning is facilitated, not circumvented. The actual solution must be written by each student alone.

A good method to follow when collaborating is to meet with your peers, discuss ideas at a high level, but do not copy down any notes from each other or from a shared work surface. Any scratch work done at this time should be your own only. Before writing the assignment solutions, you should make sure that you are doing this without anyone else present, putting all notes away, closing all tabs on your computer, and writing it completely by yourself with no other resources.

You are absolutely not allowed to share/compare answers or screen share your work with one another.

The presence or absence of any form of help or collaboration, whether given or received, must be explicitly stated and disclosed in full by all involved. Specifically, each assignment solution must include answers to the following questions:

  1. Did you receive any help whatsoever from anyone in solving this assignment? Yes / No.
    • If you answered ‘yes’, give full details: ____________
    • (e.g., "Jane Doe explained to me what is asked in Question 3.4")
  2. Did you give any help whatsoever to anyone in solving this assignment? Yes / No.
    • If you answered ‘yes’, give full details: _____________
    • (e.g., "I pointed Joe Smith to section 2.3 since he didn’t know how to proceed with Question 2")
  3. Did you find or come across code that implements any part of this assignment? Yes / No. (See below policy on "found code")
    • If you answered ‘yes’, give full details: _____________
    • (book & page, URL & location within the page, etc.).

If you gave help after turning in your own assignment and/or after answering the questions above, you must update your answers before the assignment’s deadline, if necessary by emailing the course staff.

Collaboration without full disclosure will be handled severely, in compliance with CMU’s Policy on Academic Integrity.

Previously Used Assignments

Some of the homework assignments used in this class may have been used in prior offerings, in classes at other institutions, or elsewhere. Solutions to them may be, or may have been, available online, or from other people or sources. It is explicitly forbidden to use any such sources, or to consult people who have solved these problems before. It is explicitly forbidden to search for these problems or their solutions on the internet. You must complete the homework assignments completely on your own. We will be actively monitoring your compliance. Collaboration with other students who are currently taking the class is allowed, but only under the conditions stated above.

AI Assistance

To best support your own learning, you should complete all graded assignments in this course yourself. We strongly suggest that you refrain from using AI tools to generate any content (text, video, audio, images, code, etc.) for an assessment. However, students are allowed to consult with AI chatbots provided that they do not do so while actively working on a course deliverable. There should never be a situation while your assignment is open at the same time as any chatbot. In this regard, you should treat an AI the same way as any other collaborator or internet reference and divulge usage in the collaboration section of your homework.

Policy Regarding "Found Code"

You are encouraged to read books and other instructional materials, both online and offline, to help you understand the concepts and algorithms taught in class. These materials may contain example code or pseudocode, which may help you better understand an algorithm or an implementation detail. However, when you implement your own solution to an assignment, you must put all materials aside, and write your code completely on your own, starting "from scratch". Specifically, you may not use any code you found or came across. If you find or come across code that implements any part of your assignment, you must disclose this fact in your collaboration statement.

Duty to Protect One’s Work

Students are responsible for proactively protecting their work from copying and misuse by other students. If a student’s work is copied by another student, the original author is also considered to be in violation of the course policies. It does not matter whether the author allowed the work to be copied or was merely negligent in preventing it from being copied. When overlapping work is submitted by different students, both students will be punished.

To protect future students, do not post your solutions publicly, neither during the course nor afterwards.

Penalties for Violations of Course Policies

All violations of course policies (even the first one) will always be reported to the university authorities (your department head, associate dean, the dean of Student Affairs, etc.) as an official Academic Integrity Violation and will carry severe penalties.

  1. The penalty for the first violation is a negative 100% on the assignment i.e., it would have been better to submit nothing and receive a 0%.
  2. The penalty for the second violation is failure in the course, and can even lead to dismissal from the university.

9. Support

Take care of yourself. Do your best to maintain a healthy lifestyle by eating well, exercising, avoiding drugs and alcohol, getting enough sleep and taking some time to relax. This will help you achieve your goals and cope with stress.

All of us benefit from support during times of struggle. You are not alone. There are many helpful resources available on campus and an important part of the college experience is learning how to ask for help. Asking for support sooner rather than later is often helpful.

If you or anyone you know experiences any academic stress, difficult life events, or feelings like anxiety or depression, we strongly encourage you to seek support. Counseling and Psychological Services (CaPS) is here to help: call 412-268-2922 and visit their website at http://www.cmu.edu/counseling/.

If you or someone you know is feeling suicidal or in danger of self-harm, call someone immediately, day or night:

10. Diversity

We must treat every individual with respect. We are diverse in many ways, and this diversity is fundamental to building and maintaining an equitable and inclusive campus community. Diversity can refer to multiple ways that we identify ourselves, including but not limited to race, color, national origin, language, sex, disability, age, sexual orientation, gender identity, religion, creed, ancestry, belief, veteran status, or genetic information. Each of these diverse identities, along with many others not mentioned here, shape the perspectives our students, faculty, and staff bring to our campus. We, at CMU, will work to promote diversity, equity and inclusion not only because diversity fuels excellence and innovation, but because we want to pursue justice. We acknowledge our imperfections while we also fully commit to the work, inside and outside of our classrooms, of building and sustaining a campus community that increasingly embraces these core values.

Each of us is responsible for creating a safer, more inclusive environment.

Unfortunately, incidents of bias or discrimination do occur, whether intentional or unintentional. They contribute to creating an unwelcoming environment for individuals and groups at the university. Therefore, the university encourages anyone who experiences or observes unfair or hostile treatment on the basis of identity to speak out for justice and support, within the moment of the incident or after the incident has passed. Anyone can share these experiences using the following resources:

All reports will be documented and deliberated to determine if there should be any following actions. Regardless of incident type, the university will use all shared experiences to transform our campus climate to be more equitable and just.


Staff

Instructor

Geoff Gordon
Pradeep Ravikumar

Education Associate

Jacob Rast
Jacob

Teaching Assistants

Roger Yang
Alan Song
Terrance Liu
Efe Mert Karagozlu
Roger Yang
Zixi (Alan) Song

Schedule

Lectures are the primary mode of content delivery in this course. Attending lectures is highly recommended; there will be regular in-class activities and polls which will constitute a small portion of your final grade. Engaging in these real-time activities can greatly improve your understanding of the material. Lectures will be recorded and made available to you after the fact. However, the primary purpose of these recordings is to allow you to refer back to the content; watching recordings in lieu of attending lectures is not encouraged.

Date Topic Slides Readings/Resources
Mon, Jan. 12 Introduction and ML as function approximation Lecture 1 (Inked)
Wed, Jan. 14 Decision Trees Lecture 2 (Inked) Mitchell, Chapter 3
Daumé III, Chapter 1
Mon, Jan. 19 No Class (MLK Day)
Wed, Jan. 21 KNNs and Model Selection Lecture 3 (Inked) Mitchell, Sections 8.1 - 8.2
Daumé III, Chapter 3
Mon, Jan. 26 Linear Regression Lecture 4 (Inked) Murphy, Sections 7.1 - 7.3
Wed, Jan. 28 MLE/MAP Lecture 5 Mitchell, Estimating Probabilities
Fri, Jan. 30 Homework Quiz 1
Mon, Feb. 2 Optimization for ML Lecture 6 Bottou, Stochastic Gradient Descent Tricks
Goodfellow, Bengio, Courville, Deep Learning, Chapter 8
Wed, Feb. 4 Logistic Regression Lecture 7 Murphy, Sections 8.1 - 8.3
Mon, Feb. 9 Regularization and Generalization Lecture 8 Murphy, Sections 7.5 & 14.4
Wed, Feb. 11 Neural Networks, Backpropagation and AutoDiff Lecture 9 (Inked) Mitchell, Chapter 4.1 - 4.6
Zhang, Lipton, Li & Smola, Chapter 5.1 - 5.3
Fri, Feb 13 Homework Quiz 2
Mon, Feb. 16 CNNs Lecture 10 (Inked) Zhang, Lipton, Li & Smola, Chapter 7
Wed, Feb. 18 CNNs (cont'd) and RNNs Lecture 11 (Inked) Zhang, Lipton, Li & Smola, Chapters 9 & 10.1
Mon, Feb. 23 Vector-space embeddings Lecture 12 (Inked)
Wed, Feb. 25 Midterm
Mon, Mar. 2 No Class (Spring Break)
Wed, Mar. 4 No Class (Spring Break)
Mon, Mar. 9 Attention and Transformers Lecture 13 Zhang, Lipton, Li & Smola, Chapter 11
Wed, Mar. 11 Societal impacts of ML and algorithmic bias Lecture 14
Fri, Mar 13 Homework Quiz 3
Mon, Mar. 16 Clustering Lecture 15 Daumé III, Chapter 15
Murphy, Sections 11.1 - 11.4.2
Wed, Mar. 18 Dimensionality reduction Lecture 16 Daumé III, Chapter 15
Murphy, Sections 12.2.1 - 12.2.3
Mon, Mar. 23 RL Fundamentals Lecture 17 (Inked) Mitchell, Chapter 13
Wed, Mar. 25 Policy gradients and actor-critic methods Lecture 18 (Inked) Sutton and Barto
Fri, Mar 27 Homework Quiz 4
Mon, Mar. 30 Pretraining, fine-tuning, in-context learning Lecture 19 (Inked)
Wed, Apr. 1 Learning theory I Lecture 20
Mon, Apr. 6 Learning theory II Lecture 21
Wed, Apr. 8 Bagging Lecture 22
Fri, Apr. 10 No Class (Spring Carnival)
Mon, Apr. 13 Boosting Lecture 23 (Inked) Freund and Schapire, A Short Introduction to Boosting
Wed, Apr. 15 SVMs and Duality Lecture 24 (Inked)
Mon, Apr. 20 Kernel methods and Gaussian processes Lecture 25 Murphy, Sections 14.1 - 14.5
Wed, Apr. 22 Generative Models Lecture 26
Thu, Apr. 30 Final

Recitations

Attendance at recitations is not required, but strongly encouraged. Recitations will be interactive and focus on problem solving; we strongly encourage you to actively participate. A problem sheet will usually be released prior to the recitation. If you are unable to attend one or you missed an important detail, feel free to stop by office hours to ask the TAs about the content that was covered. Of course, we also encourage you to exchange notes with your peers.

Date Topic Handout Solutions
Fri, Jan. 16 No Recitation
Fri, Jan. 23 Recitation 1: Decision Trees & kNNs Recitation 1 Solutions
Fri, Jan. 30 Recitation 2: Linear Regression & MLE/MAP Recitation 2 Solutions
Fri, Feb. 6 Recitation 3: Logistic Regression Recitation 3 Solutions
Fri, Feb. 13 Recitation 4: Neural Networks and PyTorch Recitation 4 Solutions
Fri, Feb. 20 Midterm Practice Problem Review Problems Solutions
Fri, Feb. 27 No Recitation (Midterm Week)
Fri, Mar. 6 No Recitation (Spring Break)
Fri, Mar. 20 Recitation 5: Working with Text Data Slides (Notebook)
Fri, Mar. 27 Recitation 6: Reinforcement Learning Recitation 6 Solutions
Fri, Apr. 3 Recitation 7: Learning Theory Recitation 7 Solutions
Fri, Apr. 10 No Recitation (Spring Carnival)
Fri, Apr. 17 Recitation 8: Ensemble Methods & SVMs Recitation 8 Solutions
Fri, Apr. 24 Final Project Video Showcase Screening
Final Practice Problems Problems Solutions

Assignments

Our homework assignments are an opportunity for you all to reason about and build/experiment with some of the models that we introduce in class. All programming questions must be completed in Python and you must use LaTeX to typeset your responses to the written questions. You will submit both your code and your written responses using Gradescope; note that each assignment will have separate submissions for the code and the written portion.

Release Date Topic Files Due Date
Fri, Jan. 16 HW1: Decision Trees & kNNs HW1 Handout Tue, Jan. 27
Fri, Jan. 30 HW2: Linear Regression, Logistic Regression, and MLE/MAP HW2 Handout Tue, Feb. 10
Fri, Feb. 20 HW3: Regularization and Neural Networks HW3 Handout v2 Tue, Mar. 10
Fri, Mar. 13 HW4: CNNs, RNNs, and Transformers HW4 Handout Tue, Mar. 24
Mon, Mar. 30 HW5: Reinforcement Learning and Unsupervised Learning HW5 Handout Wed, Apr. 8
Mon, Apr. 13 HW6: Learning Theory and Ensemble Methods HW6 Handout Wed, Apr. 22

Project

Details about the project can be found in the Project Specifications Handout.

Deliverable Link Due Date
Proposal Fri, Mar. 13
First Mentor Meeting NA Fri, Mar. 20
Check-in Wed, Apr. 15
Video Showcase Wed, Apr. 22
Final Report Thur, Apr. 23

Course Calendar