16-311 Introduction to Robotics

  16-311 Homework 4


Learning Objectives

  1. Implement a discounted reward policy for a simplified motion planning problem by hand.
  2. Think critically about path planning and reward applications.
  3. Implement imitation learning for a cartpole.
  4. Observe the effects of policy improvement.
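As a rough illustration of the first objective, the sketch below computes a discounted return for a short reward sequence. It is not taken from the handout: the function name discounted_return, the reward values, and the discount factor gamma = 0.5 are all assumptions chosen for illustration, and the handout's own problem setup takes precedence.

```python
def discounted_return(rewards, gamma):
    """Return sum(gamma**t * rewards[t]) for a finite reward sequence.

    Both the name and the signature are hypothetical, for illustration only.
    """
    total = 0.0
    # Work backwards so each step applies G_t = r_t + gamma * G_{t+1}.
    for reward in reversed(rewards):
        total = reward + gamma * total
    return total


# Assumed example: two steps with no reward, then a goal reward of 10.
example_return = discounted_return([0.0, 0.0, 10.0], gamma=0.5)
print(example_return)
```

With these assumed numbers the goal reward is discounted twice, so a reward that arrives later contributes less to the return than the same reward arriving sooner.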

Background

Homework Requirements

The specifications for the homework are in the Homework 4 Handout.

Extensions

Last updated 02/07/2024 by Ananya Rao
(c) 1999-2023: Howie Choset, Carnegie Mellon