16-281 General Robotics
Main Schedule Homework Labs Links

16-311 Homework 4

Learning Objectives

Implement a discounted reward policy for a simplified motion planning problem by hand.
Think critically about path planning and reward applications.
Implement imitation learning for a cartpole.
Observe the effects of policy improvement.

Background

The specifications for the homework are here Homework 4 Handout

Last updated 02/07/2024 by Ananya Rao
(c) 1999-2023: Howie Choset, Carnegie Mellon