Slide 4 of 7
Notes:
LQR wins big on this problem. It represents the cart at the goal position
with the pendulum slightly perturbed from its resting position. This is
a very clean solution. The graph can be generated by
nrdp cartpole lqrcon x0 1 -0.5 0 0 graphstates