Robotics Institute Seminar, October 22, 1999
Place and Time |
Seminar Abstract |
Speaker Biography |
Speaker Appointments
See also:
Seminar Schedule |
This Week's Seminar |
Last Week's Seminar |
Seminar Archive
Variable Resolution Discretization in Optimal Control
Remi Munos
Robotics Institute
Carnegie Mellon University
1305 Newell-Simon Hall
Refreshments 3:15 pm
Talk 3:30 pm
The problem of making decisions in stochastic environments is central to many
areas, including robotics, finance, industrial manufacturing, and game playing.
When we consider optimal control problems described in terms of continuous space
and time variables, we deduce highly non-linear partial differential equations:
the well-known Hamilton-Jacobi-Bellman equations. Consistent discretizations of
these equations (for example by using Finite-Elements or Finite-Differences
methods) generate Dynamic Programming equations whose solutions approximate the
value function and the optimal policy. Similar adaptive methods provide
convergent Reinforcement Learning algorithms.
Here I will consider variable resolution discretization methods built in a
top-down approach: an initial coarse grid is successively refined according to
some splitting criterion. I will introduce and evaluate several splitting
methods, from local to global approaches in which we take into account the impact
of a cell on the whole state-space when deciding wether to split. I will
illustrate their performance on several benchmark problems: ``Car on the Hill'',
the ``Acrobot'', and the ``Inverted pendulum''. Futher research using sparse
representations and Monte-Carlo methods will also be discussed.
In 1991, Munos graduated from the engineering school `Ecole Nationale Superieure
des Telecommunications' in Paris. Following that, he pursued a diploma (DEA) in
Cognitive Sciences and in Mathematics at the `University Pierre et Marie Curie',
Paris. In 1997, he received a PhD from the `Ecole des Hautes Etudes en Sciences
Sociales' where he worked on theoretical aspects of Reinforcement Learning in the
continuous case and the link with Viscosity Solutions.
Since May 1998, he has been working as a postdoctoral researcher at the
Auton Lab supervised by Prof. Andrew Moore at the Robotics Institute, Carnegie
Mellon University.
For appointments, please contact the speaker, Remi Munos (Remi Munos).
The Robotics Institute
is part of the School of Computer
Science, Carnegie Mellon
University.
This page automatically generated at 10:56:57 AM on
Wednesday, November 1, 2000.
This page can be found on the world wide web at
http://www.ri.cmu.edu/seminar/1999.october.22.html.
This system maintained by Salvatore Domenick Desiano (sal@ri.cmu.edu).