Robotics Institute: Seminar, October 22, 1999: Remi Munos

Variable Resolution Discretization in Optimal Control

Remi Munos
Robotics Institute
Carnegie Mellon University

Place and Time

1305 Newell-Simon Hall
Refreshments 3:15 pm
Talk 3:30 pm

Abstract

The problem of making decisions in stochastic environments is central to many areas, including robotics, finance, industrial manufacturing, and game playing. When we consider optimal control problems described in terms of continuous space and time variables, we obtain highly non-linear partial differential equations: the well-known Hamilton-Jacobi-Bellman equations. Consistent discretizations of these equations (for example, by finite-element or finite-difference methods) generate Dynamic Programming equations whose solutions approximate the value function and the optimal policy. Similar adaptive methods provide convergent Reinforcement Learning algorithms.
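
As a hedged illustration of the link between the two kinds of equation (the notation here is an assumption and is not taken from the talk): for deterministic dynamics dx/dt = f(x,u), running reward r(x,u), and discount rate rho > 0, one standard form of the Hamilton-Jacobi-Bellman equation is the first line below. The second line is a discretization of it with time step tau on grid points xi, where the weights p(xi' | xi, u) are nonnegative, sum to one, and interpolate the point xi + tau f(xi,u) from neighboring grid points.

```latex
\begin{align}
\rho\, V(x) &= \max_{u} \left[ r(x,u) + \nabla V(x) \cdot f(x,u) \right] \\
V(\xi) &= \max_{u} \left[ \tau\, r(\xi,u) + e^{-\rho \tau} \sum_{\xi'} p(\xi' \mid \xi, u)\, V(\xi') \right]
\end{align}
```

Because the weights behave like transition probabilities, the second equation has the form of a Dynamic Programming equation for a Markov decision process. Expanding it to first order in tau recovers the first equation, which is the sense in which the discretization is consistent. In the stochastic case the first equation gains a second-order term.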

Here I will consider variable resolution discretization methods built top-down: an initial coarse grid is successively refined according to some splitting criterion. I will introduce and evaluate several splitting methods, ranging from local criteria to global ones that take into account the impact of a cell on the whole state space when deciding whether to split. I will illustrate their performance on several benchmark problems: "Car on the Hill", the "Acrobot", and the "Inverted pendulum". Further research using sparse representations and Monte-Carlo methods will also be discussed.
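
The top-down refinement loop can be sketched in a few lines. This is a minimal one-dimensional illustration under stated assumptions, not the speaker's method: the function `refine`, the stand-in value function, and the midpoint interpolation-error criterion are all hypothetical, and the criterion is a purely local one (the global criteria mentioned above are not modeled).

```python
import math


def refine(value, lo, hi, n_initial=4, tol=0.05, max_rounds=10):
    """Top-down refinement of a 1-D grid of cells (a, b).

    A cell is split in half when linear interpolation between its
    endpoints misses the value at its midpoint by more than tol.
    This local criterion is an assumption chosen for illustration.
    """
    # Start from a coarse uniform grid.
    width = (hi - lo) / n_initial
    cells = [(lo + i * width, lo + (i + 1) * width) for i in range(n_initial)]

    for _ in range(max_rounds):
        new_cells = []
        split_any = False
        for a, b in cells:
            mid = 0.5 * (a + b)
            # Local splitting criterion: interpolation error at the midpoint.
            err = abs(value(mid) - 0.5 * (value(a) + value(b)))
            if err > tol:
                new_cells.extend([(a, mid), (mid, b)])
                split_any = True
            else:
                new_cells.append((a, b))
        cells = new_cells
        if not split_any:
            break  # every cell already satisfies the criterion
    return cells


def stand_in_value(x):
    # Hypothetical value function with a sharp transition near x = 0.
    return math.tanh(20.0 * x)


cells = refine(stand_in_value, -1.0, 1.0)
widths = [b - a for a, b in cells]
```

With this stand-in function, the grid stays coarse where the function is nearly flat and becomes fine only around the transition, which is the behavior a splitting criterion is meant to produce.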

Speaker Biography

In 1991, Munos graduated from the engineering school Ecole Nationale Superieure des Telecommunications in Paris. Following that, he pursued a diploma (DEA) in Cognitive Sciences and in Mathematics at the University Pierre et Marie Curie, Paris. In 1997, he received a PhD from the Ecole des Hautes Etudes en Sciences Sociales, where he worked on theoretical aspects of Reinforcement Learning in the continuous case and the link with Viscosity Solutions.

Since May 1998, he has been working as a postdoctoral researcher in the Auton Lab, supervised by Prof. Andrew Moore, at the Robotics Institute, Carnegie Mellon University.

Speaker Appointments

For appointments, please contact the speaker, Remi Munos.

The Robotics Institute is part of the School of Computer Science, Carnegie Mellon University.
This page can be found on the world wide web at http://www.ri.cmu.edu/seminar/1999.october.22.html.