Human Behavior Modeling with Maximum Entropy Inverse Optimal Control
Brian D. Ziebart, Andrew Maas, J. Andrew Bagnell and Anind K. Dey
AAAI Spring Symposium on Human Behavior Modeling. 2009.
[
pdf]
Abstract:
In our research, we view human behavior as a structured sequence
of context-sensitive decisions. We develop a conditional probabilistic
model for predicting human decisions given the contextual situation.
Our approach employs the
principle of maximum entropy within the Markov Decision
Process framework. Modeling human behavior is reduced
to recovering a context-sensitive utility function that explains
demonstrated behavior within the probabilistic model.
In this work, we review the development of our probabilistic model
(Ziebart et al. 2008a) and the results of its application
to modeling the context-sensitive route preferences of
drivers (Ziebart et al. 2008b). We additionally expand the
approach's applicability to domains with stochastic dynamics,
present preliminary experiments on modeling time-usage,
and discuss remaining challenges for applying our approach
to other human behavior modeling problems.
Brian Ziebart's Homepage