next up previous contents
Next: Timeline Up: Summary Previous: Summary

Contributions

This thesis will contribute to the state-of-the-art in machine learning, reinforcement learning, heuristic search and combinatorial optimization. Its specific contributions will include:

  1. A source reference for researchers interested in automatic learning and tuning of evaluation functions.
  2. A new algorithm for learning approximate value functions in large, acyclic Markov decision problems.
  3. Formulations for posing general combinatorial optimization problems as Value Function Approximation tasks, extending [Zhang1996].
  4. A new algorithm for using prediction learning to bootstrap search algorithms.
  5. Development of new techniques, based on memory-based stochastic optimization, for direct meta-optimization of evaluation functions.
  6. An empirical comparison, on several large-scale applications, of the VFA and direct-optimization approaches.
  7. A software product for public distribution that implements the major algorithms for evaluation function learning.

The following research topics are closely related to this thesis but nonetheless outside the scope of what I plan to explore:



Justin A. Boyan
Sat Jun 22 20:49:48 EDT 1996