Journal of Artificial Intelligence
Complete Table of Contents (with abstracts)
Volume1
Wellman, M.P. (1993)
"A Market-Oriented Programming Environment and its Application to
Distributed Multicommodity Flow Problems", Volume 1, pages 1-23.
PostScript: volume1/wellman93a.ps (348K)
PDF: volume1/wellman93a.pdf (265K)
Abstract: Market price systems constitute a well-understood class of
mechanisms that under certain conditions provide effective
decentralization of decision making with minimal communication
overhead. In a {\em market-oriented programming\/} approach to
distributed problem solving, we derive the activities and resource
allocations for a set of computational agents by computing the
competitive equilibrium of an artificial economy. {\sc Walras}
provides basic constructs for defining computational market
structures, and protocols for deriving their corresponding price
equilibria. In a particular realization of this approach for a form
of multicommodity flow problem, we see that careful construction of
the decision process according to economic principles can lead to
efficient distributed resource allocation, and that the behavior of
the system can be meaningfully analyzed in economic terms.
Ginsberg, M.L. (1993)
"Dynamic Backtracking", Volume 1, pages 25-46.
PostScript: volume1/ginsberg93a.ps (211K)
Online Appendix: volume1/ginsberg93a-appendix.lisp (7K), Crossword data
PDF: volume1/ginsberg93a.pdf (244K)
Abstract: Because of their occasional need to return to shallow points
in a search tree, existing backtracking methods can sometimes erase
meaningful progress toward solving a search problem. In this paper,
we present a method by which backtrack points can be moved deeper in
the search space, thereby avoiding this difficulty. The technique
developed is a variant of dependency-directed backtracking that uses
only polynomial space while still providing useful control information
and retaining the completeness guarantees provided by earlier
approaches.
Gent, I.P. and Walsh, T. (1993)
"An Empirical Analysis of Search in GSAT", Volume 1, pages 47-59.
PostScript: volume1/gent93a.ps (500K)
PDF: volume1/gent93a.pdf (234K)
Abstract: We describe an extensive study of search in GSAT, an
approximation procedure for propositional satisfiability. GSAT
performs greedy hill-climbing on the number of satisfied clauses in a
truth assignment. Our experiments provide a more complete picture of
GSAT's search than previous accounts. We describe in detail the two
phases of search: rapid hill-climbing followed by a long plateau
search. We demonstrate that when applied to randomly generated 3SAT
problems, there is a very simple scaling with problem size for both
the mean number of satisfied clauses and the mean branching rate. Our
results allow us to make detailed numerical conjectures about the
length of the hill-climbing phase, the average gradient of this phase,
and to conjecture that both the average score and average branching
rate decay exponentially during plateau search. We end by showing how
these results can be used to direct future theoretical analysis. This
work provides a case study of how computer experiments can be used to
improve understanding of the theoretical properties of algorithms.
Schlimmer, J.C. and Hermens, L.A. (1993)
"Software Agents: Completing Patterns and Constructing User Interfaces",
Volume 1, pages 61-89.
PostScript: volume1/schlimmer93a.ps (1.2 M)
compressed, volume1/schlimmer93a.ps.Z (278K)
Online Appendix: volume1/schlimmer93a-appendix.hqx (1.3M), Quicktime Demo
PDF: volume1/schlimmer93a.pdf (160K)
Abstract: To support the goal of allowing users to record and retrieve
information, this paper describes an interactive note-taking system
for pen-based computers with two distinctive features. First, it
actively predicts what the user is going to write. Second, it
automatically constructs a custom, button-box user interface on
request. The system is an example of a learning-apprentice software-
agent. A machine learning component characterizes the syntax and
semantics of the user's information. A performance system uses this
learned information to generate completion strings and construct a
user interface.
Description of Online Appendix: People like to record information. Doing
this on paper is initially efficient, but lacks flexibility. Recording
information on a computer is less efficient but more powerful. In our
new note taking softwre, the user records information directly on a
computer. Behind the interface, an agent acts for the user. To help,
it provides defaults and constructs a custom user interface.
The demonstration is a QuickTime movie of the note taking agent in
action. The file is a binhexed self-extracting archive. Macintosh
utilities for binhex are available from mac.archive.umich.edu. QuickTime
is available from ftp.apple.com in the dts/mac/sys.soft/quicktime.
Bergadano, F., Gunetti, D. and Trinchero, U. (1993)
"The Difficulties of Learning Logic Programs with Cut", Volume 1,
pages 91-107.
PostScript: volume1/bergadano93a.ps (185K)
PDF: volume1/bergadano93a.pdf (214K)
Abstract: As real logic programmers normally use cut (!), an
effective learning procedure for logic programs should be able to deal
with it. Because the cut predicate has only a procedural meaning,
clauses containing cut cannot be learned using an extensional
evaluation method, as is done in most learning systems. On the other
hand, searching a space of possible programs (instead of a space of
independent clauses) is unfeasible. An alternative solution is to
generate first a candidate base program which covers the positive
examples, and then make it consistent by inserting cut where
appropriate. The problem of learning programs with cut has not been
investigated before and this seems to be a natural and reasonable
approach. We generalize this scheme and investigate the difficulties
that arise. Some of the major shortcomings are actually caused, in
general, by the need for intensional evaluation. As a conclusion, the
analysis of this paper suggests, on precise and technical grounds,
that learning cut is difficult, and current induction techniques
should probably be restricted to purely declarative logic languages.
Buchheit, M., Donini, F.M. and Schaerf, A. (1993)
"Decidable Reasoning in Terminological Knowledge Representation Systems",
Volume 1, pages 109-138.
PostScript: volume1/buchheit93a.ps (311K)
PDF: volume1/buchheit93a.pdf (336K)
Abstract: Terminological knowledge representation systems (TKRSs) are
tools for designing and using knowledge bases that make use of
terminological languages (or concept languages). We analyze from a
theoretical point of view a TKRS whose capabilities go beyond the ones
of presently available TKRSs. The new features studied, often required
in practical applications, can be summarized in three main points.
First, we consider a highly expressive terminological language, called
ALCNR, including general complements of concepts, number restrictions
and role conjunction. Second, we allow to express inclusion statements
between general concepts, and terminological cycles as a particular
case. Third, we prove the decidability of a number of desirable
TKRS-deduction services (like satisfiability, subsumption and instance
checking) through a sound, complete and terminating calculus for
reasoning in ALCNR-knowledge bases. Our calculus extends the general
technique of constraint systems. As a byproduct of the proof, we get
also the result that inclusion statements in ALCNR can be simulated
by terminological cycles, if descriptive semantics is adopted.
Nilsson, N. (1994)
"Teleo-Reactive Programs for Agent Control", Volume 1, pages 139-158.
PostScript: volume1/nilsson94a.ps (477K)
compressed, volume1/nilsson94a.ps.Z (155K)
PDF: volume1/nilsson94a.pdf (259K)
Abstract: A formalism is presented for computing and organizing actions
for autonomous agents in dynamic environments. We introduce the notion
of {\it teleo-reactive (T-R) programs} whose execution entails the
construction of circuitry for the continuous computation of the
parameters and conditions on which agent action is based. In addition
to continuous feedback, T-R programs support parameter binding and
recursion. A primary difference between T-R programs and many other
circuit-based systems is that the circuitry of T-R programs is more
compact; it is constructed at run time and thus does not have to
anticipate all the contingencies that might arise over all possible
runs. In addition, T-R programs are intuitive and easy to write and
are written in a form that is compatible with automatic planning and
learning methods. We briefly describe some experimental applications
of T-R programs in the control of simulated and actual mobile robots.
Koppel, M., Feldman R. and Segre, A.M. (1994)
"Bias-Driven Revision of Logical Domain Theories", Volume 1, pages 159-208.
PostScript: volume1/koppel94a.ps (465K)
compressed, volume1/koppel94a.ps.Z (203K)
PDF: volume1/koppel94a.pdf (257K)
Abstract: The theory revision problem is the problem of how best to
go about revising a deficient domain theory using information
contained in examples that expose inaccuracies. In this paper we
present our approach to the theory revision problem for propositional
domain theories. The approach described here, called PTR, uses
probabilities associated with domain theory elements to numerically
track the ``flow'' of proof through the theory. This allows us to
measure the precise role of a clause or literal in allowing or
preventing a (desired or undesired) derivation for a given example.
This information is used to efficiently locate and repair flawed
elements of the theory. PTR is proved to converge to a theory which
correctly classifies all examples, and shown experimentally to be fast
and accurate even for deep theories.
Ling, C.X. (1994)
"Learning the Past Tense of English Verbs: The Symbolic Pattern Associator
vs. Connectionist Models", Volume 1, pages 209-229.
PostScript: volume1/ling94a.ps (247K)
Online Appendix: volume1/ling94a-appendix.Z (109K) data file, compressed
PDF: volume1/ling94a.pdf (268K)
Abstract: Learning the past tense of English verbs - a seemingly minor
aspect of language acquisition - has generated heated debates since
1986, and has become a landmark task for testing the adequacy of
cognitive modeling. Several artificial neural networks (ANNs) have
been implemented, and a challenge for better symbolic models has been
posed. In this paper, we present a general-purpose Symbolic Pattern
Associator (SPA) based upon the decision-tree learning algorithm ID3.
We conduct extensive head-to-head comparisons on the generalization
ability between ANN models and the SPA under different
representations. We conclude that the SPA generalizes the past tense
of unseen verbs better than ANN models by a wide margin, and we offer
insights as to why this should be the case. We also discuss a new
default strategy for decision-tree learning algorithms.
Cook, D.J. and Holder, L.B. (1994)
"Substructure Discovery Using Minimum Description Length and Background
Knowledge", Volume 1, pages 231-255.
PostScript: volume1/cook94a.ps (750K)
compressed, volume1/cook94a.ps.Z (266K)
Online Appendix: volume1/cook94a-appendix.tar.Z (36K), SUBDUE source code
PDF: volume1/cook94a.pdf (285K)
Abstract: The ability to identify interesting and repetitive
substructures is an essential component to discovering knowledge in
structural data. We describe a new version of our SUBDUE substructure
discovery system based on the minimum description length principle.
The SUBDUE system discovers substructures that compress the original
data and represent structural concepts in the data. By replacing
previously-discovered substructures in the data, multiple passes of
SUBDUE produce a hierarchical description of the structural
regularities in the data. SUBDUE uses a computationally-bounded
inexact graph match that identifies similar, but not identical,
instances of a substructure and finds an approximate measure of
closeness of two substructures when under computational constraints.
In addition to the minimum description length principle, other
background knowledge can be used by SUBDUE to guide the search towards
more appropriate substructures. Experiments in a variety of domains
demonstrate SUBDUE's ability to find substructures capable of
compressing the original data and to discover structural concepts
important to the domain.
Description of Online Appendix: This is a compressed tar file containing
the SUBDUE discovery system, written in C. The program accepts as
input databases represented in graph form, and will output discovered
substructures with their corresponding value.
Murphy, P.M. and Pazzani, M.J. (1994)
"Exploring the Decision Forest: An Empirical Investigation of Occam's
Razor in Decision Tree Induction", Volume 1, pages 257-275.
PostScript: volume1/murphy94a.ps (868K)
compressed, volume1/murphy94a.ps.Z (215K)
PDF: volume1/murphy94a.pdf (480K)
Abstract: We report on a series of experiments in which all decision
trees consistent with the training data are constructed. These
experiments were run to gain an understanding of the properties of the
set of consistent decision trees and the factors that affect the
accuracy of individual trees. In particular, we investigated the
relationship between the size of a decision tree consistent with some
training data and the accuracy of the tree on test data. The
experiments were performed on a massively parallel Maspar computer.
The results of the experiments on several artificial and two real
world problems indicate that, for many of the problems investigated,
smaller consistent decision trees are on average less accurate than
the average accuracy of slightly larger trees.
Borgida, A. and Patel-Schneider, P.F. (1994)
"A Semantics and Complete Algorithm for Subsumption in the CLASSIC
Description Logic", Volume 1, pages 277-308.
PostScript: volume1/borgida94a.ps (319K)
PDF: volume1/borgida94a.pdf (349K)
Abstract: This paper analyzes the correctness of the subsumption
algorithm used in CLASSIC, a description logic-based knowledge
representation system that is being used in practical applications.
In order to deal efficiently with individuals in CLASSIC descriptions,
the developers have had to use an algorithm that is incomplete with
respect to the standard, model-theoretic semantics for description
logics. We provide a variant semantics for descriptions with respect
to which the current implementation is complete, and which can be
independently motivated. The soundness and completeness of the
polynomial-time subsumption algorithm is established using description
graphs, which are an abstracted version of the implementation
structures used in CLASSIC, and are of independent interest.
Sebastiani, R. (1994)
"Applying GSAT to Non-Clausal Formulas" (Research Note), Volume 1,
pages 309-314.
PostScript: volume1/sebastiani94a.ps (144K)
PDF: volume1/sebastiani94a.pdf (175K)
Abstract: In this paper we describe how to modify GSAT so that it can
be applied to non-clausal formulas. The idea is to use a particular
``score'' function which gives the number of clauses of the CNF
conversion of a formula which are false under a given truth assignment.
Its value is computed in linear time, without constructing the CNF
conversion itself. The proposed methodology applies to most of the
variants of GSAT proposed so far.
Volume2
Murthy, S.K., Kasif, S. and Salzberg, S. (1994)
"A System for Induction of Oblique Decision Trees", Volume 2, pages 1-32.
PostScript: volume2/murthy94a.ps (475K)
Online Appendix: volume2/murthy94a-appendix.tar.Z (297K), OC1 source code
PDF: volume2/murthy94a.pdf (373K)
Abstract: This article describes a new system for induction of oblique
decision trees. This system, OC1, combines deterministic
hill-climbing with two forms of randomization to find a good oblique
split (in the form of a hyperplane) at each node of a decision tree.
Oblique decision tree methods are tuned especially for domains in
which the attributes are numeric, although they can be adapted to
symbolic or mixed symbolic/numeric attributes. We present extensive
empirical studies, using both real and artificial data, that analyze
OC1's ability to construct oblique trees that are smaller and more
accurate than their axis-parallel counterparts. We also examine the
benefits of randomization for the construction of oblique decision trees.
Grove, A.J., Halpern, J.Y. and Koller, D. (1994)
"Random Worlds and Maximum Entropy", Volume 2, pages 33-88.
PostScript: volume2/grove94a.ps (624K)
compressed, volume2/grove94a.ps.Z (243K)
PDF: volume2/grove94a.pdf (2.6M)
Abstract: Given a knowledge base KB containing first-order and
statistical facts, we consider a principled method, called the
random-worlds method, for computing a degree of belief that some
formula Phi holds given KB. If we are reasoning about a world or
system consisting of N individuals, then we can consider all possible
worlds, or first-order models, with domain {1,...,N} that satisfy KB,
and compute the fraction of them in which Phi is true. We define the
degree of belief to be the asymptotic value of this fraction as N
grows large. We show that when the vocabulary underlying Phi and KB
uses constants and unary predicates only, we can naturally associate
an entropy with each world. As N grows larger, there are many more
worlds with higher entropy. Therefore, we can use a maximum-entropy
computation to compute the degree of belief. This result is in a
similar spirit to previous work in physics and artificial
intelligence, but is far more general. Of equal interest to the
result itself are the limitations on its scope. Most importantly, the
restriction to unary predicates seems necessary. Although the
random-worlds method makes sense in general, the connection to maximum
entropy seems to disappear in the non-unary case. These observations
suggest unexpected limitations to the applicability of maximum-entropy
methods.
Kitani, T., Eriguchi, Y. and Hara, M. (1994)
"Pattern Matching and Discourse Processing in Information Extraction
from Japanese Text", Volume 2, pages 89-110.
PostScript: volume2/kitani94a.ps (465K)
PDF: volume2/kitani94a.pdf (547K)
Abstract: Information extraction is the task of automatically picking
up information of interest from an unconstrained text. Information of
interest is usually extracted in two steps. First, sentence level
processing locates relevant pieces of information scattered throughout
the text; second, discourse processing merges coreferential
information to generate the output. In the first step, pieces of
information are locally identified without recognizing any
relationships among them. A key word search or simple pattern search
can achieve this purpose. The second step requires deeper knowledge
in order to understand relationships among separately identified
pieces of information. Previous information extraction systems
focused on the first step, partly because they were not required to
link up each piece of information with other pieces. To link the
extracted pieces of information and map them onto a structured output
format, complex discourse processing is essential. This paper reports
on a Japanese information extraction system that merges information
using a pattern matcher and discourse processor. Evaluation results
show a high level of system performance which approaches human performance.
Safra, S. and Tennenholtz, M. (1994)
"On Planning while Learning", Volume 2, pages 111-129.
PostScript: volume2/safra94a.ps (202K)
PDF: volume2/safra94a.pdf (733K)
Abstract: This paper introduces a framework for Planning while
Learning where an agent is given a goal to achieve in an environment
whose behavior is only partially known to the agent.
We discuss the tractability of various plan-design processes. We
show that for a large natural class of Planning while Learning
systems, a plan can be presented and verified in a reasonable time.
However, coming up algorithmically with a plan, even for simple
classes of systems is apparently intractable.
We emphasize the role of off-line plan-design processes, and show
that, in most natural cases, the verification (projection) part can be
carried out in an efficient algorithmic manner.
Soderland, S. and Lehnert. W. (1994)
"Wrap-Up: a Trainable Discourse Module for Information Extraction",
Volume 2, pages 131-158.
PostScript: volume2/soderland4a.ps (442K)
PDF: volume2/soderland94a.pdf (234K)
Abstract: The vast amounts of on-line text now available have led to
renewed interest in information extraction (IE) systems that analyze
unrestricted text, producing a structured representation of selected
information from the text. This paper presents a novel approach that
uses machine learning to acquire knowledge for some of the higher
level IE processing. Wrap-Up is a trainable IE discourse component
that makes intersentential inferences and identifies logical relations
among information extracted from the text. Previous corpus-based
approaches were limited to lower level processing such as
part-of-speech tagging, lexical disambiguation, and dictionary
construction. Wrap-Up is fully trainable, and not only automatically
decides what classifiers are needed, but even derives the feature set
for each classifier automatically. Performance equals that of a
partially trainable discourse module requiring manual customization
for each domain.
Buntine, W.L. (1994)
"Operations for Learning with Graphical Models", Volume 2, pages 159-225.
PostScript: volume2/buntine94a.ps (1.53M)
compressed, volume2/buntine94a.ps.Z (568K)
PDF: volume2/buntine94a.pdf (2.9M)
Abstract: This paper is a multidisciplinary review of empirical,
statistical learning from a graphical model perspective. Well-known
examples of graphical models include Bayesian networks, directed
graphs representing a Markov chain, and undirected networks
representing a Markov field. These graphical models are extended to
model data analysis and empirical learning using the notation of
plates. Graphical operations for simplifying and manipulating a
problem are provided including decomposition, differentiation, and the
manipulation of probability models from the exponential family. Two
standard algorithm schemas for learning are reviewed in a graphical
framework: Gibbs sampling and the expectation maximization algorithm.
Using these operations and schemas, some popular algorithms can be
synthesized from their graphical specification. This includes
versions of linear regression, techniques for feed-forward networks,
and learning Gaussian and discrete Bayesian networks from data. The
paper concludes by sketching some implications for data analysis and
summarizing how some popular algorithms fall within the framework
presented.
The main original contributions here are the decomposition techniques
and the demonstration that graphical models provide a framework for
understanding and developing complex learning algorithms.
Minton, S., Bresina, J. and Drummond, M. (1994)
"Total-Order and Partial-Order Planning: A Comparative Analysis",
Volume 2, pages 227-262.
PostScript: volume2/minton94a.ps (520K)
Online Appendix: volume2/minton94a-appendix.tar.Z (64K), source code & data
HTML: www.cs.washington.edu/research/jair/volume2/minton94a-html/paper.html
PDF: volume2/minton94a.pdf (391K)
Abstract: For many years, the intuitions underlying partial-order
planning were largely taken for granted. Only in the past few years
has there been renewed interest in the fundamental principles
underlying this paradigm. In this paper, we present a rigorous
comparative analysis of partial-order and total-order planning by
focusing on two specific planners that can be directly compared. We
show that there are some subtle assumptions that underly the
wide-spread intuitions regarding the supposed efficiency of
partial-order planning. For instance, the superiority of partial-order
planning can depend critically upon the search strategy and the
structure of the search space. Understanding the underlying
assumptions is crucial for constructing efficient planners.
Dietterich, T.G. and Bakiri, G. (1995)
"Solving Multiclass Learning Problems via Error-Correcting Output Codes",
Volume 2, pages 263-286.
PostScript: volume2/dietterich95a.ps (265K)
PDF: volume2/dietterich95a.pdf (259K)
Abstract: Multiclass learning problems involve finding a definition
for an unknown function f(x) whose range is a discrete set containing
k>2 values (i.e., k ``classes''). The definition is acquired by
studying collections of training examples of the form .
Existing approaches to multiclass learning problems include direct
application of multiclass algorithms such as the decision-tree
algorithms C4.5 and CART, application of binary concept learning
algorithms to learn individual binary functions for each of the k
classes, and application of binary concept learning algorithms with
distributed output representations. This paper compares these three
approaches to a new technique in which error-correcting codes are
employed as a distributed output representation. We show that these
output representations improve the generalization performance of both
C4.5 and backpropagation on a wide range of multiclass learning tasks.
We also demonstrate that this approach is robust with respect to
changes in the size of the training sample, the assignment of
distributed representations to particular classes, and the application
of overfitting avoidance techniques such as decision-tree pruning.
Finally, we show that---like the other methods---the error-correcting
code technique can provide reliable class probability estimates.
Taken together, these results demonstrate that error-correcting output
codes provide a general-purpose method for improving the performance
of inductive learning programs on multiclass problems.
Cichosz, P. (1995)
"Truncating Temporal Differences: On the Efficient Implementation of
TD(lambda) for Reinforcement Learning", Volume 2, pages 287-318.
PostScript: volume2/cichosz95a.ps (313K)
PDF: volume2/cichosz95a.pdf (350K)
Abstract: Temporal difference (TD) methods constitute a class of
methods for learning predictions in multi-step prediction problems,
parameterized by a recency factor lambda. Currently the most important
application of these methods is to temporal credit assignment in
reinforcement learning. Well known reinforcement learning algorithms,
such as AHC or Q-learning, may be viewed as instances of TD learning.
This paper examines the issues of the efficient and general
implementation of TD(lambda) for arbitrary lambda, for use with
reinforcement learning algorithms optimizing the discounted sum of
rewards. The traditional approach, based on eligibility traces, is
argued to suffer from both inefficiency and lack of generality. The
TTD (Truncated Temporal Differences) procedure is proposed as an
alternative, that indeed only approximates TD(lambda), but requires
very little computation per action and can be used with arbitrary
function representation methods. The idea from which it is derived is
fairly simple and not new, but probably unexplored so far. Encouraging
experimental results are presented, suggesting that using lambda>0
with the TTD procedure allows one to obtain a significant learning
speedup at essentially the same cost as usual TD(0) learning.
Hanks, S. and Weld, D.S. (1995)
"A Domain-Independent Algorithm for Plan Adaptation", Volume 2,
pages 319-360.
PostScript: volume2/hanks95a.ps (500K)
PDF: volume2/hanks95a.ps (234K)
Abstract: The paradigms of transformational planning, case-based
planning, and plan debugging all involve a process known as plan
adaptation --- modifying or repairing an old plan so it solves a new
problem. In this paper we provide a domain-independent algorithm for
plan adaptation, demonstrate that it is sound, complete, and
systematic, and compare it to other adaptation algorithms in the
literature.
Our approach is based on a view of planning as searching a graph of
partial plans. Generative planning starts at the graph's root and
moves from node to node using plan-refinement operators. In planning
by adaptation, a library plan---an arbitrary node in the plan
graph---is the starting point for the search, and the plan-adaptation
algorithm can apply both the same refinement operators available to a
generative planner and can also retract constraints and steps from the
plan. Our algorithm's completeness ensures that the adaptation
algorithm will eventually search the entire graph and its systematicity
ensures that it will do so without redundantly searching any parts of
the graph.
Ortega, J. (1995)
"On the Informativeness of the DNA Promoter Sequences Domain Theory"
(Research Note), Volume 2, pages 361-367.
PostScript: volume2/ortega95a.ps (134K)
PDF: volume2/ortega95a.pdf (149K)
Abstract: The DNA promoter sequences domain theory and database have
become popular for testing systems that integrate empirical and
analytical learning. This note reports a simple change and
reinterpretation of the domain theory in terms of M-of-N concepts,
involving no learning, that results in an accuracy of 93.4% on the 106
items of the database. Moreover, an exhaustive search of the space of
M-of-N domain theory interpretations indicates that the expected
accuracy of a randomly chosen interpretation is 76.5%, and that a
maximum accuracy of 97.2% is achieved in 12 cases. This demonstrates
the informativeness of the domain theory, without the complications of
understanding the interactions between various learning algorithms and
the theory. In addition, our results help characterize the difficulty
of learning using the DNA promoters theory.
Turney, P.D. (1995)
"Cost-Sensitive Classification: Empirical Evaluation of a Hybrid
Genetic Decision Tree Induction Algorithm", Volume 2, pages 369-409.
PostScript: volume2/turney95a.ps (474K)
compressed, volume2/turney95a.ps.Z (183K)
PDF: volume2/turney95a.pdf (203K)
Abstract: This paper introduces ICET, a new algorithm for
cost-sensitive classification. ICET uses a genetic algorithm to evolve
a population of biases for a decision tree induction algorithm. The
fitness function of the genetic algorithm is the average cost of
classification when using the decision tree, including both the costs
of tests (features, measurements) and the costs of classification
errors. ICET is compared here with three other algorithms for
cost-sensitive classification - EG2, CS-ID3, and IDX - and also with
C4.5, which classifies without regard to cost. The five algorithms are
evaluated empirically on five real-world medical datasets. Three sets
of experiments are performed. The first set examines the baseline
performance of the five algorithms on the five datasets and
establishes that ICET performs significantly better than its
competitors. The second set tests the robustness of ICET under a
variety of conditions and shows that ICET maintains its advantage. The
third set looks at ICET's search in bias space and discovers a way to
improve the search.
Donoho, S.K. and Rendell, L.A. (1995)
"Rerepresenting and Restructuring Domain Theories: A Constructive
Induction Approach", Volume 2, pages 411-446.
PostScript: volume2/donoho95a.ps (501K)
compressed, volume2/donoho95a.ps.Z (179K)
Online Appendix: volume2/donoho94a-appendix.tar.Z (150K), source code & data
PDF: volume2/donoho95a.pdf (343K)
Abstract: Theory revision integrates inductive learning and background
knowledge by combining training examples with a coarse domain theory
to produce a more accurate theory. There are two challenges that
theory revision and other theory-guided systems face. First, a
representation language appropriate for the initial theory may be
inappropriate for an improved theory. While the original
representation may concisely express the initial theory, a more
accurate theory forced to use that same representation may be bulky,
cumbersome, and difficult to reach. Second, a theory structure
suitable for a coarse domain theory may be insufficient for a
fine-tuned theory. Systems that produce only small, local changes to
a theory have limited value for accomplishing complex structural
alterations that may be required.
Consequently, advanced theory-guided learning systems require flexible
representation and flexible structure. An analysis of various theory
revision systems and theory-guided learning systems reveals specific
strengths and weaknesses in terms of these two desired properties.
Designed to capture the underlying qualities of each system, a new
system uses theory-guided constructive induction. Experiments in
three domains show improvement over previous theory-guided systems.
This leads to a study of the behavior, limitations, and potential of
theory-guided constructive induction.
David, P. (1995)
"Using Pivot Consistency to Decompose and Solve Functional CSPs",
Volume 2, pages 447-474.
PostScript: volume2/david95a.ps (489K)
compressed, volume2/david95a.ps.Z (220K)
PDF: volume2/david95a.pdf (1.3M)
Abstract: Many studies have been carried out in order to increase the
search efficiency of constraint satisfaction problems; among them,
some make use of structural properties of the constraint
network; others take into account semantic properties of the
constraints, generally assuming that all the constraints possess
the given property. In this paper, we propose a new decomposition
method benefiting from both semantic properties of functional
constraints (not bijective constraints) and structural
properties of the network; furthermore, not all the constraints need
to be functional. We show that under some conditions, the existence
of solutions can be guaranteed. We first characterize a particular
subset of the variables, which we name a root set. We then
introduce pivot consistency, a new local consistency which is a
weak form of path consistency and can be achieved in O(n^2d^2)
complexity (instead of O(n^3d^3) for path consistency), and we
present associated properties; in particular, we show that any
consistent instantiation of the root set can be linearly extended to a
solution, which leads to the presentation of the aforementioned new
method for solving by decomposing functional CSPs.
Schaerf, A., Shoham, Y. and Tennenholtz, M. (1995)
"Adaptive Load Balancing: A Study in Multi-Agent Learning",
Volume 2, pages 475-500.
PostScript: volume2/schaerf95a.ps (265K)
compressed, volume2/schaerf95a.ps.Z (107K)
PDF: volume2/schaerf95a.pdf (286K)
Abstract: We study the process of multi-agent reinforcement learning
in the context of load balancing in a distributed system, without use
of either central coordination or explicit communication. We first
define a precise framework in which to study adaptive load balancing,
important features of which are its stochastic nature and the purely
local information available to individual agents. Given this
framework, we show illuminating results on the interplay between basic
adaptive behavior parameters and their effect on system efficiency.
We then investigate the properties of adaptive load balancing in
heterogeneous populations, and address the issue of exploration vs.
exploitation in that context. Finally, we show that naive use of
communication may not improve, and might even harm system efficiency.
Cohen, W.W. (1995a)
"Pac-Learning Recursive Logic Programs: Efficient Algorithms",
Volume 2, pages 501-539.
PostScript: volume2/cohen95a.ps (339K)
compressed, volume2/cohen95a.ps.Z (135K)
PDF: volume2/cohen95a.pdf (367K)
Abstract: We present algorithms that learn certain classes of
function-free recursive logic programs in polynomial time from
equivalence queries. In particular, we show that a single k-ary
recursive constant-depth determinate clause is learnable. Two-clause
programs consisting of one learnable recursive clause and one
constant-depth determinate non-recursive clause are also learnable, if
an additional ``basecase'' oracle is assumed. These results
immediately imply the pac-learnability of these classes. Although
these classes of learnable recursive programs are very constrained, it
is shown in a companion paper that they are maximally general, in that
generalizing either class in any natural way leads to a
computationally difficult learning problem. Thus, taken together with
its companion paper, this paper establishes a boundary of efficient
learnability for recursive logic programs.
Cohen, W.W. (1995b)
"Pac-learning Recursive Logic Programs: Negative Results",
Volume 2, pages 541-573.
PostScript: volume2/cohen95b.ps (342K)
compressed, volume2/cohen95b.ps.Z (136K)
PDF: volume2/cohen95b.pdf (375K)
Abstract: In a companion paper it was shown that the class of
constant-depth determinate k-ary recursive clauses is efficiently
learnable. In this paper we present negative results showing that any
natural generalization of this class is hard to learn in Valiant's
model of pac-learnability. In particular, we show that the following
program classes are cryptographically hard to learn: programs with an
unbounded number of constant-depth linear recursive clauses; programs
with one constant-depth determinate clause containing an unbounded
number of recursive calls; and programs with one linear recursive
clause of constant locality. These results immediately imply the
non-learnability of any more general class of programs. We also show
that learning a constant-depth determinate program with either two
linear recursive clauses or one linear recursive clause and one
non-recursive clause is as hard as learning boolean DNF. Together
with positive results from the companion paper, these negative results
establish a boundary of efficient learnability for recursive
function-free clauses.
Russell, S.J., and Subramanian, D. (1995)
"Provably Bounded-Optimal Agents",
Volume 2, pages 575-609.
PostScript: volume2/russell95a.ps (513K)
compressed, volume2/russell95a.ps.Z (195K)
PDF: volume2/russell95a.pdf (394K)
Abstract: Since its inception, artificial intelligence has relied
upon a theoretical foundation centered around perfect rationality as
the desired property of intelligent systems. We argue, as others have
done, that this foundation is inadequate because it imposes
fundamentally unsatisfiable requirements. As a result, there has
arisen a wide gap between theory and practice in AI, hindering
progress in the field. We propose instead a property called bounded
optimality. Roughly speaking, an agent is bounded-optimal if its
program is a solution to the constrained optimization problem
presented by its architecture and the task environment. We show how to
construct agents with this property for a simple class of machine
architectures in a broad class of real-time environments. We
illustrate these results using a simple model of an automated mail
sorting facility. We also define a weaker property, asymptotic
bounded optimality (ABO), that generalizes the notion of optimality in
classical complexity theory. We then construct universal ABO
programs, i.e., programs that are ABO no matter what real-time
constraints are applied. Universal ABO programs can be used as
building blocks for more complex systems. We conclude with a
discussion of the prospects for bounded optimality as a theoretical
basis for AI, and relate it to similar trends in philosophy,
economics, and game theory.
Volume3
Mooney, R.J. and Califf, M.E. (1995)
"Induction of First-Order Decision Lists: Results on Learning the Past
Tense of English Verbs", Volume 3, pages 1-24.
PostScript: volume3/mooney95a.ps (252K)
compressed, volume3/mooney95a.ps.Z (102K)
PDF: volume3/mooney95a.pdf (277K)
Abstract: This paper presents a method for inducing logic programs
from examples that learns a new class of concepts called first-order
decision lists, defined as ordered lists of clauses each ending in a
cut. The method, called FOIDL, is based on FOIL (Quinlan, 1990) but
employs intensional background knowledge and avoids the need for
explicit negative examples. It is particularly useful for problems
that involve rules with specific exceptions, such as learning the
past-tense of English verbs, a task widely studied in the context of
the symbolic/connectionist debate. FOIDL is able to learn concise,
accurate programs for this problem from significantly fewer examples
than previous methods (both connectionist and symbolic).
Veloso, M. and Stone, P. (1995)
"FLECS: Planning with a Flexible Commitment Strategy",
Volume 3, pages 25-52.
PostScript: volume3/veloso95a.ps (294K)
compressed, volume3/veloso95a.ps.Z (126K)
Online Appendix: volume3/veloso95a-appendix.tar.Z (20K) data file
PDF: volume3/veloso95a.pdf (341K)
Abstract: There has been evidence that least-commitment planners can
efficiently handle planning problems that involve difficult goal
interactions. This evidence has led to the common belief that
delayed-commitment is the "best" possible planning strategy. However,
we recently found evidence that eager-commitment planners can handle a
variety of planning problems more efficiently, in particular those
with difficult operator choices. Resigned to the futility of trying
to find a universally successful planning strategy, we devised a
planner that can be used to study which domains and problems are best
for which planning strategies. In this article we introduce this new
planning algorithm, FLECS, which uses a FLExible Commitment Strategy
with respect to plan-step orderings. It is able to use any strategy
from delayed-commitment to eager-commitment. The combination of
delayed and eager operator-ordering commitments allows FLECS to take
advantage of the benefits of explicitly using a simulated execution
state and reasoning about planning constraints. FLECS can vary its
commitment strategy across different problems and domains, and also
during the course of a single planning problem. FLECS represents a
novel contribution to planning in that it explicitly provides the
choice of which commitment strategy to use while planning. FLECS
provides a framework to investigate the mapping from planning domains
and problems to efficient planning strategies.
Bergmann, R. and Wilke, W. (1995)
"Building and Refining Abstract Planning Cases by Change of
Representation Language", Volume 3, pages 53-118.
PostScript: volume3/bergmann95a.ps (850K)
compressed, volume3/bergmann95a.ps.Z (302K)
Online Appendix: volume3/bergmann95a-appendix (11K) data file
PDF: volume3/bergmann95a.pdf (586K)
Abstract: Abstraction is one of the most promising approaches to improve the
performance of problem solvers. In several domains abstraction by
dropping sentences of a domain description -- as used in most
hierarchical planners -- has proven useful. In this paper we present
examples which illustrate significant drawbacks of abstraction by
dropping sentences. To overcome these drawbacks, we propose a more
general view of abstraction involving the change of representation
language. We have developed a new abstraction methodology and a
related sound and complete learning algorithm that allows the complete
change of representation language of planning cases from concrete to
abstract. However, to achieve a powerful change of the representation
language, the abstract language itself as well as rules which describe
admissible ways of abstracting states must be provided in the domain
model. This new abstraction approach is the core of Paris (Plan
Abstraction and Refinement in an Integrated System), a system in which
abstract planning cases are automatically learned from given concrete
cases. An empirical study in the domain of process planning in
mechanical engineering shows significant advantages of the proposed
reasoning from abstract cases over classical hierarchical planning.
Zhao, Q. and Nishida, T. (1995)
"Using Qualitative Hypotheses to Identify Inaccurate Data",
Volume 3, pages 119-145.
PostScript: volume3/zhao95a.ps (626K)
compressed, volume3/zhao95a.ps.Z (200K)
PDF: volume3/zhao95a.pdf (383K)
Abstract: Identifying inaccurate data has long been regarded as a
significant and difficult problem in AI. In this paper, we present a
new method for identifying inaccurate data on the basis of qualitative
correlations among related data. First, we introduce the definitions
of related data and qualitative correlations among related data. Then
we put forward a new concept called support coefficient function
(SCF). SCF can be used to extract, represent, and calculate
qualitative correlations among related data within a dataset. We
propose an approach to determining dynamic shift intervals of
inaccurate data, and an approach to calculating possibility of
identifying inaccurate data, respectively. Both of the approaches are
based on SCF. Finally we present an algorithm for identifying
inaccurate data by using qualitative correlations among related data
as confirmatory or disconfirmatory evidence. We have developed a
practical system for interpreting infrared spectra by applying the
method, and have fully tested the system against several hundred real
spectra. The experimental results show that the method is
significantly better than the conventional methods used in many
similar systems.
Giraud-Carrier, C.G. and Martinez, T.R. (1995)
"An Integrated Framework for Learning and Reasoning",
Volume 3, pages 147-185.
Online Appendix: volume3/giraud-carrier95a-appendix.tar (90K) source, data
PostScript: volume3/giraud-carrier95a.ps (375K)
compressed, volume3/giraud-carrier95a.ps.Z (152K)
Abstract: Learning and reasoning are both aspects of what is
considered to be intelligence. Their studies within AI have been
separated historically, learning being the topic of machine learning
and neural networks, and reasoning falling under classical (or
symbolic) AI. However, learning and reasoning are in many ways
interdependent. This paper discusses the nature of some of these
interdependencies and proposes a general framework called FLARE, that
combines inductive learning using prior knowledge together with
reasoning in a propositional setting. Several examples that test the
framework are presented, including classical induction, many important
reasoning protocols and two simple expert systems.
Woods, K., Cook, D., Hall, L., Bowyer, K. and Stark, L. (1995)
"Learning Membership Functions in a Function-Based Object Recognition
System", Volume 3, pages 187-222.
PostScript: volume3/woods95a.ps (1.4M)
compressed, volume3/woods95a.ps.Z (463K)
HTML: http://seraphim.csee.usf.edu/omlet/omlet-JAIR.html
PDF: volume3/woods95a.pdf (544K)
Abstract: Functionality-based recognition systems recognize objects at
the category level by reasoning about how well the objects support the
expected function. Such systems naturally associate a ``measure of
goodness'' or ``membership value'' with a recognized object. This
measure of goodness is the result of combining individual measures, or
membership values, from potentially many primitive evaluations of
different properties of the object's shape. A membership function is
used to compute the membership value when evaluating a primitive of a
particular physical property of an object. In previous versions of a
recognition system known as Gruff, the membership function for each of
the primitive evaluations was hand-crafted by the system designer. In
this paper, we provide a learning component for the Gruff system,
called Omlet, that automatically learns membership functions given a
set of example objects labeled with their desired category measure.
The learning algorithm is generally applicable to any problem in which
low-level membership values are combined through an and-or tree
structure to give a final overall membership value.
Pinkas, G. and Dechter, R. (1995)
"Improving Connectionist Energy Minimization", Volume 3, pages 223-248.
PostScript: volume3/pinkas95a.ps (358K)
compressed, volume3/pinkas95a.ps.Z (138K)
PDF: volume3/pinkas95a.pdf (293K)
Abstract: Symmetric networks designed for energy minimization such as
Boltzman machines and Hopfield nets are frequently investigated for
use in optimization, constraint satisfaction and approximation of
NP-hard problems. Nevertheless, finding a global solution (i.e., a
global minimum for the energy function) is not guaranteed and even a
local solution may take an exponential number of steps. We propose an
improvement to the standard local activation function used for such
networks. The improved algorithm guarantees that a global minimum is
found in linear time for tree-like subnetworks. The algorithm, called
activate, is uniform and does not assume that the network is
tree-like. It can identify tree-like subnetworks even in cyclic
topologies (arbitrary networks) and avoid local minima along these
trees. For acyclic networks, the algorithm is guaranteed to converge
to a global minimum from any initial state of the system
(self-stabilization) and remains correct under various types of
schedulers. On the negative side, we show that in the presence of
cycles, no uniform algorithm exists that guarantees optimality even
under a sequential asynchronous scheduler. An asynchronous scheduler
can activate only one unit at a time while a synchronous scheduler can
activate any number of units in a single time step. In addition, no
uniform algorithm exists to optimize even acyclic networks when the
scheduler is synchronous. Finally, we show how the algorithm can be
improved using the cycle-cutset scheme. The general algorithm, called
activate-with-cutset, improves over activate and has some performance
guarantees that are related to the size of the network's cycle-cutset.
Bengio, Y. and Frasconi, P. (1995)
"Diffusion of Context and Credit Information in Markovian Models",
Volume 3, pages 249-270.
PostScript: volume3/bengio95a.ps (397K)
compressed, volume3/bengio95a.ps.Z (158K)
PDF: volume3/bengio95a.pdf (1.2M)
Abstract: This paper studies the problem of ergodicity of transition
probability matrices in Markovian models, such as hidden Markov models
(HMMs), and how it makes very difficult the task of learning to
represent long-term context for sequential data. This phenomenon
hurts the forward propagation of long-term context information, as
well as learning a hidden state representation to represent long-term
context, which depends on propagating credit information backwards in
time. Using results from Markov chain theory, we show that this
problem of diffusion of context and credit is reduced when the
transition probabilities approach 0 or 1, i.e., the transition
probability matrices are sparse and the model essentially
deterministic. The results found in this paper apply to learning
approaches based on continuous optimization, such as gradient descent
and the Baum-Welch algorithm.
Huffman, S.B. and Laird, J.E. (1995)
"Flexibly Instructable Agents", Volume 3, pages 271-324.
PostScript: volume3/huffman95a.ps (1599K)
compressed, volume3/huffman95a.ps.Z (476K)
PDF: volume3/huffman95a.pdf (539K)
Abstract: This paper presents an approach to learning from situated,
interactive tutorial instruction within an ongoing agent. Tutorial
instruction is a flexible (and thus powerful) paradigm for teaching
tasks because it allows an instructor to communicate whatever types of
knowledge an agent might need in whatever situations might arise. To
support this flexibility, however, the agent must be able to learn
multiple kinds of knowledge from a broad range of instructional
interactions. Our approach, called situated explanation, achieves
such learning through a combination of analytic and inductive
techniques. It combines a form of explanation-based learning that is
situated for each instruction with a full suite of contextually guided
responses to incomplete explanations. The approach is implemented in
an agent called Instructo-Soar that learns hierarchies of new tasks
and other domain knowledge from interactive natural language
instructions. Instructo-Soar meets three key requirements of flexible
instructability that distinguish it from previous systems: (1) it can
take known or unknown commands at any instruction point; (2) it can
handle instructions that apply to either its current situation or to a
hypothetical situation specified in language (as in, for instance,
conditional instructions); and (3) it can learn, from instructions,
each class of knowledge it uses to perform tasks.
Broggi, A. and Berte, S. (1995)
"Vision-Based Road Detection in Automotive Systems: A Real-Time
Expectation-Driven Approach", Volume 3, pages 325-348.
PostScript: volume3/broggi95a.ps (1762K)
compressed, volume3/broggi95a.ps.Z (620K)
PDF: volume3/broggi95a.pdf (430K)
Abstract: The main aim of this work is the development of a
vision-based road detection system fast enough to cope with the
difficult real-time constraints imposed by moving vehicle
applications. The hardware platform, a special-purpose massively
parallel system, has been chosen to minimize system production and
operational costs.
This paper presents a novel approach to expectation-driven low-level
image segmentation, which can be mapped naturally onto mesh-connected
massively parallel SIMD architectures capable of handling hierarchical
data structures. The input image is assumed to contain a distorted
version of a given template; a multiresolution stretching process is
used to reshape the original template in accordance with the acquired
image content, minimizing a potential function. The distorted
template is the process output.
Khardon, R. (1995)
"Translating between Horn Representations and their Characteristic Models",
Volume 3, pages 349-372.
PostScript: volume3/khardon95a.ps (284K)
compressed, volume3/khardon95a.ps.Z (114K)
PDF: volume3/khardon95a.pdf (1.0M)
Abstract: Characteristic models are an alternative, model based,
representation for Horn expressions. It has been shown that these two
representations are incomparable and each has its advantages over the
other. It is therefore natural to ask what is the cost of translating,
back and forth, between these representations. Interestingly, the same
translation questions arise in database theory, where it has
applications to the design of relational databases. This paper studies
the computational complexity of these problems.
Our main result is that the two translation problems are equivalent under
polynomial reductions, and that they are equivalent to the corresponding
decision problem. Namely, translating is equivalent to deciding whether a
given set of models is the set of characteristic models for a given Horn
expression.
We also relate these problems to the hypergraph transversal problem, a
well known problem which is related to other applications in AI and for
which no polynomial time algorithm is known. It is shown that in general
our translation problems are at least as hard as the hypergraph
transversal problem, and in a special case they are equivalent to it.
Buro, M. (1995)
"Statistical Feature Combination for the Evaluation of Game Positions",
Volume 3, pages 373-382.
PostScript: volume3/buro95a.ps (178K)
compressed, volume3/buro95a.ps.Z (80K)
PDF: volume3/buro95a.pdf (240K)
Abstract: This article describes an application of three well-known
statistical methods in the field of game-tree search: using a large
number of classified Othello positions, feature weights for evaluation
functions with a game-phase-independent meaning are estimated by means
of logistic regression, Fisher's linear discriminant, and the
quadratic discriminant function for normally distributed
features. Thereafter, the playing strengths are compared by means of
tournaments between the resulting versions of a world-class Othello
program. In this application, logistic regression - which is used here
for the first time in the context of game playing - leads to better
results than the other approaches.
Weiss, S.M. and Indurkhya, N. (1995)
"Rule-based Machine Learning Methods for Functional Prediction",
Volume 3, pages 383-403.
PostScript: volume3/weiss95a.ps (527K)
compressed, volume3/weiss95a.ps.Z (166K)
PDF: volume3/weiss95a.pdf (304K)
Abstract: We describe a machine learning method for predicting the
value of a real-valued function, given the values of multiple input
variables. The method induces solutions from samples in the form of
ordered disjunctive normal form (DNF) decision rules. A central
objective of the method and representation is the induction of
compact, easily interpretable solutions. This rule-based decision
model can be extended to search efficiently for similar cases prior to
approximating function values. Experimental results on real-world data
demonstrate that the new techniques are competitive with existing
machine learning and statistical methods and can sometimes yield
superior regression performance.
Heckerman, D. and Shachter, R. (1995)
"Decision-Theoretic Foundations for Causal Reasoning",
Volume 3, pages 405-430.
PostScript: volume3/heckerman95a.ps (327K)
compressed, volume3/heckerman95a.ps.Z (136K)
PDF: volume3/heckerman95a.pdf (307K)
Abstract: We present a definition of cause and effect in terms of
decision-theoretic primitives and thereby provide a principled
foundation for causal reasoning. Our definition departs from the
traditional view of causation in that causal assertions may vary with
the set of decisions available. We argue that this approach provides
added clarity to the notion of cause. Also in this paper, we examine
the encoding of causal relationships in directed acyclic graphs. We
describe a special class of influence diagrams, those in canonical
form, and show its relationship to Pearl's representation of cause and
effect. Finally, we show how canonical form facilitates
counterfactual reasoning.
Webb, G.I. (1995)
"OPUS: An Efficient Admissible Algorithm for Unordered Search",
Volume 3, pages 431-465.
PostScript: volume3/webb95a.ps (1503K)
compressed, volume3/webb95a.ps.Z (769K)
HTML: http://www.cs.washington.edu/research/jair/volume4/webb96a-html/webb96a.html
PDF: volume3/webb95a.pdf (373K)
Abstract: OPUS is a branch and bound search algorithm that enables
efficient admissible search through spaces for which the order of
search operator application is not significant. The algorithm's
search efficiency is demonstrated with respect to very large machine
learning search spaces. The use of admissible search is of potential
value to the machine learning community as it means that the exact
learning biases to be employed for complex learning tasks can be
precisely specified and manipulated. OPUS also has potential for
application in other areas of artificial intelligence, notably, truth
maintenance.
Idestam-Almquist, P. (1995)
"Generalization of Clauses under Implication",
Volume 3, pages 467-489.
PostScript: volume3/idestam95a.ps (273K)
compressed, volume3/idestam95a.ps.Z (101K)
PDF: volume3/idestam95a.pdf (1.0M)
Abstract: In the area of inductive learning, generalization is a main
operation, and the usual definition of induction is based on logical
implication. Recently there has been a rising interest in clausal
representation of knowledge in machine learning. Almost all inductive
learning systems that perform generalization of clauses use the
relation theta-subsumption instead of implication. The main reason is
that there is a well-known and simple technique to compute least
general generalizations under theta-subsumption, but not under
implication. However generalization under theta-subsumption is
inappropriate for learning recursive clauses, which is a crucial
problem since recursion is the basic program structure of logic
programs.
We note that implication between clauses is undecidable, and we
therefore introduce a stronger form of implication, called
T-implication, which is decidable between clauses. We show that for
every finite set of clauses there exists a least general
generalization under T-implication. We describe a technique to reduce
generalizations under implication of a clause to generalizations under
theta-subsumption of what we call an expansion of the original
clause. Moreover we show that for every non-tautological clause there
exists a T-complete expansion, which means that every generalization
under T-implication of the clause is reduced to a generalization under
theta-subsumption of the expansion.
Volume4
van Beek, P. and Manchak, D.W. (1996)
"The Design and Experimental Analysis of Algorithms for Temporal Reasoning",
Volume 4, pages 1-18.
PostScript: volume4/vanbeek96a.ps (219K)
compressed, volume4/vanbeek96a.ps.Z (90K)
HTML: http://www.cs.washington.edu/research/jair/volume4/vanbeek96a-html/paper.html
PDF: volume4/vanbeek96a.pdf (228K)
Abstract: Many applications -- from planning and scheduling to
problems in molecular biology -- rely heavily on a temporal reasoning
component. In this paper, we discuss the design and empirical
analysis of algorithms for a temporal reasoning system based on
Allen's influential interval-based framework for representing temporal
information. At the core of the system are algorithms for determining
whether the temporal information is consistent, and, if so, finding
one or more scenarios that are consistent with the temporal
information. Two important algorithms for these tasks are a path
consistency algorithm and a backtracking algorithm. For the path
consistency algorithm, we develop techniques that can result in up to
a ten-fold speedup over an already highly optimized implementation.
For the backtracking algorithm, we develop variable and value ordering
heuristics that are shown empirically to dramatically improve the
performance of the algorithm. As well, we show that a previously
suggested reformulation of the backtracking search problem can reduce
the time and space requirements of the backtracking search. Taken
together, the techniques we develop allow a temporal reasoning
component to solve problems that are of practical size.
Brewka, G. (1996)
"Well-Founded Semantics for Extended Logic Programs with Dynamic
Preferences", Volume 4, pages 19-36.
PostScript: volume4/brewka96a.ps (464K)
compressed, volume4/brewka96a.ps.Z (135K)
PDF: volume4/brewka96a.pdf (776K)
HTML: http://www.cs.washington.edu/research/jair/volume4/brewka96a-html/LPR.html
Abstract: The paper describes an extension of well-founded
semantics for logic programs with two types of negation. In this
extension information about preferences between rules can be expressed
in the logical language and derived dynamically. This is achieved by
using a reserved predicate symbol and a naming technique. Conflicts
among rules are resolved whenever possible on the basis of derived
preference information. The well-founded conclusions of prioritized
logic programs can be computed in polynomial time. A legal reasoning
example illustrates the usefulness of the approach.
Delcher, A.L., Grove, A.J., Kasif, S. and Pearl, J. (1996)
"Logarithmic-Time Updates and Queries in Probabilistic Networks",
Volume 4, pages 37-59.
PostScript: volume4/delcher96a.ps (277K)
compressed, volume4/delcher96a.ps.Z (107K)
PDF: volume4/delcher96a.pdf (309K)
Abstract: Traditional databases commonly support efficient query and
update procedures that operate in time which is sublinear in the size
of the database. Our goal in this paper is to take a first step
toward dynamic reasoning in probabilistic databases with comparable
efficiency. We propose a dynamic data structure that supports
efficient algorithms for updating and querying singly connected
Bayesian networks. In the conventional algorithm, new evidence is
absorbed in O(1) time and queries are processed in time O(N), where N
is the size of the network. We propose an algorithm which, after a
preprocessing phase, allows us to answer queries in time O(log N) at
the expense of O(log N) time per evidence absorption. The usefulness
of sub-linear processing time manifests itself in applications
requiring (near) real-time response over large probabilistic
databases. We briefly discuss a potential application of dynamic
probabilistic reasoning in computational biology.
Saul, L.K., Jaakkola, T. and Jordan, M.I. (1996)
"Mean Field Theory for Sigmoid Belief Networks",
Volume 4, pages 61-76.
PostScript: volume4/saul96a.ps (302K)
compressed, volume4/saul96a.ps.Z (123K)
PDF: volume4/saul96a.pdf (260K)
Abstract: We develop a mean field theory for sigmoid belief networks
based on ideas from statistical mechanics. Our mean field theory
provides a tractable approximation to the true probability
distribution in these networks; it also yields a lower bound on the
likelihood of evidence. We demonstrate the utility of this framework
on a benchmark problem in statistical pattern recognition---the
classification of handwritten digits.
Quinlan, J.R. (1996)
"Improved Use of Continuous Attributes in C4.5",
Volume 4, pages 77-90.
PostScript: volume4/quinlan96a.ps (414K)
compressed, volume4/quinlan96a.ps.Z (124K)
PDF: volume4/quinlan96a.pdf (259K)
Abstract: A reported weakness of C4.5 in domains with continuous
attributes is addressed by modifying the formation and evaluation of
tests on continuous attributes. An MDL-inspired penalty is applied to
such tests, eliminating some of them from consideration and altering
the relative desirability of all tests. Empirical trials show that
the modifications lead to smaller decision trees with higher
predictive accuracies. Results also confirm that a new version of
C4.5 incorporating these changes is superior to recent approaches that
use global discretization and that construct small trees with
multi-interval splits.
Hogg, T. (1996)
"Quantum Computing and Phase Transitions in Combinatorial Search",
Volume 4, pages 91-128.
PostScript: volume4/hogg96a.ps (565K)
compressed, volume4/hogg96a.ps.Z (230K)
Online Appendix: volume4/hogg96a-appendix.txt (32K) contains matrix values
HTML: http://www.cs.washington.edu/research/jair/volume4/hogg96a-html/quantumHTML.html
PDF: volume4/hogg96a.pdf (385K)
Abstract: We introduce an algorithm for combinatorial search on
quantum computers that is capable of significantly concentrating
amplitude into solutions for some NP search problems, on average. This
is done by exploiting the same aspects of problem structure as used by
classical backtrack methods to avoid unproductive search choices. This
quantum algorithm is much more likely to find solutions than the
simple direct use of quantum parallelism. Furthermore, empirical
evaluation on small problems shows this quantum algorithm displays the
same phase transition behavior, and at the same location, as seen in
many previously studied classical search methods. Specifically,
difficult problem instances are concentrated near the abrupt change
from underconstrained to overconstrained problems.
Cohn, D.A., Ghahramani, Z., and Jordan, M.I. (1996)
"Active Learning with Statistical Models",
Volume 4, pages 129-145.
PostScript: volume4/cohn96a.ps (325K)
compressed, volume4/cohn96a.ps.Z (121K)
HTML: http://www.cs.washington.edu/research/jair/volume4/cohn96a-html/statmodels.html
PDF: volume4/cohn96a.pdf (269K)
Abstract: For many types of machine learning algorithms, one can
compute the statistically `optimal' way to select training data. In
this paper, we review how optimal data selection techniques have been
used with feedforward neural networks. We then show how the same
principles may be used to select data for two alternative,
statistically-based learning architectures: mixtures of Gaussians and
locally weighted regression. While the techniques for neural networks
are computationally expensive and approximate, the techniques for
mixtures of Gaussians and locally weighted regression are both
efficient and accurate. Empirically, we observe that the optimality
criterion sharply decreases the number of training examples the
learner needs in order to achieve good performance.
Fisher, D. (1996)
"Iterative Optimization and Simplification of Hierarchical Clusterings",
Volume 4, pages 147-178.
PostScript: volume4/fisher96a.ps (286K)
compressed, volume4/fisher96a.ps.Z (130K)
HTML: http://www.cs.washington.edu/research/jair/volume4/fisher96a-html/html-final.html
PDF: volume4/fisher96a.pdf (347K)
Abstract: Clustering is often used for discovering structure in data.
Clustering systems differ in the objective function used to evaluate
clustering quality and the control strategy used to search the space
of clusterings. Ideally, the search strategy should consistently
construct clusterings of high quality, but be computationally
inexpensive as well. In general, we cannot have it both ways, but we
can partition the search so that a system inexpensively constructs a
`tentative' clustering for initial examination, followed by iterative
optimization, which continues to search in background for improved
clusterings. Given this motivation, we evaluate an inexpensive
strategy for creating initial clusterings, coupled with several
control strategies for iterative optimization, each of which
repeatedly modifies an initial clustering in search of a better
one. One of these methods appears novel as an iterative optimization
strategy in clustering contexts. Once a clustering has been
constructed it is judged by analysts -- often according to
task-specific criteria. Several authors have abstracted these criteria
and posited a generic performance task akin to pattern completion,
where the error rate over completed patterns is used to `externally'
judge clustering utility. Given this performance task, we adapt
resampling-based pruning strategies used by supervised learning
systems to the task of simplifying hierarchical clusterings, thus
promising to ease post-clustering analysis. Finally, we propose a
number of objective functions, based on attribute-selection measures
for decision-tree induction, that might perform well on the error rate
and simplicity dimensions.
Marchiori, E. (1996)
"Practical Methods for Proving Termination of General Logic Programs",
Volume 4, pages 179-208.
PostScript: volume4/marchiori96a.ps (331K)
compressed, volume4/marchiori96a.ps.Z (154K)
PDF: volume4/marchiori96a.pdf (1.3M)
Abstract: Termination of logic programs with negated body atoms (here
called general logic programs) is an important topic. One reason is
that many computational mechanisms used to process negated atoms, like
Clark's negation as failure and Chan's constructive negation, are
based on termination conditions. This paper introduces a methodology
for proving termination of general logic programs w.r.t. the Prolog
selection rule. The idea is to distinguish parts of the program
depending on whether or not their termination depends on the selection
rule. To this end, the notions of low-, weakly up-, and up-acceptable
program are introduced. We use these notions to develop a methodology
for proving termination of general logic programs, and show how
interesting problems in non-monotonic reasoning can be formalized and
implemented by means of terminating general logic programs.
Walsh, T. (1996)
"A Divergence Critic for Inductive Proof",
Volume 4, pages 209-235.
PostScript: volume4/walsh96a.ps (242K)
compressed, volume4/walsh96a.ps.Z (96K)
HTML: http://www.cs.washington.edu/research/jair/volume4/walsh96a-html/Final.html
PDF: volume4/walsh96a.pdf (282K)
Abstract: Inductive theorem provers often diverge. This paper
describes a simple critic, a computer program which monitors the
construction of inductive proofs attempting to identify diverging
proof attempts. Divergence is recognized by means of a ``difference
matching'' procedure. The critic then proposes lemmas and
generalizations which ``ripple'' these differences away so that the
proof can go through without divergence. The critic enables the
theorem prover Spike to prove many theorems completely automatically
from the definitions alone.
Kaelbling, L.P., Littman, M.L., and Moore, A.W. (1996)
"Reinforcement Learning: A Survey",
Volume 4, pages 237-285.
PostScript: volume4/kaelbling96a.ps (903K)
compressed, volume4/kaelbling96a.ps.Z (362K)
HTML: http://www.cs.washington.edu/research/jair/volume4/kaelbling96a-html/rl-survey.html
PDF: volume4/kaelbling96a.pdf (523K)
Abstract: This paper surveys the field of reinforcement learning from
a computer-science perspective. It is written to be accessible to
researchers familiar with machine learning. Both the historical basis
of the field and a broad selection of current work are summarized.
Reinforcement learning is the problem faced by an agent that learns
behavior through trial-and-error interactions with a dynamic
environment. The work described here has a resemblance to work in
psychology, but differs considerably in the details and in the use of
the word ``reinforcement.'' The paper discusses central issues of
reinforcement learning, including trading off exploration and
exploitation, establishing the foundations of the field via Markov
decision theory, learning from delayed reinforcement, constructing
empirical models to accelerate learning, making use of generalization
and hierarchy, and coping with hidden state. It concludes with a
survey of some implemented systems and an assessment of the practical
utility of current methods for reinforcement learning.
Pryor, L. and Collins, G. (1996)
"Planning for Contingencies: A Decision-based Approach",
Volume 4, pages 287-339.
PostScript: volume4/pryor96a.ps (544K)
compressed, volume4/pryor96a.ps.Z (239K)
HTML: http://www.cs.washington.edu/research/jair/volume4/pryor96a-html/final-jair.html
PDF: volume4/pryor96a.pdf (509K)
Abstract: A fundamental assumption made by classical AI planners is
that there is no uncertainty in the world: the planner has full
knowledge of the conditions under which the plan will be executed and
the outcome of every action is fully predictable. These planners
cannot therefore construct contingency plans, i.e., plans in which
different actions are performed in different circumstances. In this
paper we discuss some issues that arise in the representation and
construction of contingency plans and describe Cassandra, a
partial-order contingency planner. Cassandra uses explicit
decision-steps that enable the agent executing the plan to decide
which plan branch to follow. The decision-steps in a plan result in
subgoals to acquire knowledge, which are planned for in the same way
as any other subgoals. Cassandra thus distinguishes the process of
gathering information from the process of making decisions. The
explicit representation of decisions in Cassandra allows a coherent
approach to the problems of contingent planning, and provides a solid
base for extensions such as the use of different decision-making
procedures.
Nienhuys-Cheng, S.-H. and de Wolf, R. (1996)
"Least Generalizations and Greatest Specializations of Sets of Clauses",
Volume 4, pages 341-363.
PostScript: volume4/cheng96a.ps (281K)
compressed, volume4/cheng96a.ps.Z (109K)
PDF: volume4/cheng96a.pdf (1.1M)
Abstract: The main operations in Inductive Logic Programming (ILP) are
generalization and specialization, which only make sense in a
generality order. In ILP, the three most important generality orders
are subsumption, implication and implication relative to background
knowledge. The two languages used most often are languages of clauses
and languages of only Horn clauses. This gives a total of six
different ordered languages. In this paper, we give a systematic
treatment of the existence or non-existence of least generalizations
and greatest specializations of finite sets of clauses in each of
these six ordered sets. We survey results already obtained by others
and also contribute some answers of our own.
Our main new results are, firstly, the existence of a computable least
generalization under implication of every finite set of clauses
containing at least one non-tautologous function-free clause (among
other, not necessarily function-free clauses). Secondly, we show that
such a least generalization need not exist under relative implication,
not even if both the set that is to be generalized and the background
knowledge are function-free. Thirdly, we give a complete discussion
of existence and non-existence of greatest specializations in each of
the six ordered languages.
Gratch, J. and Chien, S. (1996)
"Adaptive Problem-solving for Large-scale Scheduling Problems: A Case Study",
Volume 4, pages 365-396.
PostScript: volume4/gratch96a.ps (595K)
compressed, volume4/gratch96a.ps.Z (180K)
PDF: volume4/gratch96a.pdf (172K)
Abstract: Although most scheduling problems are NP-hard, domain
specific techniques perform well in practice but are quite expensive
to construct. In adaptive problem-solving solving, domain specific
knowledge is acquired automatically for a general problem solver with
a flexible control architecture. In this approach, a learning system
explores a space of possible heuristic methods for one well-suited to
the eccentricities of the given domain and problem distribution. In
this article, we discuss an application of the approach to scheduling
satellite communications. Using problem distributions based on actual
mission requirements, our approach identifies strategies that not only
decrease the amount of CPU time required to produce schedules, but
also increase the percentage of problems that are solvable within
computational resource limitations.
Webb, G.I. (1996)
"Further Experimental Evidence against the Utility of Occam's Razor",
Volume 4, pages 397-417.
PostScript: volume4/webb96a.ps (217K)
compressed, volume4/webb96a.ps.Z (89K)
Online Appendix: volume4/webb96a-appendix.tar (41K) source code
PDF: volume4/webb96a.pdf (250K)
Abstract: This paper presents new experimental evidence against the
utility of Occam's razor. A~systematic procedure is presented for
post-processing decision trees produced by C4.5. This procedure was
derived by rejecting Occam's razor and instead attending to the
assumption that similar objects are likely to belong to the same
class. It increases a decision tree's complexity without altering the
performance of that tree on the training data from which it is
inferred. The resulting more complex decision trees are demonstrated
to have, on average, for a variety of common learning tasks, higher
predictive accuracy than the less complex original decision trees.
This result raises considerable doubt about the utility of Occam's
razor as it is commonly applied in modern machine learning.
Bhansali, S., Kramer, G.A. and Hoar, T.J. (1996)
"A Principled Approach Towards Symbolic Geometric Constraint Satisfaction",
Volume 4, pages 419-443.
PostScript: volume4/bhansali96a.ps (933K)
compressed, volume4/bhansali96a.ps.Z (325K)
Online Appendix: volume4/bhansali96a-appendix.txt (7K) example plan fragment
HTML: http://www.cs.washington.edu/research/jair/volume4/bhansali96a-html/paper.htm
PDF: volume4/bhansali96a.pdf (100K)
Abstract: An important problem in geometric reasoning is to find the
configuration of a collection of geometric bodies so as to satisfy a
set of given constraints. Recently, it has been suggested that this
problem can be solved efficiently by symbolically reasoning about
geometry. This approach, called degrees of freedom analysis, employs a
set of specialized routines called plan fragments that specify how to
change the configuration of a set of bodies to satisfy a new
constraint while preserving existing constraints. A potential
drawback, which limits the scalability of this approach, is concerned
with the difficulty of writing plan fragments. In this paper we
address this limitation by showing how these plan fragments can be
automatically synthesized using first principles about geometric
bodies, actions, and topology.
Tadepalli, P. and Natarajan, B.K. (1996)
"A Formal Framework for Speedup Learning from Problems and Solutions",
Volume 4, pages 445-475.
PostScript: volume4/tadepalli96a.ps (333K)
compressed, volume4/tadepalli96a.ps.Z (132K)
PDF: volume4/tadepalli96a.pdf (1.2M)
Abstract: Speedup learning seeks to improve the computational
efficiency of problem solving with experience. In this paper, we
develop a formal framework for learning efficient problem solving from
random problems and their solutions. We apply this framework to two
different representations of learned knowledge, namely control rules
and macro-operators, and prove theorems that identify sufficient
conditions for learning in each representation. Our proofs are
constructive in that they are accompanied with learning algorithms.
Our framework captures both empirical and explanation-based
speedup learning in a unified fashion. We illustrate our framework
with implementations in two domains: symbolic integration and Eight
Puzzle. This work integrates many strands of experimental and
theoretical work in machine learning, including empirical learning of
control rules, macro-operator learning, Explanation-Based Learning
(EBL), and Probably Approximately Correct (PAC) Learning.
Brafman, R.I. and Tennenholtz, M. (1996)
"On Partially Controlled Multi-Agent Systems",
Volume 4, pages 477-507.
PostScript: volume4/brafman96a.ps (356K)
compressed, volume4/brafman96a.ps.Z (146K)
PDF: volume4/brafman96a.pdf (349K)
Abstract: Motivated by the control theoretic distinction between
controllable and uncontrollable events, we distinguish between two
types of agents within a multi-agent system: controllable agents,
which are directly controlled by the system's designer, and
uncontrollable agents, which are not under the designer's direct
control. We refer to such systems as partially controlled multi-agent
systems, and we investigate how one might influence the behavior of
the uncontrolled agents through appropriate design of the controlled
agents. In particular, we wish to understand which problems are
naturally described in these terms, what methods can be applied to
influence the uncontrollable agents, the effectiveness of such
methods, and whether similar methods work across different
domains. Using a game-theoretic framework, this paper studies the
design of partially controlled multi-agent systems in two contexts: in
one context, the uncontrollable agents are expected utility
maximizers, while in the other they are reinforcement learners. We
suggest different techniques for controlling agents' behavior in each
domain, assess their success, and examine their relationship.
Volume5
Yip, K. and Zhao, F. (1996)
"Spatial Aggregation: Theory and Applications",
Volume 5, pages 1-26.
PostScript: volume5/yip96a.ps (828K)
compressed, volume5/yip96a.ps.Z (248K)
PDF: volume5/yip96a.pdf (391K)
Abstract: Visual thinking plays an important role in scientific
reasoning. Based on the research in automating diverse reasoning
tasks about dynamical systems, nonlinear controllers, kinematic
mechanisms, and fluid motion, we have identified a style of visual
thinking, imagistic reasoning. Imagistic reasoning organizes
computations around image-like, analogue representations so that
perceptual and symbolic operations can be brought to bear to infer
structure and behavior. Programs incorporating imagistic reasoning
have been shown to perform at an expert level in domains that defy
current analytic or numerical methods.
We have developed a computational paradigm, spatial aggregation, to
unify the description of a class of imagistic problem solvers. A
program written in this paradigm has the following properties. It
takes a continuous field and optional objective functions as input,
and produces high-level descriptions of structure, behavior, or
control actions. It computes a multi-layer of intermediate
representations, called spatial aggregates, by forming equivalence
classes and adjacency relations. It employs a small set of generic
operators such as aggregation, classification, and localization to
perform bidirectional mapping between the information-rich field and
successively more abstract spatial aggregates. It uses a data
structure, the neighborhood graph, as a common interface to modularize
computations. To illustrate our theory, we describe the computational
structure of three implemented problem solvers -- KAM, MAPS, and
HIPAIR --- in terms of the spatial aggregation generic operators by
mixing and matching a library of commonly used routines.
Ben-Eliyahu, R. (1996)
"A Hierarchy of Tractable Subsets for Computing Stable Models",
Volume 5, pages 27-52.
PostScript: volume5/ben-eliyahu96a.ps (317K)
compressed, volume5/ben-eliyahu96a.ps.Z (126K)
Abstract: Finding the stable models of a knowledge base is a
significant computational problem in artificial intelligence. This
task is at the computational heart of truth maintenance systems,
autoepistemic logic, and default logic. Unfortunately, it is NP-hard.
In this paper we present a hierarchy of classes of knowledge bases,
Omega_1,Omega_2,..., with the following properties: first, Omega_1 is
the class of all stratified knowledge bases; second, if a knowledge
base Pi is in Omega_k, then Pi has at most k stable models, and all of
them may be found in time O(lnk), where l is the length of the
knowledge base and n the number of atoms in Pi; third, for an
arbitrary knowledge base Pi, we can find the minimum k such that Pi
belongs to Omega_k in time polynomial in the size of Pi; and, last,
where K is the class of all knowledge bases, it is the case that
union{i=1 to infty} Omega_i = K, that is, every knowledge base
belongs to some class in the hierarchy.
Litman, D.J. (1996)
"Cue Phrase Classification Using Machine Learning",
Volume 5, pages 53-94.
PostScript: volume5/litman96a.ps (347K)
compressed, volume5/litman96a.ps.Z (135K)
PDF: volume5/litman96a.pdf (379K)
Abstract: Cue phrases may be used in a discourse sense to explicitly
signal discourse structure, but also in a sentential sense to convey
semantic rather than structural information. Correctly classifying
cue phrases as discourse or sentential is critical in natural language
processing systems that exploit discourse structure, e.g., for
performing tasks such as anaphora resolution and plan recognition.
This paper explores the use of machine learning for classifying cue
phrases as discourse or sentential. Two machine learning programs
(Cgrendel and C4.5) are used to induce classification models from sets
of pre-classified cue phrases and their features in text and speech.
Machine learning is shown to be an effective technique for not only
automating the generation of classification models, but also for
improving upon previous results. When compared to manually derived
classification models already in the literature, the learned models
often perform with higher accuracy and contain new linguistic insights
into the data. In addition, the ability to automatically construct
classification models makes it easier to comparatively analyze the
utility of alternative feature representations of the data. Finally,
the ease of retraining makes the learning approach more scalable and
flexible than manual methods.
Gerevini, A. and Schubert, L. (1996)
"Accelerating Partial-Order Planners: Some Techniques for Effective
Search Control and Pruning", Volume 5, pages 95-137.
PostScript: volume5/gerevini96a.ps (401K)
compressed, volume5/gerevini96a.ps.Z (164K)
Online Appendices: volume5/gerevini96a-appendix1 (68K), source code
volume5/gerevini96a-appendix2 (47K), domain description
PDF: volume5/gerevini96a.pdf (432K)
Abstract: We propose some domain-independent techniques for bringing
well-founded partial-order planners closer to practicality. The first
two techniques are aimed at improving search control while keeping
overhead costs low. One is based on a simple adjustment to the
default A* heuristic used by UCPOP to select plans for refinement. The
other is based on preferring ``zero commitment'' (forced) plan
refinements whenever possible, and using LIFO prioritization
otherwise. A more radical technique is the use of operator parameter
domains to prune search. These domains are initially computed from the
definitions of the operators and the initial and goal conditions,
using a polynomial-time algorithm that propagates sets of constants
through the operator graph, starting in the initial conditions.
During planning, parameter domains can be used to prune nonviable
operator instances and to remove spurious clobbering threats. In
experiments based on modifications of UCPOP, our improved plan and
goal selection strategies gave speedups by factors ranging from 5 to
more than 1000 for a variety of problems that are nontrivial for the
unmodified version. Crucially, the hardest problems gave the greatest
improvements. The pruning technique based on parameter domains often
gave speedups by an order of magnitude or more for difficult problems,
both with the default UCPOP search strategy and with our improved
strategy. The Lisp code for our techniques and for the test problems
is provided in on-line appendices.
Quinlan, J.R. (1996)
"Learning First-Order Definitions of Functions",
Volume 5, pages 139-161.
PostScript: volume5/quinlan96b.ps (528K)
compressed, volume5/quinlan96b.ps.Z (156K)
PDF: volume5/quinlan96b.pdf (337K)
Abstract: First-order learning involves finding a clause-form
definition of a relation from examples of the relation and relevant
background information. In this paper, a particular first-order
learning system is modified to customize it for finding definitions of
functional relations. This restriction leads to faster learning times
and, in some cases, to definitions that have higher predictive
accuracy. Other first-order learning systems might benefit from
similar specialization.
Zlotkin, G. and Rosenschein, J.S. (1996)
"Mechanisms for Automated Negotiation in State Oriented Domains",
Volume 5, pages 163-238.
PostScript: volume5/zlotkin96a.ps (946K)
compressed, volume5/zlotkin96a.ps.Z (374K)
PDF: volume5/zlotkin96a.pdf (691K)
Abstract: This paper lays part of the groundwork for a domain theory of
negotiation, that is, a way of classifying interactions so that it is
clear, given a domain, which negotiation mechanisms and strategies are
appropriate. We define State Oriented Domains, a general category of
interaction. Necessary and sufficient conditions for cooperation are
outlined. We use the notion of worth in an altered definition of
utility, thus enabling agreements in a wider class of joint-goal
reachable situations. An approach is offered for conflict resolution,
and it is shown that even in a conflict situation, partial cooperative
steps can be taken by interacting agents (that is, agents in
fundamental conflict might still agree to cooperate up to a certain
point).
A Unified Negotiation Protocol (UNP) is developed that can be used in
all types of encounters. It is shown that in certain borderline
cooperative situations, a partial cooperative agreement (i.e., one
that does not achieve all agents' goals) might be preferred by all
agents, even though there exists a rational agreement that would
achieve all their goals.
Finally, we analyze cases where agents have incomplete information on
the goals and worth of other agents. First we consider the case where
agents' goals are private information, and we analyze what goal
declaration strategies the agents might adopt to increase their
utility. Then, we consider the situation where the agents' goals (and
therefore stand-alone costs) are common knowledge, but the worth they
attach to their goals is private information. We introduce two
mechanisms, one 'strict', the other 'tolerant', and analyze their
affects on the stability and efficiency of negotiation outcomes.
Helzerman, R.A and Harper, M.P. (1996)
"MUSE CSP: An Extension to the Constraint Satisfaction Problem",
Volume 5, pages 239-288.
PostScript: volume5/helzerman96a.ps (2.1M)
compressed, volume5/helzerman96a.ps.Z (439K)
HTML: http://www.cs.washington.edu/research/jair/volume5/helzerman96a-html/muse-csp.html
PDF: volume5/helzerman96a.pdf (555K)
Abstract: This paper describes an extension to the constraint
satisfaction problem (CSP) called MUSE CSP (MUltiply SEgmented
Constraint Satisfaction Problem). This extension is especially useful
for those problems which segment into multiple sets of partially
shared variables. Such problems arise naturally in signal processing
applications including computer vision, speech processing, and
handwriting recognition. For these applications, it is often
difficult to segment the data in only one way given the low-level
information utilized by the segmentation algorithms. MUSE CSP can be
used to compactly represent several similar instances of the
constraint satisfaction problem. If multiple instances of a CSP have
some common variables which have the same domains and constraints,
then they can be combined into a single instance of a MUSE CSP,
reducing the work required to apply the constraints. We introduce the
concepts of MUSE node consistency, MUSE arc consistency, and MUSE path
consistency. We then demonstrate how MUSE CSP can be used to compactly
represent lexically ambiguous sentences and the multiple sentence
hypotheses that are often generated by speech recognition algorithms
so that grammar constraints can be used to provide parses for all
syntactically correct sentences. Algorithms for MUSE arc and path
consistency are provided. Finally, we discuss how to create a MUSE
CSP from a set of CSPs which are labeled to indicate when the same
variable is shared by more than a single CSP.
de Campos, L.M. (1996)
"Characterizations of Decomposable Dependency Models" (research note),
Volume 5, pages 289-300.
PostScript: volume5/campos96a.ps (172K)
compressed, volume5/campos96a.ps.Z (67K)
PDF: volume5/campos96a.pdf (197K)
Abstract: Decomposable dependency models possess a number of
interesting and useful properties. This paper presents new
characterizations of decomposable models in terms of independence
relationships, which are obtained by adding a single axiom to the
well-known set characterizing dependency models that are isomorphic to
undirected graphs. We also briefly discuss a potential application of
our results to the problem of learning graphical models from data.
Zhang, N.L. and Poole, D. (1996)
"Exploiting Causal Independence in Bayesian Network Inference",
Volume 5, pages 301-328.
PostScript: volume5/zhang96a.ps (276K)
compressed, volume5/zhang96a.ps.Z (113K)
HTML: http://www.cs.ubc.ca/spider/poole/papers/ZhangPoole96/ZhangPoole96.html
PDF: volume5/zhang96a.pdf (260K)
Abstract: A new method is proposed for exploiting causal
independencies in exact Bayesian network inference. A Bayesian
network can be viewed as representing a factorization of a joint
probability into the multiplication of a set of conditional
probabilities. We present a notion of causal independence that
enables one to further factorize the conditional probabilities into a
combination of even smaller factors and consequently obtain a
finer-grain factorization of the joint probability. The new
formulation of causal independence lets us specify the conditional
probability of a variable given its parents in terms of an associative
and commutative operator, such as ``or'', ``sum'' or ``max'', on the
contribution of each parent. We start with a simple algorithm VE for
Bayesian network inference that, given evidence and a query variable,
uses the factorization to find the posterior distribution of the
query. We show how this algorithm can be extended to exploit causal
independence. Empirical studies, based on the CPCS networks for
medical diagnosis, show that this method is more efficient than
previous methods and allows for inference in larger networks than
previous algorithms.
Schlimmer, J.C. and Wells, P.C. (1996)
"Quantitative Results Comparing Three Intelligent Interfaces for
Information Capture: A Case Study Adding Name Information into an
Electronic Personal Organizer",
Volume 5, pages 329-349.
PostScript: volume5/schlimmer96a.ps (2.6M)
compressed, volume5/schlimmer96a.ps.Z (236K)
Online Appendix: volume5/schlimmer96a-appendix.hqx (45K) Source code, bin hexed
HTML: http://www.cs.washington.edu/research/jair/volume5/schlimmer96a-html/schlimmer96-0.html
PDF: volume5/schlimmer96a.pdf (177K)
Abstract: Efficiently entering information into a computer is key to
enjoying the benefits of computing. This paper describes three
intelligent user interfaces: handwriting recognition, adaptive menus,
and predictive fillin. In the context of adding a personUs name and
address to an electronic organizer, tests show handwriting recognition
is slower than typing on an on-screen, soft keyboard, while adaptive
menus and predictive fillin can be twice as fast. This paper also
presents strategies for applying these three interfaces to other
information collection domains.
Volume6
Wilson, D.R. and Martinez, T.R. (1997)
"Improved Heterogeneous Distance Functions", Volume 6, pages 1-34.
PostScript: volume6/wilson97a.ps (536K)
compressed, volume6/wilson97a.ps.Z (192K)
HTML: http://axon.cs.byu.edu/~randy/jair/wilson1.html
PDF: volume6/wilson97a.pdf (219K)
Abstract: Instance-based learning techniques typically handle
continuous and linear input values well, but often do not handle
nominal input attributes appropriately. The Value Difference Metric
(VDM) was designed to find reasonable distance values between nominal
attribute values, but it largely ignores continuous attributes,
requiring discretization to map continuous values into nominal values.
This paper proposes three new heterogeneous distance functions, called
the Heterogeneous Value Difference Metric (HVDM), the Interpolated
Value Difference Metric (IVDM), and the Windowed Value Difference
Metric (WVDM). These new distance functions are designed to handle
applications with nominal attributes, continuous attributes, or both.
In experiments on 48 applications the new distance metrics achieve
higher classification accuracy on average than three previous distance
functions on those datasets that have both nominal and continuous
attributes.
Wermter, S. and Weber, V. (1997)
"SCREEN: Learning a Flat Syntactic and Semantic Spoken Language Analysis Using Artificial Neural Networks",
Volume 6, pages 35-85.
PostScript: volume6/wermter97a.ps (1.1M)
compressed, volume6/wermter97a.ps.Z (290K)
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume6/wermter97a-html/abstract.html
PDF: volume6/wermter97a.pdf (533K)
Abstract: Previous approaches of analyzing spontaneously spoken
language often have been based on encoding syntactic and semantic
knowledge manually and symbolically. While there has been some
progress using statistical or connectionist language models, many
current spoken- language systems still use a relatively brittle,
hand-coded symbolic grammar or symbolic semantic component.
In contrast, we describe a so-called screening approach for learning
robust processing of spontaneously spoken language. A screening
approach is a flat analysis which uses shallow sequences of category
representations for analyzing an utterance at various syntactic,
semantic and dialog levels. Rather than using a deeply structured
symbolic analysis, we use a flat connectionist analysis. This
screening approach aims at supporting speech and language processing
by using (1) data-driven learning and (2) robustness of connectionist
networks. In order to test this approach, we have developed the
SCREEN system which is based on this new robust, learned and flat
analysis.
In this paper, we focus on a detailed description of SCREEN's
architecture, the flat syntactic and semantic analysis, the
interaction with a speech recognizer, and a detailed evaluation
analysis of the robustness under the influence of noisy or incomplete
input. The main result of this paper is that flat representations
allow more robust processing of spontaneous spoken language than
deeply structured representations. In particular, we show how the
fault-tolerance and learning capability of connectionist networks can
support a flat analysis for providing more robust spoken-language
processing within an overall hybrid symbolic/connectionist framework.
De Giacomo, G. and Lenzerini, M. (1997)
"A Uniform Framework for Concept Definitions in Description Logics",
Volume 6, pages 87-110.
PostScript: volume6/degiacomo97a.ps (603K)
compressed, volume6/degiacomo97a.ps.Z (174K)
PDF: volume6/degiacomo97a.pdf (386K)
Abstract: Most modern formalisms used in Databases and Artificial
Intelligence for describing an application domain are based on the
notions of class (or concept) and relationship among classes. One
interesting feature of such formalisms is the possibility of defining
a class, i.e., providing a set of properties that precisely
characterize the instances of the class. Many recent articles point
out that there are several ways of assigning a meaning to a class
definition containing some sort of recursion. In this paper, we argue
that, instead of choosing a single style of semantics, we achieve
better results by adopting a formalism that allows for different
semantics to coexist. We demonstrate the feasibility of our argument,
by presenting a knowledge representation formalism, the description
logic muALCQ, with the above characteristics. In addition to the
constructs for conjunction, disjunction, negation, quantifiers, and
qualified number restrictions, muALCQ includes special fixpoint
constructs to express (suitably interpreted) recursive definitions.
These constructs enable the usual frame-based descriptions to be
combined with definitions of recursive data structures such as
directed acyclic graphs, lists, streams, etc. We establish several
properties of muALCQ, including the decidability and the computational
complexity of reasoning, by formulating a correspondence with a
particular modal logic of programs called the modal mu-calculus.
Agre, P. and Horswill, I. (1997)
"Lifeworld Analysis",
Volume 6, pages 111-145.
PostScript: volume6/agre97a.ps (548K)
compressed, volume6/agre97a.ps.Z (243K)
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume6/agre97a-html/lifeworlds.html
PDF: volume6/agre97a.pdf (447K)
Abstract: We argue that the analysis of agent/environment interactions
should be extended to include the conventions and invariants
maintained by agents throughout their activity. We refer to this
hicker notion of environment as a lifeworld and present a partial set
of formal tools for describing structures of lifeworlds and the ways
in which they computationally simplify activity. As one specific
example, we apply the tools to the analysis of the Toast system and
show how versions of the system with very different control structures
in fact implement a common control structure together with different
conventions for encoding task state in the positions or states of
objects in the environment.
Darwiche, A. and Provan, G. (1997)
"Query DAGs: A Practical Paradigm for Implementing Belief-Network Inference",
Volume 6, pages 147-176.
PostScript: volume6/darwiche97a.ps (412K)
compressed, volume6/darwiche97a.ps.Z (162K)
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume6/darwiche97a-html/jair-f.html
PDF: volume6/darwiche97a.pdf (337K)
Abstract: We describe a new paradigm for implementing inference in
belief networks, which consists of two steps: (1) compiling a belief
network into an arithmetic expression called a Query DAG (Q-DAG); and
(2) answering queries using a simple evaluation algorithm. Each node
of a Q-DAG represents a numeric operation, a number, or a symbol for
evidence. Each leaf node of a Q-DAG represents the answer to a
network query, that is, the probability of some event of interest. It
appears that Q-DAGs can be generated using any of the standard
algorithms for exact inference in belief networks (we show how they
can be generated using clustering and conditioning algorithms). The
time and space complexity of a Q-DAG generation algorithm is no worse
than the time complexity of the inference algorithm on which it is
based. The complexity of a Q-DAG evaluation algorithm is linear in the
size of the Q-DAG, and such inference amounts to a standard evaluation
of the arithmetic expression it represents. The intended value of
Q-DAGs is in reducing the software and hardware resources required to
utilize belief networks in on-line, real-world applications. The
proposed framework also facilitates the development of on-line
inference on different software and hardware platforms due to the
simplicity of the Q-DAG evaluation algorithm. Interestingly enough,
Q-DAGs were found to serve other purposes: simple techniques for
reducing Q-DAGs tend to subsume relatively complex optimization
techniques for belief-network inference, such as network-pruning and
computation-caching.
Opitz, D.W. and Shavlik, J.W. (1997)
"Connectionist Theory Refinement: Genetically Searching the Space of Network Topologies",
Volume 6, pages 177-209.
PostScript: volume6/opitz97a.ps (578K)
compressed, volume6/opitz97a.ps.Z (267K)
HTML: http://www.cs.umt.edu/CS/FAC/OPITZ/JAIR97/main.html
PDF: volume6/opitz97a.pdf (436K)
Abstract: An algorithm that learns from a set of examples should
ideally be able to exploit the available resources of (a) abundant
computing power and (b) domain-specific knowledge to improve its
ability to generalize. Connectionist theory-refinement systems, which
use background knowledge to select a neural network's topology and
initial weights, have proven to be effective at exploiting
domain-specific knowledge; however, most do not exploit available
computing power. This weakness occurs because they lack the ability to
refine the topology of the neural networks they produce, thereby
limiting generalization, especially when given impoverished domain
theories. We present the REGENT algorithm which uses (a) domain-specific
knowledge to help create an initial population of knowledge-based
neural networks and (b) genetic operators of crossover and mutation
(specifically designed for knowledge-based networks) to continually
search for better network topologies. Experiments on three real-world
domains indicate that our new algorithm is able to significantly
increase generalization compared to a standard connectionist
theory-refinement system, as well as our previous algorithm for
growing knowledge-based networks.
Jonsson, P. and Drakengren, T. (1997)
"A Complete Classification of Tractability in RCC-5", Volume 6, pages 211-221.
PostScript: volume6/jonsson97a.ps (168K)
compressed, volume6/jonsson97a.ps.Z (69K)
PDF: volume6/jonsson97a.pdf (205K)
Online Appendix: volume6/jonsson97a-appendix.tar (39K) Source code
Abstract: We investigate the computational properties of the spatial
algebra RCC-5 which is a restricted version of the RCC framework for
spatial reasoning. The satisfiability problem for RCC-5 is known to
be NP-complete but not much is known about its approximately four
billion subclasses. We provide a complete classification of
satisfiability for all these subclasses into polynomial and
NP-complete respectively. In the process, we identify all maximal
tractable subalgebras which are four in total.
Pollack, M.E., Joslin, D. and Paolucci, M.. (1997)
"Flaw Selection Strategies for Partial-Order Planning",
Volume 6, pages 223-262.
PostScript: volume6/pollack97a.ps (1.1M)
compressed, volume6/pollack97a.ps.Z (387K)
Online Appendix: volume6/pollack97a-appendix.sit.hqx (80K) Excel Spreadsheet, Experimental Data
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume6/pollack97a-html/base.html
PDF: volume6/pollack97a.pdf (592K)
Abstract: Several recent studies have compared the relative efficiency
of alternative flaw selection strategies for partial-order causal link
(POCL) planning. We review this literature, and present new
experimental results that generalize the earlier work and explain some
of the discrepancies in it. In particular, we describe the Least-Cost
Flaw Repair (LCFR) strategy developed and analyzed by Joslin and
Pollack (1994), and compare it with other strategies, including
Gerevini and Schubert's (1996) ZLIFO strategy. LCFR and ZLIFO make
very different, and apparently conflicting claims about the most
effective way to reduce search-space size in POCL planning. We
resolve this conflict, arguing that much of the benefit that Gerevini
and Schubert ascribe to the LIFO component of their ZLIFO strategy is
better attributed to other causes. We show that for many problems, a
strategy that combines least-cost flaw selection with the delay of
separable threats will be effective in reducing search-space size, and
will do so without excessive computational overhead. Although such a
strategy thus provides a good default, we also show that certain
domain characteristics may reduce its effectiveness.
Volume7
Halpern, J.Y. (1997)
"Defining Relative Likelihood in Partially-Ordered Preferential Structures",
Volume 7, pages 1-24.
PostScript: volume7/halpern97a.ps (573K)
compressed, volume7/halpern97a.ps.Z (173K)
PDF: volume7/halpern97a.pdf (395K)
Abstract: Starting with a likelihood or preference order on worlds, we
extend it to a likelihood ordering on sets of worlds in a natural way,
and examine the resulting logic. Lewis earlier considered such a
notion of relative likelihood in the context of studying
counterfactuals, but he assumed a total preference order on worlds.
Complications arise when examining partial orders that are not present
for total orders. There are subtleties involving the exact approach
to lifting the order on worlds to an order on sets of worlds. In
addition, the axiomatization of the logic of relative likelihood in
the case of partial orders gives insight into the connection between
relative likelihood and default reasoning.
Drakengren, T. and Jonsson, P. (1997)
"Eight Maximal Tractable Subclasses of Allen's Algebra with Metric Time",
Volume 7, pages 25-45.
PostScript: volume7/drakengren97a.ps (245K)
compressed, volume7/drakengren97a.ps.Z (97K)
PDF: volume7/drakengren97a.pdf (958K)
Online Appendix: volume7/drakengren97a-appendix.tar (359K) tar file containing algebras
Abstract: This paper combines two important directions of research in
temporal resoning: that of finding maximal tractable subclasses of
Allen's interval algebra, and that of reasoning with metric temporal
information. Eight new maximal tractable subclasses of Allen's
interval algebra are presented, some of them subsuming previously
reported tractable algebras. The algebras allow for metric temporal
constraints on interval starting or ending points, using the recent
framework of Horn DLRs. Two of the algebras can express the notion of
sequentiality between intervals, being the first such algebras
admitting both qualitative and metric time.
Mammen, D.L. and Hogg, T. (1997)
"A New Look at the Easy-Hard-Easy Pattern of Combinatorial Search Difficulty",
Volume 7, pages 47-66.
PostScript: volume7/mammen97a.ps (563K)
compressed, volume7/mammen97a.ps.Z (175K)
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume7/mammen97a-html/ehe3.html
PDF: volume7/mammen97a.pdf (277K)
Abstract: The easy-hard-easy pattern in the difficulty of
combinatorial search problems as constraints are added has been
explained as due to a competition between the decrease in number of
solutions and increased pruning. We test the generality of this
explanation by examining one of its predictions: if the number of
solutions is held fixed by the choice of problems, then increased
pruning should lead to a monotonic decrease in search cost. Instead,
we find the easy-hard-easy pattern in median search cost even when the
number of solutions is held constant, for some search methods. This
generalizes previous observations of this pattern and shows that the
existing theory does not explain the full range of the peak in search
cost. In these cases the pattern appears to be due to changes in the
size of the minimal unsolvable subproblems, rather than changing
numbers of solutions.
Nevill-Manning, C.G. and Witten, I.H. (1997)
"Identifying Hierarchical Structure in Sequences: A linear-time algorithm",
Volume 7, pages 67-82.
PostScript: volume7/nevill97a.ps (287K)
compressed, volume7/nevill97a.ps.Z (118K)
PDF: volume7/nevill97a.pdf (106K)
Online Appendix: volume7/nevill97a-appendix.html, WWW Interface to SEQUITUR
HTML: http://www.cs.waikato.ac.nz/sequitur/jair
PDF: volume7/nevill97a.pdf (106K)
Abstract: SEQUITUR is an algorithm that infers a hierarchical
structure from a sequence of discrete symbols by replacing repeated
phrases with a grammatical rule that generates the phrase, and
continuing this process recursively. The result is a hierarchical
representation of the original sequence, which offers insights into
its lexical structure. The algorithm is driven by two constraints that
reduce the size of the grammar, and produce structure as a by-product.
SEQUITUR breaks new ground by operating incrementally. Moreover, the
method's simple structure permits a proof that it operates in space
and time that is linear in the size of the input. Our implementation
can process 50,000 symbols per second and has been applied to an
extensive range of real world sequences.
Tambe, M. (1997)
"Towards Flexible Teamwork",
Volume 7, pages 83-124.
PostScript: volume7/tambe97a.ps (940K)
compressed, volume7/tambe97a.ps.Z (223K)
PDF: volume7/tambe97a.pdf (1.7M)
Online Appendix: http://www.isi.edu/soar/tambe/steam/steam.html, STEAM 1.0
Abstract: Many AI researchers are today striving to build agent teams
for complex, dynamic multi-agent domains, with intended applications
in arenas such as education, training, entertainment, information
integration, and collective robotics. Unfortunately, uncertainties in
these complex, dynamic domains obstruct coherent teamwork. In
particular, team members often encounter differing, incomplete, and
possibly inconsistent views of their environment. Furthermore, team
members can unexpectedly fail in fulfilling responsibilities or
discover unexpected opportunities. Highly flexible coordination and
communication is key in addressing such uncertainties. Simply fitting
individual agents with precomputed coordination plans will not do, for
their inflexibility can cause severe failures in teamwork, and their
domain-specificity hinders reusability.
Our central hypothesis is that the key to such flexibility and
reusability is providing agents with general models of teamwork.
Agents exploit such models to autonomously reason about coordination
and communication, providing requisite flexibility. Furthermore, the
models enable reuse across domains, both saving implementation effort
and enforcing consistency. This article presents one general,
implemented model of teamwork, called STEAM. The basic building block
of teamwork in STEAM is joint intentions (Cohen & Levesque, 1991b);
teamwork in STEAM is based on agents' building up a (partial)
hierarchy of joint intentions (this hierarchy is seen to parallel
Grosz & Kraus's partial SharedPlans, 1996). Furthermore, in STEAM,
team members monitor the team's and individual members' performance,
reorganizing the team as necessary. Finally, decision-theoretic
communication selectivity in STEAM ensures reduction in communication
overheads of teamwork, with appropriate sensitivity to the
environmental conditions. This article describes STEAM's application
in three different complex domains, and presents detailed empirical
results.
Leherte, L., Glasgow, J., Baxter, K., Steeg, E. and Fortier, S. (1997)
"Analysis of Three-Dimensional Protein Images",
Volume 7, pages 125-159.
PostScript: volume7/leherte97a.ps (3.1M)
compressed, volume7/leherte97a.ps.Z (525K)
PDF: volume7/leherte97a.pdf (471K)
Abstract: A fundamental goal of research in molecular biology is to
understand protein structure. Protein crystallography is currently the
most successful method for determining the three-dimensional (3D)
conformation of a protein, yet it remains labor intensive and relies
on an expert's ability to derive and evaluate a protein scene model.
In this paper, the problem of protein structure determination is
formulated as an exercise in scene analysis. A computational
methodology is presented in which a 3D image of a protein is segmented
into a graph of critical points. Bayesian and certainty factor
approaches are described and used to analyze critical point graphs and
identify meaningful substructures, such as alpha-helices and
beta-sheets. Results of applying the methodologies to protein images
at low and medium resolution are reported. The research is related to
approaches to representation, segmentation and classification in
vision, as well as to top-down approaches to protein structure
prediction.
Ihrig, L.H. and Kambhampati, S. (1997)
"Storing and Indexing Plan Derivations through Explanation-based Analysis
of Retrieval Failures", Volume 7, pages 161-198.
PostScript: volume7/ihrig97a.ps (1.0M)
compressed, volume7/ihrig97a.ps.Z (350K)
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume7/ihrig97a-html/ihrig-kambh97.html
PDF: volume7/ihrig97a.pdf (496K)
Abstract: Case-Based Planning (CBP) provides a way of scaling up
domain-independent planning to solve large problems in complex
domains. It replaces the detailed and lengthy search for a solution
with the retrieval and adaptation of previous planning experiences.
In general, CBP has been demonstrated to improve performance over
generative (from-scratch) planning. However, the performance
improvements it provides are dependent on adequate judgements as to
problem similarity. In particular, although CBP may substantially
reduce planning effort overall, it is subject to a mis-retrieval
problem. The success of CBP depends on these retrieval errors being
relatively rare. This paper describes the design and implementation of
a replay framework for the case-based planner DERSNLP+EBL. DERSNLP+EBL
extends current CBP methodology by incorporating explanation-based
learning techniques that allow it to explain and learn from the
retrieval failures it encounters. These techniques are used to refine
judgements about case similarity in response to feedback when a wrong
decision has been made. The same failure analysis is used in building
the case library, through the addition of repairing cases. Large
problems are split and stored as single goal subproblems. Multi-goal
problems are stored only when these smaller cases fail to be merged
into a full solution. An empirical evaluation of this approach
demonstrates the advantage of learning from experienced retrieval
failure.
Zhang, N.L. and Liu, W. (1997)
"A Model Approximation Scheme for Planning in Partially Observable
Stochastic Domains", Volume 7, pages 199-230.
PostScript: volume7/zhang97a.ps (399K)
compressed, volume7/zhang97a.ps.Z (184K)
PDF: volume7/zhang97a.pdf (405K)
Abstract: Partially observable Markov decision processes (POMDPs) are
a natural model for planning problems where effects of actions are
nondeterministic and the state of the world is not completely
observable. It is difficult to solve POMDPs exactly. This paper
proposes a new approximation scheme. The basic idea is to transform a
POMDP into another one where additional information is provided by an
oracle. The oracle informs the planning agent that the current state
of the world is in a certain region. The transformed POMDP is
consequently said to be region observable. It is easier to solve than
the original POMDP. We propose to solve the transformed POMDP and use
its optimal policy to construct an approximate policy for the original
POMDP. By controlling the amount of additional information that the
oracle provides, it is possible to find a proper tradeoff between
computational time and approximation quality. In terms of algorithmic
contributions, we study in details how to exploit region observability
in solving the transformed POMDP. To facilitate the study, we also
propose a new exact algorithm for general POMDPs. The algorithm is
conceptually simple and yet is significantly more efficient than all
previous exact algorithms.
Monderer, D. and Tennenholtz, M. (1997)
"Dynamic Non-Bayesian Decision Making",
Volume 7, pages 231-248.
PostScript: volume7/monderer97a.ps (238K)
compressed, volume7/monderer97a.ps.Z (95K)
PDF: volume7/monderer97a.pdf (269K)
Abstract: The model of a non-Bayesian agent who faces a repeated game
with incomplete information against Nature is an appropriate tool for
modeling general agent-environment interactions. In such a model the
environment state (controlled by Nature) may change arbitrarily, and
the feedback/reward function is initially unknown. The agent is not
Bayesian, that is he does not form a prior probability neither on the
state selection strategy of Nature, nor on his reward function. A
policy for the agent is a function which assigns an action to every
history of observations and actions. Two basic feedback structures
are considered. In one of them -- the perfect monitoring case -- the
agent is able to observe the previous environment state as part of his
feedback, while in the other -- the imperfect monitoring case -- all
that is available to the agent is the reward obtained. Both of these
settings refer to partially observable processes, where the current
environment state is unknown. Our main result refers to the
competitive ratio criterion in the perfect monitoring case. We prove
the existence of an efficient stochastic policy that ensures that the
competitive ratio is obtained at almost all stages with an arbitrarily
high probability, where efficiency is measured in terms of rate of
convergence. It is further shown that such an optimal policy does not
exist in the imperfect monitoring case. Moreover, it is proved that in
the perfect monitoring case there does not exist a deterministic
policy that satisfies our long run optimality criterion. In addition,
we discuss the maxmin criterion and prove that a deterministic
efficient optimal strategy does exist in the imperfect monitoring case
under this criterion. Finally we show that our approach to long-run
optimality can be viewed as qualitative, which distinguishes it from
previous work in this area.
Frank, J., Cheeseman, P. and Stutz, J. (1997)
"When Gravity Fails: Local Search Topology",
Volume 7, pages 249-281.
PostScript: volume7/frank97a.ps (612K)
compressed, volume7/frank97a.ps.Z (186K)
PDF: volume7/frank97a.pdf (409K)
Abstract: Local search algorithms for combinatorial search problems
frequently encounter a sequence of states in which it is impossible to
improve the value of the objective function; moves through these
regions, called plateau moves, dominate the time spent in local
search. We analyze and characterize plateaus for three different
classes of randomly generated Boolean Satisfiability problems. We
identify several interesting features of plateaus that impact the
performance of local search algorithms. We show that local minima
tend to be small but occasionally may be very large. We also show
that local minima can be escaped without unsatisfying a large number
of clauses, but that systematically searching for an escape route may
be computationally expensive if the local minimum is large. We show
that plateaus with exits, called benches, tend to be much larger than
minima, and that some benches have very few exit states which local
search can use to escape. We show that the solutions (i.e., global
minima) of randomly generated problem instances form clusters, which
behave similarly to local minima. We revisit several enhancements of
local search algorithms and explain their performance in light of our
results. Finally we discuss strategies for creating the next
generation of local search algorithms.
Kaindl, H. and Kainz, G. (1997)
"Bidirectional Heuristic Search Reconsidered",
Volume 7, pages 283-317.
PostScript: volume7/kaindl97a.ps (858K)
compressed, volume7/kaindl97a.ps.Z (338K)
PDF: volume7/kaindl97a.pdf (386K)
Abstract: The assessment of bidirectional heuristic search has been
incorrect since it was first published more than a quarter of a
century ago. For quite a long time, this search strategy did not
achieve the expected results, and there was a major misunderstanding
about the reasons behind it. Although there is still wide-spread
belief that bidirectional heuristic search is afflicted by the problem
of search frontiers passing each other, we demonstrate that this
conjecture is wrong. Based on this finding, we present both a new
generic approach to bidirectional heuristic search and a new approach
to dynamically improving heuristic values that is feasible in
bidirectional search only. These approaches are put into perspective
with both the traditional and more recently proposed approaches in
order to facilitate a better overall understanding. Empirical results
of experiments with our new approaches show that bidirectional
heuristic search can be performed very efficiently and also with
limited memory. These results suggest that bidirectional heuristic
search appears to be better for solving certain difficult problems
than corresponding unidirectional search. This provides some evidence
for the usefulness of a search strategy that was long neglected. In
summary, we show that bidirectional heuristic search is viable and
consequently propose that it be reconsidered.
Volume8
Engelfriet, J. (1998)
"Monotonicity and Persistence in Preferential Logics",
Volume 8, pages 1-21.
PostScript: volume8/engelfriet98a.ps (266K)
compressed, volume8/engelfriet98a.ps.Z (121K)
PDF: volume8/engelfriet98a.pdf (310K)
Abstract: An important characteristic of many logics for Artificial
Intelligence is their nonmonotonicity. This means that adding a
formula to the premises can invalidate some of the consequences. There
may, however, exist formulae that can always be safely added to the
premises without destroying any of the consequences: we say they
respect monotonicity. Also, there may be formulae that, when they are
a consequence, can not be invalidated when adding any formula to the
premises: we call them conservative. We study these two classes of
formulae for preferential logics, and show that they are closely
linked to the formulae whose truth-value is preserved along the
(preferential) ordering. We will consider some preferential logics for
illustration, and prove syntactic characterization results for them.
The results in this paper may improve the efficiency of theorem
provers for preferential logics.
Gogic, G., Papadimitriou, C.H., and Sideri, M. (1998)
"Incremental Recompilation of Knowledge",
Volume 8, pages 23-37.
PostScript: volume8/gogic98a.ps (189K)
compressed, volume8/gogic98a.ps.Z (77K)
PDF: volume8/gogic98a.pdf (659K)
Abstract: Approximating a general formula from above and below by Horn
formulas (its Horn envelope and Horn core, respectively) was proposed
by Selman and Kautz (1991, 1996) as a form of ``knowledge
compilation,'' supporting rapid approximate reasoning; on the negative
side, this scheme is static in that it supports no updates, and has
certain complexity drawbacks pointed out by Kavvadias, Papadimitriou
and Sideri (1993). On the other hand, the many frameworks and schemes
proposed in the literature for theory update and revision are plagued
by serious complexity-theoretic impediments, even in the Horn case, as
was pointed out by Eiter and Gottlob (1992), and is further
demonstrated in the present paper. More fundamentally, these schemes
are not inductive, in that they may lose in a single update any
positive properties of the represented sets of formulas (small size,
Horn structure, etc.). In this paper we propose a new scheme,
incremental recompilation, which combines Horn approximation and
model-based updates; this scheme is inductive and very efficient, free
of the problems facing its constituents. A set of formulas is
represented by an upper and lower Horn approximation. To update, we
replace the upper Horn formula by the Horn envelope of its
minimum-change update, and similarly the lower one by the Horn core of
its update; the key fact which enables this scheme is that Horn
envelopes and cores are easy to compute when the underlying formula is
the result of a minimum-change update of a Horn formula by a clause.
We conjecture that efficient algorithms are possible for more complex
updates.
Argamon-Engelson, S. and Koppel, M. (1998)
"Tractability of Theory Patching",
Volume 8, pages 39-65.
PostScript: volume8/argamon98a.ps (282K)
compressed, volume8/argamon98a.ps.Z (128K)
PDF: volume8/argamon98a.pdf (1.1M)
Abstract: In this paper we consider the problem of `theory patching',
in which we are given a domain theory, some of whose components are
indicated to be possibly flawed, and a set of labeled training
examples for the domain concept. The theory patching problem is to
revise only the indicated components of the theory, such that the
resulting theory correctly classifies all the training examples.
Theory patching is thus a type of theory revision in which revisions
are made to individual components of the theory. Our concern in this
paper is to determine for which classes of logical domain theories the
theory patching problem is tractable. We consider both propositional
and first-order domain theories, and show that the theory patching
problem is equivalent to that of determining what information
contained in a theory is `stable' regardless of what revisions might
be performed to the theory. We show that determining stability is
tractable if the input theory satisfies two conditions: that revisions
to each theory component have monotonic effects on the classification
of examples, and that theory components act independently in the
classification of examples in the theory. We also show how the
concepts introduced can be used to determine the soundness and
completeness of particular theory patching algorithms.
Moore, A. and Lee, M.S. (1998)
"Cached Sufficient Statistics for Efficient Machine Learning with Large Datasets",
Volume 8, pages 67-91.
PostScript: volume8/moore98a.ps (303K)
compressed, volume8/moore98a.ps.Z (122K)
PDF: volume8/moore98a.pdf (300K)
Abstract: This paper introduces new algorithms and data structures for
quick counting for machine learning datasets. We focus on the
counting task of constructing contingency tables, but our approach is
also applicable to counting the number of records in a dataset that
match conjunctive queries. Subject to certain assumptions, the costs
of these operations can be shown to be independent of the number of
records in the dataset and loglinear in the number of non-zero entries
in the contingency table.
We provide a very sparse data structure, the ADtree, to minimize
memory use. We provide analytical worst-case bounds for this structure
for several models of data distribution. We empirically demonstrate
that tractably-sized data structures can be produced for large
real-world datasets by (a) using a sparse tree structure that never
allocates memory for counts of zero, (b) never allocating memory for
counts that can be deduced from other counts, and (c) not bothering to
expand the tree fully near its leaves.
We show how the ADtree can be used to accelerate Bayes net structure
finding algorithms, rule learning algorithms, and feature selection
algorithms, and we provide a number of empirical results comparing
ADtree methods against traditional direct counting approaches. We
also discuss the possible uses of ADtrees in other machine learning
methods, and discuss the merits of ADtrees in comparison with
alternative representations such as kd-trees, R-trees and Frequent Sets.
Srivastava, B. and Kambhampati, S. (1998)
"Synthesizing Customized Planners from Specifications",
Volume 8, pages 93-128.
PostScript: volume8/srivastava98a.ps (750K)
compressed, volume8/srivastava98a.ps.Z (221K)
PDF: volume8/srivastava98a.pdf (478K)
Abstract: Existing plan synthesis approaches in artificial
intelligence fall into two categories -- domain independent and domain
dependent. The domain independent approaches are applicable across a
variety of domains, but may not be very efficient in any one given
domain. The domain dependent approaches need to be (re)designed for
each domain separately, but can be very efficient in the domain for
which they are designed. One enticing alternative to these approaches
is to automatically synthesize domain independent planners given the
knowledge about the domain and the theory of planning. In this paper,
we investigate the feasibility of using existing automated software
synthesis tools to support such synthesis. Specifically, we describe
an architecture called CLAY in which the Kestrel Interactive
Development System (KIDS) is used to derive a domain-customized
planner through a semi-automatic combination of a declarative theory
of planning, and the declarative control knowledge specific to a given
domain, to semi-automatically combine them to derive domain-customized
planners. We discuss what it means to write a declarative theory of
planning and control knowledge for KIDS, and illustrate our approach
by generating a class of domain-specific planners using state space
refinements. Our experiments show that the synthesized planners can
outperform classical refinement planners (implemented as
instantiations of UCP, Kambhampati & Srivastava, 1995), using the same
control knowledge. We will contrast the costs and benefits of the
synthesis approach with conventional methods for customizing domain
independent planners.
Fuernkranz, J. (1998)
"Integrative Windowing",
Volume 8, pages 129-164.
PostScript: volume8/fuernkranz98a.ps (455K)
compressed, volume8/fuernkranz98a.ps.Z (166K)
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume8/fuernkranz98a-html/fuernkranz98a.html
PDF: volume8/fuernkranz98a.pdf (383K)
Abstract: In this paper we re-investigate windowing for rule learning
algorithms. We show that, contrary to previous results for decision
tree learning, windowing can in fact achieve significant run-time
gains in noise-free domains and explain the different behavior of rule
learning algorithms by the fact that they learn each rule
independently. The main contribution of this paper is integrative
windowing, a new type of algorithm that further exploits this property
by integrating good rules into the final theory right after they have
been discovered. Thus it avoids re-learning these rules in subsequent
iterations of the windowing process. Experimental evidence in a
variety of noise-free domains shows that integrative windowing can in
fact achieve substantial run-time gains. Furthermore, we discuss the
problem of noise in windowing and present an algorithm that is able to
achieve run-time gains in a set of experiments in a simple domain with
artificial noise.
Darwiche, A. (1998)
"Model-Based Diagnosis using Structured System Descriptions",
Volume 8, pages 165-222.
PostScript: volume8/darwiche98a.ps (1.1M)
compressed, volume8/darwiche98a.ps.Z (327K)
PDF: volume8/darwiche98a.pdf (2.6M)
Abstract: This paper presents a comprehensive approach for model-based
diagnosis which includes proposals for characterizing and computing
preferred diagnoses, assuming that the system description is augmented
with a system structure (a directed graph explicating the
interconnections between system components). Specifically, we first
introduce the notion of a consequence, which is a syntactically
unconstrained propositional sentence that characterizes all
consistency-based diagnoses and show that standard characterizations
of diagnoses, such as minimal conflicts, correspond to syntactic
variations on a consequence. Second, we propose a new syntactic
variation on the consequence known as negation normal form (NNF) and
discuss its merits compared to standard variations. Third, we
introduce a basic algorithm for computing consequences in NNF given a
structured system description. We show that if the system structure
does not contain cycles, then there is always a linear-size
consequence in NNF which can be computed in linear time. For arbitrary
system structures, we show a precise connection between the complexity
of computing consequences and the topology of the underlying system
structure. Finally, we present an algorithm that enumerates the
preferred diagnoses characterized by a consequence. The algorithm is
shown to take linear time in the size of the consequence if the
preference criterion satisfies some general conditions.
Finkelstein, L. and Markovitch, S. (1998)
"A Selective Macro-learning Algorithm and its Application to
the NxN Sliding-Tile Puzzle",
Volume 8, pages 223-263.
PostScript: volume8/finkelstein98a.ps (516K)
compressed, volume8/finkelstein98a.ps.Z (209K)
Online Appendix: volume8/finkelstein98a-appendix.html (10K) Source code
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume8/finkelstein98a-html/hillary.html
PDF: volume8/finkelstein98a.pdf (1.6M)
Abstract: One of the most common mechanisms used for speeding up
problem solvers is macro-learning. Macros are sequences of basic
operators acquired during problem solving. Macros are used by the
problem solver as if they were basic operators. The major problem
that macro-learning presents is the vast number of macros that are
available for acquisition. Macros increase the branching factor of
the search space and can severely degrade problem-solving efficiency.
To make macro learning useful, a program must be selective in
acquiring and utilizing macros. This paper describes a general method
for selective acquisition of macros. Solvable training problems are
generated in increasing order of difficulty. The only macros acquired
are those that take the problem solver out of a local minimum to a
better state. The utility of the method is demonstrated in several
domains, including the domain of NxN sliding-tile puzzles. After
learning on small puzzles, the system is able to efficiently solve
puzzles of any size.
Volume9
Littman, M.L., Goldsmith, J. and Mundhenk M. (1998)
"The Computational Complexity of Probabilistic Planning",
Volume 9, pages 1-36.
PostScript: volume9/littman98a.ps (975K)
compressed, volume9/littman98a.ps.Z (324K)
PDF: volume9/littman98a.pdf (488K)
Abstract: We examine the computational complexity of testing and
finding small plans in probabilistic planning domains with both flat
and propositional representations. The complexity of plan evaluation
and existence varies with the plan type sought; we examine totally
ordered plans, acyclic plans, and looping plans, and partially ordered
plans under three natural definitions of plan value. We show that
problems of interest are complete for a variety of complexity classes:
PL, P, NP, co-NP, PP, NP^PP, co-NP^PP, and PSPACE. In the process of
proving that certain planning problems are complete for NP^PP, we
introduce a new basic NP^PP-complete problem, E-MAJSAT, which
generalizes the standard Boolean satisfiability problem to
computations involving probabilistic quantities; our results suggest
that the development of good heuristics for E-MAJSAT could be
important for the creation of efficient algorithms for a wide variety
of problems.
Ledeniov, O. and Markovitch, S. (1998)
"The Divide-and-Conquer Subgoal-Ordering Algorithm for Speeding up Logic Inference",
Volume 9, pages 37-97.
PostScript: volume9/ledeniov98a.ps (644K)
compressed, volume9/ledeniov98a.ps.Z (253K)
PDF: volume9/ledeniov98a.pdf (600K)
Abstract: It is common to view programs as a combination of logic and
control: the logic part defines what the program must do, the control
part -- how to do it. The Logic Programming paradigm was developed
with the intention of separating the logic from the control.
Recently, extensive research has been conducted on automatic
generation of control for logic programs. Only a few of these works
considered the issue of automatic generation of control for improving
the efficiency of logic programs. In this paper we present a novel
algorithm for automatic finding of lowest-cost subgoal orderings. The
algorithm works using the divide-and-conquer strategy. The given set
of subgoals is partitioned into smaller sets, based on co-occurrence
of free variables. The subsets are ordered recursively and merged,
yielding a provably optimal order. We experimentally demonstrate the
utility of the algorithm by testing it in several domains, and discuss
the possibilities of its cooperation with other existing methods.
Backstrom, C. (1998)
"Computational Aspects of Reordering Plans",
Volume 9, pages 99-137.
PostScript: volume9/backstrom98a.ps (815K)
compressed, volume9/backstrom98a.ps.Z (241K)
PDF: volume9/backstrom98a.pdf (653K)
Abstract: This article studies the problem of modifying the action
ordering of a plan in order to optimise the plan according to various
criteria. One of these criteria is to make a plan less constrained
and the other is to minimize its parallel execution time. Three
candidate definitions are proposed for the first of these criteria,
constituting a sequence of increasing optimality guarantees. Two of
these are based on deordering plans, which means that ordering
relations may only be removed, not added, while the third one uses
reordering, where arbitrary modifications to the ordering are allowed.
It is shown that only the weakest one of the three criteria is
tractable to achieve, the other two being NP-hard and even difficult
to approximate. Similarly, optimising the parallel execution time of
a plan is studied both for deordering and reordering of plans. In the
general case, both of these computations are NP-hard. However, it is
shown that optimal deorderings can be computed in polynomial time for
a class of planning languages based on the notions of producers,
consumers and threats, which includes most of the commonly used
planning languages. Computing optimal reorderings can potentially
lead to even faster parallel executions, but this problem remains
NP-hard and difficult to approximate even under quite severe restrictions.
Cook, D.J. and Varnell, R.C. (1998)
"Adaptive Parallel Iterative Deepening Search",
Volume 9, pages 139-165.
PostScript: volume9/cook98a.ps (628K)
compressed, volume9/cook98a.ps.Z (196K)
Online Appendix: volume9/cook98a-appendix.tar (225K) Eureka 2.0 Source code
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume9/cook98a-html/eureka.html
PDF: volume9/cook98a.pdf (276K)
Abstract: Many of the artificial intelligence techniques developed to
date rely on heuristic search through large spaces. Unfortunately,
the size of these spaces and the corresponding computational effort
reduce the applicability of otherwise novel and effective algorithms.
A number of parallel and distributed approaches to search have
considerably improved the performance of the search process.
Our goal is to develop an architecture that automatically selects
parallel search strategies for optimal performance on a variety of
search problems. In this paper we describe one such architecture
realized in the Eureka system, which combines the benefits of
many different approaches to parallel heuristic search. Through
empirical and theoretical analyses we observe that features of the
problem space directly affect the choice of optimal parallel search
strategy. We then employ machine learning techniques to select the
optimal parallel search strategy for a given problem space. When a
new search task is input to the system, Eureka uses features
describing the search space and the chosen architecture to
automatically select the appropriate search strategy. Eureka
has been tested on a MIMD parallel processor, a distributed network of
workstations, and a single workstation using multithreading. Results
generated from fifteen puzzle problems, robot arm motion problems,
artificial search spaces, and planning problems indicate that
Eureka outperforms any of the tested strategies used exclusively for
all problem instances and is able to greatly reduce the search time
for these applications.
Ruiz, A., Lopez-de-Teruel, P.E. and Garrido, M.C. (1998)
"Probabilistic Inference from Arbitrary Uncertainty using Mixtures of Factorized Generalized Gaussians",
Volume 9, pages 167-217.
PostScript: volume9/ruiz98a.ps (3.5M)
compressed, volume9/ruiz98a.ps.Z (1.1M)
PDF: volume9/ruiz98a.pdf (753K)
Abstract: This paper presents a general and efficient framework for
probabilistic inference and learning from arbitrary uncertain
information. It exploits the calculation properties of finite mixture
models, conjugate families and factorization. Both the joint
probability density of the variables and the likelihood function of
the (objective or subjective) observation are approximated by a
special mixture model, in such a way that any desired conditional
distribution can be directly obtained without numerical
integration. We have developed an extended version of the expectation
maximization (EM) algorithm to estimate the parameters of mixture
models from uncertain training examples (indirect observations). As a
consequence, any piece of exact or uncertain information about both
input and output values is consistently handled in the inference and
learning stages. This ability, extremely useful in certain situations,
is not found in most alternative methods. The proposed framework is
formally justified from standard probabilistic principles and
illustrative examples are provided in the fields of nonparametric
pattern classification, nonlinear regression and pattern
completion. Finally, experiments on a real application and comparative
results over standard databases provide empirical evidence of the
utility of the method in a wide range of applications.
Vandegriend, B. and Culberson, J. (1998)
"The Gn,m Phase Transition is Not Hard for the Hamiltonian Cycle Problem",
Volume 9, pages 219-245.
PostScript: volume9/vandegriend98a.ps (574K)
compressed, volume9/vandegriend98a.ps.Z (180K)
Online Appendix1: volume9/vandegriend98a-appendix1.tar (368K) source code for HC program
Online Appendix2: volume9/vandegriend98a-appendix2/append.html (3K) HTML illustration of artifact
PDF: volume9/vandegriend98a.pdf (394K)
Abstract: Using an improved backtrack algorithm with sophisticated
pruning techniques, we revise previous observations correlating a high
frequency of hard to solve Hamiltonian Cycle instances with the Gn,m
phase transition between Hamiltonicity and non-Hamiltonicity. Instead
all tested graphs of 100 to 1500 vertices are easily solved.
When we artificially restrict the degree sequence with a bounded
maximum degree, although there is some increase in difficulty, the
frequency of hard graphs is still low. When we consider more regular
graphs based on a generalization of knight's tours, we observe
frequent instances of really hard graphs, but on these the average
degree is bounded by a constant. We design a set of graphs with a
feature our algorithm is unable to detect and so are very hard for our
algorithm, but in these we can vary the average degree from O(1) to
O(n). We have so far found no class of graphs correlated with the
Gn,m phase transition which asymptotically produces a high frequency
of hard instances.
Wiebe, J.M., O'Hara, T.P., Ohrstrom-Sandgren, T. and McKeever, K.J. (1998)
"An Empirical Approach to Temporal Reference Resolution",
Volume 9, pages 247-293.
PostScript: volume9/wiebe98a.ps (726K)
compressed, volume9/wiebe98a.ps.Z (226K)
Online Appendix1: volume9/wiebe98a-appendix.ps (353K) Temporal reference resolution algorithm
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume9/wiebe98a-html/journal98.html
PDF: volume9/wiebe98a.pdf (483K)
Abstract: Scheduling dialogs, during which people negotiate the
times of appointments, are common in everyday life. This paper
reports the results of an in-depth empirical investigation of
resolving explicit temporal references in scheduling dialogs. There
are four phases of this work: data annotation and evaluation, model
development, system implementation and evaluation, and model
evaluation and analysis. The system and model were developed
primarily on one set of data, and then applied later to a much more
complex data set, to assess the generalizability of the model for the
task being performed. Many different types of empirical methods are
applied to pinpoint the strengths and weaknesses of the approach.
Detailed annotation instructions were developed and an intercoder
reliability study was performed, showing that naive annotators can
reliably perform the targeted annotations. A fully automatic system
has been developed and evaluated on unseen test data, with good
results on both data sets. We adopt a pure realization of a
recency-based focus model to identify precisely when it is and is not
adequate for the task being addressed. In addition to system results,
an in-depth evaluation of the model itself is presented, based on
detailed manual annotations. The results are that few errors occur
specifically due to the model of focus being used, and the set of
anaphoric relations defined in the model are low in ambiguity for both
data sets.
Mazer, E., Ahuactzin, J.M., and Bessiere, P. (1998)
"The Ariadne's Clew Algorithm",
Volume 9, pages 295-316.
PostScript: volume9/mazer98a.ps (1.29M)
compressed, volume9/mazer98a.ps.Z (288K)
Online Appendix1: volume9/mazer98a-appendix1.tar (421K) Source code
Online Appendix2: volume9/mazer98a-appendix2.tar (12.9M) Quicktime movies
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume9/mazer98a-html/ariane.html
PDF: volume9/mazer98a.pdf (825K)
Abstract: We present a new approach to path planning, called the
``Ariadne's clew algorithm''. It is designed to find paths in
high-dimensional continuous spaces and applies to robots with many
degrees of freedom in static, as well as dynamic environments --- ones
where obstacles may move. The Ariadne's clew algorithm comprises two
sub-algorithms, called SEARCH and EXPLORE, applied in an interleaved
manner. EXPLORE builds a representation of the accessible space while
SEARCH looks for the target. Both are posed as optimization problems.
We describe a real implementation of the algorithm to plan paths for a
six degrees of freedom arm in a dynamic environment where another six
degrees of freedom arm is used as a moving obstacle. Experimental
results show that a path is found in about one second without any
pre-processing.
Di Caro, G. and Dorigo, M. (1998)
"AntNet: Distributed Stigmergetic Control for Communications Networks",
Volume 9, pages 317-365.
PostScript: volume9/dicaro98a.ps (1.22M)
compressed, volume9/dicaro98a.ps.Z (390K)
PDF: volume9/dicaro98a.pdf (704K)
Abstract: This paper introduces AntNet, a novel approach to the
adaptive learning of routing tables in communications networks.
AntNet is a distributed, mobile agents based Monte Carlo system that
was inspired by recent work on the ant colony metaphor for solving
optimization problems. AntNet's agents concurrently explore the
network and exchange collected information. The communication among
the agents is indirect and asynchronous, mediated by the network
itself. This form of communication is typical of social insects and is
called stigmergy. We compare our algorithm with six state-of-the-art
routing algorithms coming from the telecommunications and machine
learning fields. The algorithms' performance is evaluated over a set
of realistic testbeds. We run many experiments over real and
artificial IP datagram networks with increasing number of nodes and
under several paradigmatic spatial and temporal traffic distributions.
Results are very encouraging. AntNet showed superior performance
under all the experimental conditions with respect to its competitors.
We analyze the main characteristics of the algorithm and try to
explain the reasons for its superiority.
Fox, M. and Long, D. (1998)
"The Automatic Inference of State Invariants in TIM",
Volume 9, pages 367-421.
PostScript: volume9/fox98a.ps (423K)
compressed, volume9/fox98a.ps.Z (173K)
Online Appendix1: volume9/fox98a-appendix.tar (266K) TIM code and examples
PDF: volume9/fox98a.pdf (433K)
Abstract: As planning is applied to larger and richer domains the
effort involved in constructing domain descriptions increases and
becomes a significant burden on the human application designer. If
general planners are to be applied successfully to large and complex
domains it is necessary to provide the domain designer with some
assistance in building correctly encoded domains. One way of doing
this is to provide domain-independent techniques for extracting, from
a domain description, knowledge that is implicit in that description
and that can assist domain designers in debugging domain
descriptions. This knowledge can also be exploited to improve the
performance of planners: several researchers have explored the
potential of state invariants in speeding up the performance of
domain-independent planners. In this paper we describe a process by
which state invariants can be extracted from the automatically
inferred type structure of a domain. These techniques are being
developed for exploitation by STAN, a Graphplan based planner that
employs state analysis techniques to enhance its performance.
Rintanen, J. (1998)
"Complexity of Prioritized Default Logics",
Volume 9, pages 423-461.
PostScript: volume9/rintanen98a.ps (474K)
compressed, volume9/rintanen98a.ps.Z (211K)
PDF: volume9/rintanen98a.pdf (1.7M)
Abstract: In default reasoning, usually not all possible ways of
resolving conflicts between default rules are acceptable. Criteria
expressing acceptable ways of resolving the conflicts may be hardwired
in the inference mechanism, for example specificity in inheritance
reasoning can be handled this way, or they may be given abstractly as
an ordering on the default rules. In this article we investigate
formalizations of the latter approach in Reiter's default logic. Our
goal is to analyze and compare the computational properties of three
such formalizations in terms of their computational complexity: the
prioritized default logics of Baader and Hollunder, and Brewka, and a
prioritized default logic that is based on lexicographic comparison.
The analysis locates the propositional variants of these logics on the
second and third levels of the polynomial hierarchy, and identifies
the boundary between tractable and intractable inference for
restricted classes of prioritized default theories.
Artale, A. and Franconi, E. (1998)
"A Temporal Description Logic for Reasoning about Actions and Plans",
Volume 9, pages 463-506.
PostScript: volume9/artale98a.ps (1.0M)
compressed, volume9/artale98a.ps.Z (292K)
PDF: volume9/artale98a.pdf (699K)
Abstract: A class of interval-based temporal languages for uniformly
representing and reasoning about actions and plans is presented.
Actions are represented by describing what is true while the action
itself is occurring, and plans are constructed by temporally relating
actions and world states. The temporal languages are members of the
family of Description Logics, which are characterized by high
expressivity combined with good computational properties. The
subsumption problem for a class of temporal Description Logics is
investigated and sound and complete decision procedures are given. The
basic language TL-F is considered first: it is the composition of a
temporal logic TL -- able to express interval temporal networks --
together with the non-temporal logic F -- a Feature Description Logic.
It is proven that subsumption in this language is an NP-complete
problem. Then it is shown how to reason with the more expressive
languages TLU-FU and TL-ALCF. The former adds disjunction both at the
temporal and non-temporal sides of the language, the latter extends
the non-temporal side with set-valued features (i.e., roles) and a
propositionally complete language.
Volume10
Davis, E. (1999)
"Order of Magnitude Comparisons of Distance",
Volume 10, pages 1-38.
PostScript: volume10/davis99a.ps (416K)
compressed, volume10/davis99a.ps.Z (188K)
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume10/davis99a-html/om-dist.jair.html
PDF: volume10/davis99a.pdf (461K)
Abstract: Order of magnitude reasoning - reasoning by rough
comparisons of the sizes of quantities - is often called `back of
the envelope calculation', with the implication that the calculations
are quick though approximate. This paper exhibits an interesting
class of constraint sets in which order of magnitude reasoning is
demonstrably fast. Specifically, we present a polynomial-time
algorithm that can solve a set of constraints of the form `Points a
and b are much closer together than points c and d.' We prove that
this algorithm can be applied if `much closer together' is
interpreted either as referring to an infinite difference in scale or
as referring to a finite difference in scale, as long as the
difference in scale is greater than the number of variables in the
constraint set. We also prove that the first-order theory over such
constraints is decidable.
Hogg, T. (1999)
"Solving Highly Constrained Search Problems with Quantum Computers",
Volume 10, pages 39-66.
PostScript: volume10/hogg99a.ps (252K)
compressed, volume10/hogg99a.ps.Z (116K)
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume10/hogg99a-html/paper.html
PDF: volume10/hogg99a.pdf (1076K)
Abstract: A previously developed quantum search algorithm for solving
1-SAT problems in a single step is generalized to apply to a range
of highly constrained k-SAT problems. We identify a bound on the
number of clauses in satisfiability problems for which the
generalized algorithm can find a solution in a constant number of
steps as the number of variables increases. This performance
contrasts with the linear growth in the number of steps required by
the best classical algorithms, and the exponential number required
by classical and quantum methods that ignore the problem
structure. In some cases, the algorithm can also guarantee that
insoluble problems in fact have no solutions, unlike previously
proposed quantum search algorithms.
Halpern, J.Y. (1999)
"A Counterexample to Theorems of Cox and Fine",
Volume 10, pages 67-85.
PostScript: volume10/halpern99a.ps (501K)
compressed, volume10/halpern99a.ps.Z (148K)
PDF: volume10/halpern99a.pdf (257K)
Abstract: Cox's well-known theorem justifying the use of probability
is shown not to hold in finite domains. The counterexample also
suggests that Cox's assumptions are insufficient to prove the result
even in infinite domains. The same counterexample is used to disprove
a result of Fine on comparative conditional probability.
Long, D. and Fox, M. (1999)
"Efficient Implementation of the Plan Graph in STAN",
Volume 10, pages 87-115.
PostScript: volume10/long99a.ps (453K)
compressed, volume10/long99a.ps.Z (179K)
Online Appendix1: volume10/long99a-appendix.tar (931K) Code and data
PDF: volume10/long99a.pdf (422K)
Abstract: STAN is a Graphplan-based planner, so-called because it uses
a variety of STate ANalysis techniques to enhance its performance.
STAN competed in the AIPS-98 planning competition where it compared
well with the other competitors in terms of speed, finding solutions
fastest to many of the problems posed. Although the domain analysis
techniques STAN exploits are an important factor in its overall
performance, we believe that the speed at which STAN solved the
competition problems is largely due to the implementation of its plan
graph. The implementation is based on two insights: that many of the
graph construction operations can be implemented as bit-level logical
operations on bit vectors, and that the graph should not be explicitly
constructed beyond the fix point. This paper describes the
implementation of STAN's plan graph and provides experimental results
which demonstrate the circumstances under which advantages can be
obtained from using this implementation.
Friedman, N. and Halpern, J.Y. (1999)
"Modeling Belief in Dynamic Systems, Part II: Revision and Update",
Volume 10, pages 117-167.
PostScript: volume10/friedman99a.ps (552K)
compressed, volume10/friedman99a.ps.Z (214K)
PDF: volume10/friedman99a.pdf (541K)
Abstract: The study of belief change has been an active area in
philosophy and AI. In recent years two special cases of belief
change, belief revision and belief update, have been studied in
detail. In a companion paper (Friedman & Halpern, 1997), we introduce
a new framework to model belief change. This framework combines
temporal and epistemic modalities with a notion of plausibility,
allowing us to examine the change of beliefs over time. In this paper,
we show how belief revision and belief update can be captured in our
framework. This allows us to compare the assumptions made by each
method, and to better understand the principles underlying them. In
particular, it shows that Katsuno and Mendelzon's notion of belief
update (Katsuno & Mendelzon, 1991a) depends on several strong
assumptions that may limit its applicability in artificial
intelligence. Finally, our analysis allow us to identify a notion of
minimal change that underlies a broad range of belief change
operations including revision and update.
Fuchs, D. and Fuchs, M. (1999)
"Cooperation between Top-Down and Bottom-Up Theorem Provers",
Volume 10, pages 169-198.
PostScript: volume10/fuchs99a.ps (308K)
compressed, volume10/fuchs99a.ps.Z (134K)
PDF: volume10/fuchs99a.pdf (333K)
Abstract: Top-down and bottom-up theorem proving approaches each
have specific advantages and disadvantages. Bottom-up provers profit
from strong redundancy control but suffer from the lack of
goal-orientation, whereas top-down provers are goal-oriented but often
have weak calculi when their proof lengths are considered. In order
to integrate both approaches, we try to achieve cooperation between a
top-down and a bottom-up prover in two different ways: The first
technique aims at supporting a bottom-up with a top-down prover. A
top-down prover generates subgoal clauses, they are then processed by
a bottom-up prover. The second technique deals with the use of
bottom-up generated lemmas in a top-down prover. We apply our concept
to the areas of model elimination and superposition. We discuss the
ability of our techniques to shorten proofs as well as to reorder the
search space in an appropriate manner. Furthermore, in order to
identify subgoal clauses and lemmas which are actually relevant for
the proof task, we develop methods for a relevancy-based filtering.
Experiments with the provers SETHEO and SPASS performed in the problem
library TPTP reveal the high potential of our cooperation approaches.
Lukasiewicz, T. (1999)
"Probabilistic Deduction with Conditional Constraints over Basic Events",
Volume 10, pages 199-241.
PostScript: volume10/lukasiewicz99a.ps (768K)
compressed, volume10/lukasiewicz99a.ps.Z (290K)
PDF: volume10/lukasiewicz99a.pdf (674K)
Abstract: We study the problem of probabilistic deduction with
conditional constraints over basic events. We show that globally
complete probabilistic deduction with conditional constraints over
basic events is NP-hard. We then concentrate on the special case of
probabilistic deduction in conditional constraint trees. We elaborate
very efficient techniques for globally complete probabilistic
deduction. In detail, for conditional constraint trees with point
probabilities, we present a local approach to globally complete
probabilistic deduction, which runs in linear time in the size of the
conditional constraint trees. For conditional constraint trees with
interval probabilities, we show that globally complete probabilistic
deduction can be done in a global approach by solving nonlinear
programs. We show how these nonlinear programs can be transformed into
equivalent linear programs, which are solvable in polynomial time in
the size of the conditional constraint trees.
Cohen, W.W., Schapire, R.E., and Singer, Y. (1999)
"Learning to Order Things",
Volume 10, pages 243-270.
PostScript: volume10/cohen99a.ps (512K)
compressed, volume10/cohen99a.ps.Z (229K)
PDF: volume10/cohen99a.pdf (445K)
Abstract: There are many applications in which it is desirable to
order rather than classify instances. Here we consider the problem of
learning how to order instances given feedback in the form of
preference judgments, i.e., statements to the effect that one instance
should be ranked ahead of another. We outline a two-stage approach in
which one first learns by conventional means a binary preference
function indicating whether it is advisable to rank one instance
before another. Here we consider an on-line algorithm for learning
preference functions that is based on Freund and Schapire's 'Hedge'
algorithm. In the second stage, new instances are ordered so as to
maximize agreement with the learned preference function. We show that
the problem of finding the ordering that agrees best with a learned
preference function is NP-complete. Nevertheless, we describe simple
greedy algorithms that are guaranteed to find a good approximation.
Finally, we show how metasearch can be formulated as an ordering
problem, and present experimental results on learning a combination of
'search experts', each of which is a domain-specific query expansion
strategy for a web search engine.
Ting, K.M. and Witten, I.H. (1999)
"Issues in Stacked Generalization",
Volume 10, pages 271-289.
PostScript: volume10/ting99a.ps (469K)
compressed, volume10/ting99a.ps.Z (140K)
PDF: volume10/ting99a.pdf (245K)
Abstract: Stacked generalization is a general method of using a
high-level model to combine lower-level models to achieve greater
predictive accuracy. In this paper we address two crucial issues
which have been considered to be a `black art' in classification tasks
ever since the introduction of stacked generalization in 1992 by
Wolpert: the type of generalizer that is suitable to derive the
higher-level model, and the kind of attributes that should be used as
its input. We find that best results are obtained when the
higher-level model combines the confidence (and not just the
predictions) of the lower-level ones.
We demonstrate the effectiveness of stacked generalization for combining
three different types of learning algorithms for classification tasks.
We also compare the performance of stacked generalization with
majority vote and published results of arcing and bagging.
Jaakkola, T.S. and Jordan, M.I. (1999)
"Variational Probabilistic Inference and the QMR-DT Network",
Volume 10, pages 291-322.
PostScript: volume10/jaakkola99a.ps (894K)
compressed, volume10/jaakkola99a.ps.Z (249K)
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume10/jaakkola99a-html/paper.html
PDF: volume10/jaakkola99a.pdf (695K)
Abstract: We describe a variational approximation method for efficient
inference in large-scale probabilistic models. Variational methods
are deterministic procedures that provide approximations to marginal
and conditional probabilities of interest. They provide alternatives
to approximate inference methods based on stochastic sampling or
search. We describe a variational approach to the problem of
diagnostic inference in the `Quick Medical Reference' (QMR) network.
The QMR network is a large-scale probabilistic graphical model built
on statistical and expert knowledge. Exact probabilistic inference is
infeasible in this model for all but a small set of cases. We
evaluate our variational inference algorithm on a large set of
diagnostic test cases, comparing the algorithm to a state-of-the-art
stochastic sampling method.
Rintanen, J. (1999)
"Constructing Conditional Plans by a Theorem-Prover",
Volume 10, pages 323-352.
PostScript: volume10/rintanen99a.ps (354K)
compressed, volume10/rintanen99a.ps.Z (161K)
PDF: volume10/rintanen99a.pdf (1.1M)
Abstract: The research on conditional planning rejects the assumptions
that there is no uncertainty or incompleteness of knowledge with
respect to the state and changes of the system the plans operate on.
Without these assumptions the sequences of operations that achieve the
goals depend on the initial state and the outcomes of nondeterministic
changes in the system. This setting raises the questions of how to
represent the plans and how to perform plan search. The answers are
quite different from those in the simpler classical framework. In
this paper, we approach conditional planning from a new viewpoint that
is motivated by the use of satisfiability algorithms in classical
planning. Translating conditional planning to formulae in the
propositional logic is not feasible because of inherent computational
limitations. Instead, we translate conditional planning to quantified
Boolean formulae. We discuss three formalizations of conditional
planning as quantified Boolean formulae, and present experimental
results obtained with a theorem-prover.
Joslin, D.E. and Clements, D.P. (1999)
"Squeaky Wheel Optimization",
Volume 10, pages 353-373.
PostScript: volume10/joslin99a.ps (474K)
compressed, volume10/joslin99a.ps.Z (148K)
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume10/joslin99a-html/swo.html
PDF: volume10/joslin99a.pdf (254K)
Abstract: We describe a general approach to optimization which we term
`Squeaky Wheel' Optimization (SWO). In SWO, a greedy algorithm is
used to construct a solution which is then analyzed to find the
trouble spots, i.e., those elements, that, if improved, are likely to
improve the objective function score. The results of the analysis are
used to generate new priorities that determine the order in which the
greedy algorithm constructs the next solution. This
Construct/Analyze/Prioritize cycle continues until some limit is
reached, or an acceptable solution is found.
SWO can be viewed as operating on two search spaces: solutions and
prioritizations. Successive solutions are only indirectly related,
via the re-prioritization that results from analyzing the prior
solution. Similarly, successive prioritizations are generated by
constructing and analyzing solutions. This `coupled search' has some
interesting properties, which we discuss.
We report encouraging experimental results on two domains, scheduling
problems that arise in fiber-optic cable manufacturing, and graph
coloring problems. The fact that these domains are very different
supports our claim that SWO is a general technique for optimization.
Chien, S., Stechert, A. and Mutz, D. (1999)
"Efficient Heuristic Hypothesis Ranking",
Volume 10, pages 375-397.
PostScript: volume10/chien99a.ps (709K)
compressed, volume10/chien99a.ps.Z (263K)
PDF: volume10/chien99a.pdf (460K)
Abstract: This paper considers the problem of learning the ranking of
a set of stochastic alternatives based upon incomplete information
(i.e., a limited number of samples). We describe a system that, at
each decision cycle, outputs either a complete ordering on the
hypotheses or decides to gather additional information (i.e.,
observations) at some cost. The ranking problem is a generalization
of the previously studied hypothesis selection problem - in selection,
an algorithm must select the single best hypothesis, while in ranking,
an algorithm must order all the hypotheses.
The central problem we address is achieving the desired ranking
quality while minimizing the cost of acquiring additional samples. We
describe two algorithms for hypothesis ranking and their application
for the probably approximately correct (PAC) and expected loss (EL)
learning criteria. Empirical results are provided to demonstrate the
effectiveness of these ranking procedures on both synthetic and
real-world datasets.
Borgida, A. (1999)
"Extensible Knowledge Representation: the Case of Description Reasoners",
Volume 10, pages 399-434.
PostScript: volume10/borgida99a.ps (1099K)
compressed, volume10/borgida99a.ps.Z (609K)
PDF: volume10/borgida99a.pdf (534K)
Abstract: This paper offers an approach to extensible knowledge
representation and reasoning for a family of formalisms known as
Description Logics. The approach is based on the notion of adding new
concept constructors, and includes a heuristic methodology for
specifying the desired extensions, as well as a modularized software
architecture that supports implementing extensions. The architecture
detailed here falls in the normalize-compared paradigm, and supports
both intentional reasoning (subsumption) involving concepts, and
extensional reasoning involving individuals after incremental updates
to the knowledge base.
The resulting approach can be used to extend the reasoner with
specialized notions that are motivated by specific problems or
application areas, such as reasoning about dates, plans, etc. In
addition, it provides an opportunity to implement constructors that
are not currently yet sufficiently well understood theoretically, but
are needed in practice. Also, for constructors that are provably hard
to reason with (e.g., ones whose presence would lead to
undecidability), it allows the implementation of incomplete reasoners
where the incompleteness is tailored to be acceptable for the
application at hand.
Barber, D. and van de Laar, P. (1999)
"Variational Cumulant Expansions for Intractable Distributions",
Volume 10, pages 435-455.
PostScript: volume10/barber99a.ps (430K)
compressed, volume10/barber99a.ps.Z (186K)
PDF: volume10/barber99a.pdf (3378K)
Abstract: Intractable distributions present a common difficulty in
inference within the probabilistic knowledge representation framework
and variational methods have recently been popular in providing an
approximate solution. In this article, we describe a perturbational
approach in the form of a cumulant expansion which, to lowest order,
recovers the standard Kullback-Leibler variational bound.
Higher-order terms describe corrections on the variational approach
without incurring much further computational cost. The relationship
to other perturbational approaches such as TAP is also elucidated. We
demonstrate the method on a particular class of undirected graphical
models, Boltzmann machines, for which our simulation results confirm
improved accuracy and enhanced stability during learning.
Birnbaum, E. and Lozinskii, E.L. (1999)
"The Good Old Davis-Putnam Procedure Helps Counting Models",
Volume 10, pages 457-477.
PostScript: volume10/birnbaum99a.ps (738K)
compressed, volume10/birnbaum99a.ps.Z (308K)
PDF: volume10/birnbaum99a.pdf (253K)
Abstract: As was shown recently, many important AI problems require
counting the number of models of propositional formulas. The problem
of counting models of such formulas is, according to present
knowledge, computationally intractable in a worst case. Based on the
Davis-Putnam procedure, we present an algorithm, CDP, that computes
the exact number of models of a propositional CNF or DNF formula
F. Let m and n be the number of clauses and variables of F,
respectively, and let p denote the probability that a literal l of F
occurs in a clause C of F, then the average running time of CDP is
shown to be O(nm^d), where d=-1/log(1-p). The practical
performance of CDP has been estimated in a series of experiments on a
wide variety of CNF formulas.
Volume11
Boutilier, C., Dean, T. and Hanks, S. (1999)
"Decision-Theoretic Planning: Structural Assumptions and Computational Leverage",
Volume 11, pages 1-94.
PostScript: volume11/boutilier99a.ps (1.3M)
compressed, volume11/boutilier99a.ps.Z (527K)
PDF: volume11/boutilier99a.pdf (942K)
Abstract: Planning under uncertainty is a central problem in the study of
automated sequential decision making, and has been addressed by
researchers in many different fields, including AI planning, decision
analysis, operations research, control theory and economics. While
the assumptions and perspectives adopted in these areas often differ
in substantial ways, many planning problems of interest to researchers
in these fields can be modeled as Markov decision processes (MDPs)
and analyzed using the techniques of decision theory.
This paper presents an overview and synthesis of MDP-related methods,
showing how they provide a unifying framework for modeling many
classes of planning problems studied in AI. It also describes
structural properties of MDPs that, when exhibited by particular
classes of problems, can be exploited in the construction of optimal
or approximately optimal policies or plans. Planning problems
commonly possess structure in the reward and value functions used to
describe performance criteria, in the functions used to describe state
transitions and observations, and in the relationships among features
used to describe states, actions, rewards, and observations.
Specialized representations, and algorithms employing these
representations, can achieve computational leverage by exploiting
these various forms of structure. Certain AI techniques -- in
particular those based on the use of structured, intensional
representations -- can be viewed in this way. This paper surveys
several types of representations for both classical and
decision-theoretic planning problems, and planning algorithms that
exploit these representations in a number of different ways to ease
the computational burden of constructing policies or plans. It focuses
primarily on abstraction, aggregation and decomposition techniques
based on AI-style representations.
Resnik, P. (1999)
"Semantic Similarity in a Taxonomy: An Information-Based Measure and its Application to Problems of Ambiguity in Natural Language",
Volume 11, pages 95-130.
PostScript: volume11/resnik99a.ps (351K)
compressed, volume11/resnik99a.ps.Z (148K)
PDF: volume11/resnik99a.pdf (396K)
Abstract: This article presents a measure of semantic similarity in an
IS-A taxonomy based on the notion of shared information content.
Experimental evaluation against a benchmark set of human similarity
judgments demonstrates that the measure performs better than the
traditional edge-counting approach. The article presents algorithms
that take advantage of taxonomic similarity in resolving syntactic and
semantic ambiguity, along with experimental results demonstrating
their effectiveness.
Brodley, C.E. and Friedl, M.A. (1999)
"Identifying Mislabeled Training Data",
Volume 11, pages 131-167.
PostScript: volume11/brodley99a.ps (3.4M)
compressed, volume11/brodley99a.ps.Z (742K)
PDF: volume11/brodley99a.pdf (455K)
Abstract: This paper presents a new approach to identifying and
eliminating mislabeled training instances for supervised learning. The
goal of this approach is to improve classification accuracies produced
by learning algorithms by improving the quality of the training data.
Our approach uses a set of learning algorithms to create classifiers
that serve as noise filters for the training data. We evaluate single
algorithm, majority vote and consensus filters on five datasets that
are prone to labeling errors. Our experiments illustrate that
filtering significantly improves classification accuracy for noise
levels up to 30 percent. An analytical and empirical evaluation of
the precision of our approach shows that consensus filters are
conservative at throwing away good data at the expense of retaining
bad data and that majority filters are better at detecting bad data at
the expense of throwing away good data. This suggests that for
situations in which there is a paucity of data, consensus filters are
preferable, whereas majority vote filters are preferable for
situations with an abundance of data.
Opitz, D. and Maclin, R. (1999)
"Popular Ensemble Methods: An Empirical Study",
Volume 11, pages 169-198.
PostScript: volume11/opitz99a.ps (1.1M)
compressed, volume11/opitz99a.ps.Z (559K)
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume11/opitz99a-html/paper.html
PDF: volume11/opitz99a.pdf (281K)
Abstract: An ensemble consists of a set of individually trained
classifiers (such as neural networks or decision trees) whose
predictions are combined when classifying novel instances. Previous
research has shown that an ensemble is often more accurate than any of
the single classifiers in the ensemble. Bagging (Breiman, 1996c) and
Boosting (Freund & Shapire, 1996; Shapire, 1990) are two relatively
new but popular methods for producing ensembles. In this paper we
evaluate these methods on 23 data sets using both neural networks and
decision trees as our classification algorithm. Our results clearly
indicate a number of conclusions. First, while Bagging is almost
always more accurate than a single classifier, it is sometimes much
less accurate than Boosting. On the other hand, Boosting can create
ensembles that are less accurate than a single classifier --
especially when using neural networks. Analysis indicates that the
performance of the Boosting methods is dependent on the
characteristics of the data set being examined. In fact, further
results show that Boosting ensembles may overfit noisy data sets, thus
decreasing its performance. Finally, consistent with previous
studies, our work suggests that most of the gain in an ensemble's
performance comes in the first few classifiers combined; however,
relatively large gains can be seen up to 25 classifiers when Boosting
decision trees.
Calvanese, D., Lenzerini, M. and Nardi, D. (1999)
"Unifying Class-Based Representation Formalisms",
Volume 11, pages 199-240.
PostScript: volume11/calvanese99a.ps (490K)
compressed, volume11/calvanese99a.ps.Z (221K)
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume11/calvanese99a-html/calvanese99a-html.html
PDF: volume11/calvanese99a.pdf (558K)
Abstract: The notion of class is ubiquitous in computer science and is
central in many formalisms for the representation of structured
knowledge used both in knowledge representation and in databases. In
this paper we study the basic issues underlying such representation
formalisms and single out both their common characteristics and their
distinguishing features. Such investigation leads us to propose a
unifying framework in which we are able to capture the fundamental
aspects of several representation languages used in different
contexts. The proposed formalism is expressed in the style of
description logics, which have been introduced in knowledge
representation as a means to provide a semantically well-founded basis
for the structural aspects of knowledge representation systems. The
description logic considered in this paper is a subset of first order
logic with nice computational characteristics. It is quite expressive
and features a novel combination of constructs that has not been
studied before. The distinguishing constructs are number
restrictions, which generalize existence and functional dependencies,
inverse roles, which allow one to refer to the inverse of a
relationship, and possibly cyclic assertions, which are necessary for
capturing real world domains. We are able to show that it is
precisely such combination of constructs that makes our logic powerful
enough to model the essential set of features for defining class
structures that are common to frame systems, object-oriented database
languages, and semantic data models. As a consequence of the
established correspondences, several significant extensions of each of
the above formalisms become available. The high expressiveness of the
logic we propose and the need for capturing the reasoning in different
contexts forces us to distinguish between unrestricted and finite
model reasoning. A notable feature of our proposal is that reasoning
in both cases is decidable. We argue that, by virtue of the high
expressive power and of the associated reasoning capabilities on both
unrestricted and finite models, our logic provides a common core for
class-based representation formalisms.
Moriarty, D.E., Schultz, A.C., and Grefenstette, J.J. (1999)
"Evolutionary Algorithms for Reinforcement Learning",
Volume 11, pages 241-276.
PostScript: volume11/moriarty99a.ps (406K)
compressed, volume11/moriarty99a.ps.Z (168K)
PDF: volume11/moriarty99a.pdf (299K)
Abstract: There are two distinct approaches to solving reinforcement
learning problems, namely, searching in value function space and
searching in policy space. Temporal difference methods and
evolutionary algorithms are well-known examples of these approaches.
Kaelbling, Littman and Moore recently provided an informative survey
of temporal difference methods. This article focuses on the
application of evolutionary algorithms to the reinforcement learning
problem, emphasizing alternative policy representations, credit
assignment methods, and problem-specific genetic operators. Strengths
and weaknesses of the evolutionary approach to reinforcement learning
are presented, along with a survey of representative applications.
Rosati, R. (1999)
"Reasoning about Minimal Belief and Negation as Failure",
Volume 11, pages 277-300.
PostScript: volume11/rosati99a.ps (530K)
compressed, volume11/rosati99a.ps.Z (157K)
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume11/rosati99a-html/rosati99a-html.html
PDF: volume11/rosati99a.pdf (353K)
Abstract: We investigate the problem of reasoning in the propositional
fragment of MBNF, the logic of minimal belief and negation as failure
introduced by Lifschitz, which can be considered as a unifying
framework for several nonmonotonic formalisms, including default
logic, autoepistemic logic, circumscription, epistemic queries, and
logic programming. We characterize the complexity and provide
algorithms for reasoning in propositional MBNF. In particular, we
show that entailment in propositional MBNF lies at the third level of
the polynomial hierarchy, hence it is harder than reasoning in all the
above mentioned propositional formalisms for nonmonotonic reasoning.
We also prove the exact correspondence between negation as failure in
MBNF and negative introspection in Moore's autoepistemic logic.
Ygge, F. and Akkermans, H. (1999)
"Decentralized Markets versus Central Control: A Comparative Study",
Volume 11, pages 301-333.
PostScript: volume11/ygge99a.ps (1.0M)
compressed, volume11/ygge99a.ps.Z (459K)
PDF: volume11/ygge99a.pdf (455K)
Abstract: Multi-Agent Systems (MAS) promise to offer solutions to
problems where established, older paradigms fall short. In order to
validate such claims that are repeatedly made in software agent
publications, empirical in-depth studies of advantages and weaknesses
of multi-agent solutions versus conventional ones in practical
applications are needed. Climate control in large buildings is one
application area where multi-agent systems, and market-oriented
programming in particular, have been reported to be very successful,
although central control solutions are still the standard practice.
We have therefore constructed and implemented a variety of market
designs for this problem, as well as different standard control
engineering solutions. This article gives a detailed analysis and
comparison, so as to learn about differences between standard versus
agent approaches, and yielding new insights about benefits and
limitations of computational markets. An important outcome is that
``local information plus market communication produces global
control''.
Argamon-Engelson, S. and Dagan, I. (1999)
"Committee-Based Sample Selection for Probabilistic Classifiers",
Volume 11, pages 335-360.
PostScript: volume11/argamon99a.ps (375K)
compressed, volume11/argamon99a.ps.Z (459K)
PDF: volume11/argamon99a.pdf (135K)
Abstract: In many real-world learning tasks, it is expensive to
acquire a sufficient number of labeled examples for training. This
paper investigates methods for reducing annotation cost by `sample
selection'. In this approach, during training the learning program
examines many unlabeled examples and selects for labeling only those
that are most informative at each stage. This avoids redundantly
labeling examples that contribute little new information.
Our work follows on previous research on Query By Committee, extending
the committee-based paradigm to the context of probabilistic
classification. We describe a family of empirical methods for
committee-based sample selection in probabilistic classification
models, which evaluate the informativeness of an example by measuring
the degree of disagreement between several model variants. These
variants (the committee) are drawn randomly from a probability
distribution conditioned by the training set labeled so far.
The method was applied to the real-world natural language processing
task of stochastic part-of-speech tagging. We find that all variants
of the method achieve a significant reduction in annotation cost,
although their computational efficiency differs. In particular, the
simplest variant, a two member committee with no parameters to tune,
gives excellent results. We also show that sample selection yields a
significant reduction in the size of the model used by the tagger.
Cristani, M. (1999)
"The Complexity of Reasoning about Spatial Congruence",
Volume 11, pages 361-390.
PostScript: volume11/cristani99a.ps (709K)
compressed, volume11/cristani99a.ps.Z (218K)
Online Appendix1: volume11/cristani99a-appendix.ps (102K) Lisp code
PDF: volume11/cristani99a.pdf (553K)
Abstract: In the recent literature of Artificial Intelligence, an
intensive research effort has been spent, for various algebras of
qualitative relations used in the representation of temporal and
spatial knowledge, on the problem of classifying the computational
complexity of reasoning problems for subsets of algebras. The main
purpose of these researches is to describe a restricted set of maximal
tractable subalgebras, ideally in an exhaustive fashion with respect
to the hosting algebras.
In this paper we introduce a novel algebra for reasoning about Spatial
Congruence, show that the satisfiability problem in the spatial
algebra MC-4 is NP-complete, and present a complete classification of
tractability in the algebra, based on the individuation of three
maximal tractable subclasses, one containing the basic relations. The
three algebras are formed by 14, 10 and 9 relations out of 16 which
form the full algebra.
Fox, D., Burgard, W. and Thrun, S. (1999)
"Markov Localization for Mobile Robots in Dynamic Environments",
Volume 11, pages 391-427.
PostScript: volume11/fox99a.ps (13.5M)
compressed, volume11/fox99a.ps.Z (3.2M)
Online Appendix1: http://www.cs.cmu.edu/~dfox/museum-projects.html
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume11/fox99a-html/jair-localize.html
PDF: volume11/fox99a.pdf (5.3M)
Abstract: Localization, that is the estimation of a robot's location
from sensor data, is a fundamental problem in mobile robotics. This
papers presents a version of Markov localization which provides
accurate position estimates and which is tailored towards dynamic
environments. The key idea of Markov localization is to maintain a
probability density over the space of all locations of a robot in its
environment. Our approach represents this space metrically, using a
fine-grained grid to approximate densities. It is able to globally
localize the robot from scratch and to recover from localization
failures. It is robust to approximate models of the environment (such
as occupancy grid maps) and noisy sensors (such as ultrasound
sensors). Our approach also includes a filtering technique which
allows a mobile robot to reliably estimate its position even in
densely populated environments in which crowds of people block the
robot's sensors for extended periods of time. The method described
here has been implemented and tested in several real-world
applications of mobile robots, including the deployments of two mobile
robots as interactive museum tour-guides.
Halpern, J.Y. (1999)
"Cox's Theorem Revisited",
Volume 11, pages 429-435.
PostScript: volume11/halpern99b.ps (128K)
compressed, volume11/halpern99b.ps.Z (52K)
PDF: volume11/halpern99b.pdf (154K)
Abstract: The assumptions needed to prove Cox's Theorem are discussed
and examined. Various sets of assumptions under which a Cox-style
theorem can be proved are provided, although all are rather strong
and, arguably, not natural.
Volume12
Kambhampati, S. (2000)
"Planning Graph as a (Dynamic) CSP: Exploiting EBL, DDB and other CSP Search Techniques in Graphplan",
Volume 12, pages 1-34.
PostScript: volume12/kambhampati00a.ps (859K)
compressed, volume12/kambhampati00a.ps.Z (302K)
PDF: volume12/kambhampati00a.pdf (248K)
Abstract: This paper reviews the connections between Graphplan's
planning-graph and the dynamic constraint satisfaction problem and
motivates the need for adapting CSP search techniques to the Graphplan
algorithm. It then describes how explanation based learning,
dependency directed backtracking, dynamic variable ordering, forward
checking, sticky values and random-restart search strategies can be
adapted to Graphplan. Empirical results are provided to demonstrate
that these augmentations improve Graphplan's performance significantly
(up to 1000x speedups) on several benchmark problems. Special
attention is paid to the explanation-based learning and dependency
directed backtracking techniques as they are empirically found to be
most useful in improving the performance of Graphplan.
Barber, F. (2000)
"Reasoning on Interval and Point-based Disjunctive Metric Constraints in Temporal Contexts",
Volume 12, pages 35-86.
PostScript: volume12/barber00a.ps (2.8M)
compressed, volume12/barber00a.ps.Z (500K)
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume12/barber00a-html/Barber.html
PDF: volume12/barber00a.pdf (241K)
Abstract: We introduce a temporal model for reasoning on disjunctive
metric constraints on intervals and time points in temporal
contexts. This temporal model is composed of a labeled temporal
algebra and its reasoning algorithms. The labeled temporal algebra
defines labeled disjunctive metric point-based constraints, where each
disjunct in each input disjunctive constraint is univocally associated
to a label. Reasoning algorithms manage labeled constraints,
associated label lists, and sets of mutually inconsistent
disjuncts. These algorithms guarantee consistency and obtain a minimal
network. Additionally, constraints can be organized in a hierarchy of
alternative temporal contexts. Therefore, we can reason on
context-dependent disjunctive metric constraints on intervals and
points. Moreover, the model is able to represent non-binary
constraints, such that logical dependencies on disjuncts in
constraints can be handled. The computational cost of reasoning
algorithms is exponential in accordance with the underlying problem
complexity, although some improvements are proposed.
Neal, R.M. (2000)
"On Deducing Conditional Independence from d-Separation in Causal Graphs with Feedback (Research Note)",
Volume 12, pages 87-91.
PostScript: volume12/neal00a.ps (287K)
compressed, volume12/neal00a.ps.Z (81K)
PDF: volume12/neal00a.pdf (132K)
Abstract: Pearl and Dechter (1996) claimed that the d-separation
criterion for conditional independence in acyclic causal networks also
applies to networks of discrete variables that have feedback cycles,
provided that the variables of the system are uniquely determined by
the random disturbances. I show by example that this is not true in
general. Some condition stronger than uniqueness is needed, such as
the existence of a causal dynamics guaranteed to lead to the unique
solution.
Xu, K. and Li, W. (2000)
"Exact Phase Transitions in Random Constraint Satisfaction Problems",
Volume 12, pages 93-103.
PostScript: volume12/xu00a.ps (208K)
compressed, volume12/xu00a.ps.Z (101K)
PDF: volume12/xu00a.pdf (268K)
Abstract: In this paper we propose a new type of random CSP model,
called Model RB, which is a revision to the standard Model B. It is
proved that phase transitions from a region where almost all problems
are satisfiable to a region where almost all problems are
unsatisfiable do exist for Model RB as the number of variables
approaches infinity. Moreover, the critical values at which the phase
transitions occur are also known exactly. By relating the hardness of
Model RB to Model B, it is shown that there exist a lot of hard
instances in Model RB.
Kaminka, G.A. and Tambe, M. (2000)
"Robust Agent Teams via Socially-Attentive Monitoring",
Volume 12, pages 105-147.
PostScript: volume12/kaminka00a.ps (553K)
compressed, volume12/kaminka00a.ps.Z (243K)
PDF: volume12/kaminka00a.pdf (537K)
Abstract: Agents in dynamic multi-agent environments must monitor
their peers to execute individual and group plans. A key open question
is how much monitoring of other agents' states is required to be
effective: The Monitoring Selectivity Problem. We investigate this
question in the context of detecting failures in teams of cooperating
agents, via Socially-Attentive Monitoring, which focuses on monitoring
for failures in the social relationships between the agents. We
empirically and analytically explore a family of socially-attentive
teamwork monitoring algorithms in two dynamic, complex, multi-agent
domains, under varying conditions of task distribution and
uncertainty. We show that a centralized scheme using a complex
algorithm trades correctness for completeness and requires monitoring
all teammates. In contrast, a simple distributed teamwork monitoring
algorithm results in correct and complete detection of teamwork
failures, despite relying on limited, uncertain knowledge, and
monitoring only key agents in a team. In addition, we report on the
design of a socially-attentive monitoring system and demonstrate its
generality in monitoring several coordination relationships,
diagnosing detected failures, and both on-line and off-line applications.
Baxter, J. (2000)
"A Model of Inductive Bias Learning",
Volume 12, pages 149-198.
PostScript: volume12/baxter00a.ps (476K)
compressed, volume12/baxter00a.ps.Z (216K)
PDF: volume12/baxter00a.pdf (398K)
Abstract: A major problem in machine learning is that of inductive
bias: how to choose a learner's hypothesis space so that it is large
enough to contain a solution to the problem being learnt, yet small
enough to ensure reliable generalization from reasonably-sized
training sets. Typically such bias is supplied by hand through the
skill and insights of experts. In this paper a model for automatically
learning bias is investigated. The central assumption of the model is
that the learner is embedded within an environment of related learning
tasks. Within such an environment the learner can sample from multiple
tasks, and hence it can search for a hypothesis space that contains
good solutions to many of the problems in the environment. Under
certain restrictions on the set of all hypothesis spaces available to
the learner, we show that a hypothesis space that performs well on a
sufficiently large number of training tasks will also perform well
when learning novel tasks in the same environment. Explicit bounds
are also derived demonstrating that learning multiple tasks within an
environment of related tasks can potentially give much better
generalization than learning a single task.
Tobies, S. (2000)
"The Complexity of Reasoning with Cardinality Restrictions and Nominals in Expressive Description Logics",
Volume 12, pages 199-217.
PostScript: volume12/tobies00a.ps (306K)
compressed, volume12/tobies00a.ps.Z (144K)
Online Appendix1: volume12/tobies00a-appendix1.html (2K) online bibliography
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume12/tobies00a-html/tobies00a.html
PDF: volume12/tobies00a.pdf (369K)
Abstract: We study the complexity of the combination of the
Description Logics ALCQ and ALCQI with a terminological formalism
based on cardinality restrictions on concepts. These combinations can
naturally be embedded into C^2, the two variable fragment of predicate
logic with counting quantifiers, which yields decidability in
NExpTime. We show that this approach leads to an optimal solution for
ALCQI, as ALCQI with cardinality restrictions has the same complexity
as C^2 (NExpTime-complete). In contrast, we show that for ALCQ, the
problem can be solved in ExpTime. This result is obtained by a
reduction of reasoning with cardinality restrictions to reasoning with
the (in general weaker) terminological formalism of general axioms for
ALCQ extended with nominals. Using the same reduction, we show that,
for the extension of ALCQI with nominals, reasoning with general
axioms is a NExpTime-complete problem. Finally, we sharpen this result
and show that pure concept satisfiability for ALCQI with nominals is
NExpTime-complete. Without nominals, this problem is known to be
PSpace-complete.
Becker, A., Bar-Yehuda, R. and Geiger, D. (2000)
"Randomized Algorithms for the Loop Cutset Problem",
Volume 12, pages 219-234.
PostScript: volume12/becker00a.ps (215K)
compressed, volume12/becker00a.ps.Z (86K)
PDF: volume12/becker00a.pdf (247K)
Abstract: We show how to find a minimum weight loop cutset in a
Bayesian network with high probability. Finding such a loop cutset is
the first step in the method of conditioning for inference. Our
randomized algorithm for finding a loop cutset outputs a minimum loop
cutset after O(c 6^k kn) steps with probability at least
1 - (1 - 1/(6^k))^c6^k, where c > 1 is a constant specified by the
user, k is the minimal size of a minimum weight loop cutset, and n is
the number of vertices. We also show empirically that a variant of
this algorithm often finds a loop cutset that is closer to the minimum
weight loop cutset than the ones found by the best deterministic
algorithms known.
Singer, J., Gent, I.P. and Smaill, A. (2000)
"Backbone Fragility and the Local Search Cost Peak",
Volume 12, pages 235-270.
PostScript: volume12/singer00a.ps (1.2M)
compressed, volume12/singer00a.ps.Z (326K)
PDF: volume12/singer00a.pdf (811K)
Abstract: The local search algorithm WSat is one of the most
successful algorithms for solving the satisfiability (SAT) problem. It
is notably effective at solving hard Random 3-SAT instances near the
so-called `satisfiability threshold', but still shows a peak in search
cost near the threshold and large variations in cost over different
instances. We make a number of significant contributions to the
analysis of WSat on high-cost random instances, using the
recently-introduced concept of the backbone of a SAT instance. The
backbone is the set of literals which are entailed by an instance. We
find that the number of solutions predicts the cost well for
small-backbone instances but is much less relevant for the
large-backbone instances which appear near the threshold and dominate
in the overconstrained region. We show a very strong correlation
between search cost and the Hamming distance to the nearest solution
early in WSat's search. This pattern leads us to introduce a measure
of the backbone fragility of an instance, which indicates how
persistent the backbone is as clauses are removed. We propose that
high-cost random instances for local search are those with very large
backbones which are also backbone-fragile. We suggest that the decay
in cost beyond the satisfiability threshold is due to increasing
backbone robustness (the opposite of backbone fragility). Our
hypothesis makes three correct predictions. First, that the backbone
robustness of an instance is negatively correlated with the local
search cost when other factors are controlled for. Second, that
backbone-minimal instances (which are 3-SAT instances altered so as to
be more backbone-fragile) are unusually hard for WSat. Third, that the
clauses most often unsatisfied during search are those whose deletion
has the most effect on the backbone. In understanding the pathologies
of local search methods, we hope to contribute to the development of
new and better techniques.
Nebel, B. (2000)
"On the Compilability and Expressive Power of Propositional Planning Formalisms",
Volume 12, pages 271-315.
PostScript: volume12/nebel00a.ps (419K)
compressed, volume12/nebel00a.ps.Z (186K)
PDF: volume12/nebel00a.pdf (293K)
Abstract: The recent approaches of extending the GRAPHPLAN algorithm
to handle more expressive planning formalisms raise the question of
what the formal meaning of ``expressive power'' is. We formalize the
intuition that expressive power is a measure of how concisely planning
domains and plans can be expressed in a particular formalism by
introducing the notion of ``compilation schemes'' between planning
formalisms. Using this notion, we analyze the expressiveness of a
large family of propositional planning formalisms, ranging from basic
STRIPS to a formalism with conditional effects, partial state
specifications, and propositional formulae in the preconditions. One
of the results is that conditional effects cannot be compiled away if
plan size should grow only linearly but can be compiled away if we
allow for polynomial growth of the resulting plans. This result
confirms that the recently proposed extensions to the GRAPHPLAN
algorithm concerning conditional effects are optimal with respect to
the ``compilability'' framework. Another result is that general
propositional formulae cannot be compiled into conditional effects if
the plan size should be preserved linearly. This implies that
allowing general propositional formulae in preconditions and effect
conditions adds another level of difficulty in generating a plan.
Halpern, J.Y. (2000)
"Axiomatizing Causal Reasoning",
Volume 12, pages 317-337.
PostScript: volume12/halpern00a.ps (565K)
compressed, volume12/halpern00a.ps.Z (170K)
PDF: volume12/halpern00a.pdf (275K)
Abstract: Causal models defined in terms of a collection of equations,
as defined by Pearl, are axiomatized here. Axiomatizations are
provided for three successively more general classes of causal models:
(1) the class of recursive theories (those without feedback), (2) the
class of theories where the solutions to the equations are unique,
(3) arbitrary theories (where the equations may not have solutions and, if
they do, they are not necessarily unique). It is shown that to reason
about causality in the most general third class, we must extend the
language used by Galles and Pearl. In addition, the complexity of the
decision procedures is characterized for all the languages and classes
of models considered.
Koehler, J. and Hoffmann, J. (2000)
"On Reasonable and Forced Goal Orderings and their Use in an Agenda-Driven Planning Algorithm",
Volume 12, pages 338-386.
PostScript: volume12/koehler00a.ps (541K)
compressed, volume12/koehler00a.ps.Z (244K)
Online Appendix1: volume12/koehler00a.html (1k) Source code and domains
PDF: volume12/koehler00a.pdf (598K)
Abstract: The paper addresses the problem of computing goal orderings,
which is one of the longstanding issues in AI planning. It makes two
new contributions. First, it formally defines and discusses two
different goal orderings, which are called the reasonable and the
forced ordering. Both orderings are defined for simple STRIPS
operators as well as for more complex ADL operators supporting
negation and conditional effects. The complexity of these orderings is
investigated and their practical relevance is discussed. Secondly, two
different methods to compute reasonable goal orderings are developed.
One of them is based on planning graphs, while the other investigates
the set of actions directly. Finally, it is shown how the ordering
relations, which have been derived for a given set of goals G, can be
used to compute a so-called goal agenda that divides G into an ordered
set of subgoals. Any planner can then, in principle, use the goal
agenda to plan for increasing sets of subgoals. This can lead to an
exponential complexity reduction, as the solution to a complex
planning problem is found by solving easier subproblems. Since only a
polynomial overhead is caused by the goal agenda computation, a
potential exists to dramatically speed up planning algorithms as we
demonstrate in the empirical evaluation, where we use this method in
the IPP planner.
Walker, M.A. (2000)
"An Application of Reinforcement Learning to Dialogue Strategy Selection in a Spoken Dialogue System for Email",
Volume 12, pages 387-416.
PostScript: volume12/walker00a.ps (346K)
compressed, volume12/walker00a.ps.Z (165K)
PDF: volume12/walker00a.pdf (419K)
Abstract: This paper describes a novel method by which a spoken
dialogue system can learn to choose an optimal dialogue strategy from
its experience interacting with human users. The method is based on a
combination of reinforcement learning and performance modeling of
spoken dialogue systems. The reinforcement learning component applies
Q-learning (Watkins, 1989), while the performance modeling component
applies the PARADISE evaluation framework (Walker et al., 1997) to
learn the performance function (reward) used in reinforcement
learning. We illustrate the method with a spoken dialogue system
named ELVIS (EmaiL Voice Interactive System), that supports access to
email over the phone. We conduct a set of experiments for training an
optimal dialogue strategy on a corpus of 219 dialogues in which human
users interact with ELVIS over the phone. We then test that strategy
on a corpus of 18 dialogues. We show that ELVIS can learn to optimize
its strategy selection for agent initiative, for reading messages, and
for summarizing email folders.
Volume13
Cadoli, M., Donini, F.M., Liberatore, P., and Schaerf, M. (2000)
"Space Efficiency of Propositional Knowledge Representation Formalisms",
Volume 13, pages 1-31.
PostScript: volume13/cadoli00a.ps (511K)
compressed, volume13/cadoli00a.ps.Z (256K)
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume13/cadoli00a-html/cadoli00a.html
PDF: volume13/cadoli00a.pdf (305K)
Abstract: We investigate the space efficiency of a Propositional
Knowledge Representation (PKR) formalism. Intuitively, the space
efficiency of a formalism F in representing a certain piece of
knowledge A, is the size of the shortest formula of F that
represents A. In this paper we assume that knowledge is
either a set of propositional interpretations (models) or a set of
propositional formulae (theorems). We provide a formal way of
talking about the relative ability of PKR formalisms to compactly
represent a set of models or a set of theorems. We introduce two new
compactness measures, the corresponding classes, and show that the
relative space efficiency of a PKR formalism in representing
models/theorems is directly related to such classes. In particular,
we consider formalisms for nonmonotonic reasoning, such as
circumscription and default logic, as well as belief revision
operators and the stable model semantics for logic programs with
negation. One interesting result is that formalisms with the same
time complexity do not necessarily belong to the same space
efficiency class.
Hauskrecht, M. (2000)
"Value-Function Approximations for Partially Observable Markov Decision Processes",
Volume 13, pages 33-94.
PostScript: volume13/hauskrecht00a.ps (1534K)
compressed, volume13/hauskrecht00a.ps.Z (532K)
PDF: volume13/hauskrecht00a.pdf (654K)
Abstract: Partially observable Markov decision processes (POMDPs)
provide an elegant mathematical framework for modeling complex
decision and planning problems in stochastic domains in which states
of the system are observable only indirectly, via a set of imperfect
or noisy observations. The modeling advantage of POMDPs, however,
comes at a price -- exact methods for solving them are computationally
very expensive and thus applicable in practice only to very simple
problems. We focus on efficient approximation (heuristic) methods that
attempt to alleviate the computational problem and trade off accuracy
for speed. We have two objectives here. First, we survey various
approximation methods, analyze their properties and relations and
provide some new insights into their differences. Second, we present a
number of new approximation methods and novel refinements of existing
techniques. The theoretical results are supported by experiments on a
problem from the agent navigation domain.
Gordon, D.F. (2000)
"Asimovian Adaptive Agents",
Volume 13, pages 95-153.
PostScript: volume13/gordon00a.ps (943K)
compressed, volume13/gordon00a.ps.Z (284K)
PDF: volume13/gordon00a.pdf (584K)
Abstract: The goal of this research is to develop agents that
are adaptive and predictable and timely. At first blush,
these three requirements seem contradictory. For example,
adaptation risks introducing undesirable side effects,
thereby making agents' behavior less predictable. Furthermore,
although formal verification can assist in ensuring
behavioral predictability, it is known to be time-consuming.
Our solution to the challenge of satisfying all three
requirements is the following. Agents have finite-state
automaton plans, which are adapted online via evolutionary
learning (perturbation) operators. To ensure that critical
behavioral constraints are always satisfied, agents' plans
are first formally verified. They are then reverified after
every adaptation. If reverification concludes that constraints
are violated, the plans are repaired. The main objective of
this paper is to improve the efficiency of reverification
after learning, so that agents have a sufficiently rapid
response time. We present two solutions: positive results
that certain learning operators are a priori guaranteed to
preserve useful classes of behavioral assurance constraints
(which implies that no reverification is needed for these
operators), and efficient incremental reverification algorithms
for those learning operators that have negative a priori results.
Cheng, J. and Druzdzel, M.J. (2000)
"AIS-BN: An Adaptive Importance Sampling Algorithm for Evidential Reasoning in Large Bayesian Networks",
Volume 13, pages 155-188.
PostScript: volume13/cheng00a.ps (593K)
compressed, volume13/cheng00a.ps.Z (260K)
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume13/cheng00a-html/cheng00a-hmtl.html
PDF: volume13/cheng00a.pdf (314K)
Abstract: Stochastic sampling algorithms, while an attractive alternative to
exact algorithms in very large Bayesian network models, have been
observed to perform poorly in evidential reasoning with extremely
unlikely evidence. To address this problem, we propose an adaptive
importance sampling algorithm, AIS-BN, that shows promising
convergence rates even under extreme conditions and seems to
outperform the existing sampling algorithms consistently. Three
sources of this performance improvement are (1) two heuristics for
initialization of the importance function that are based on the
theoretical properties of importance sampling in finite-dimensional
integrals and the structural advantages of Bayesian networks, (2) a
smooth learning method for the importance function, and (3) a dynamic
weighting function for combining samples from different stages of the
algorithm.
We tested the performance of the AIS-BN algorithm along with two state
of the art general purpose sampling algorithms, likelihood weighting
(Fung & Chang, 1989; Shachter & Peot, 1989) and self-importance
sampling (Shachter & Peot, 1989). We used in our tests three large
real Bayesian network models available to the scientific community:
the CPCS network (Pradhan et al., 1994), the PathFinder network
(Heckerman, Horvitz, & Nathwani, 1990), and the ANDES network (Conati,
Gertner, VanLehn, & Druzdzel, 1997), with evidence as unlikely as
10^-41. While the AIS-BN algorithm always performed better than the
other two algorithms, in the majority of the test cases it achieved
orders of magnitude improvement in precision of the results.
Improvement in speed given a desired precision is even more dramatic,
although we are unable to report numerical results here, as the other
algorithms almost never achieved the precision reached even by the
first few iterations of the AIS-BN algorithm.
Jensen, R.M. and Veloso, M.M. (2000)
"OBDD-based Universal Planning for Synchronized Agents in Non-Deterministic Domains",
Volume 13, pages 189-226.
PostScript: volume13/jensen00a.ps (405K)
compressed, volume13/jensen00a.ps.Z (178K)
PDF: volume13/jensen00a.pdf (470K)
Online Appendix1: volume13/jensen00a-appendix1.tar.gz, UMOP Planner (994K)
Abstract: Recently model checking representation and search techniques
were shown to be efficiently applicable to planning, in particular to
non-deterministic planning. Such planning approaches use Ordered
Binary Decision Diagrams (OBDDs) to encode a planning domain as a
non-deterministic finite automaton and then apply fast algorithms from
model checking to search for a solution. OBDDs can effectively scale
and can provide universal plans for complex planning domains. We are
particularly interested in addressing the complexities arising in
non-deterministic, multi-agent domains. In this article, we present
UMOP, a new universal OBDD-based planning framework for
non-deterministic, multi-agent domains. We introduce a new planning
domain description language, NADL, to specify non-deterministic,
multi-agent domains. The language contributes the explicit definition
of controllable agents and uncontrollable environment agents. We
describe the syntax and semantics of NADL and show how to build an
efficient OBDD-based representation of an NADL description. The UMOP
planning system uses NADL and different OBDD-based universal planning
algorithms. It includes the previously developed strong and strong
cyclic planning algorithms. In addition, we introduce our new
optimistic planning algorithm that relaxes optimality guarantees and
generates plausible universal plans in some domains where no strong
nor strong cyclic solution exists. We present empirical results
applying UMOP to domains ranging from deterministic and single-agent
with no environment actions to non-deterministic and multi-agent with
complex environment actions. UMOP is shown to be a rich and efficient
planning system.
Dietterich, T.G. (2000)
"Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition",
Volume 13, pages 227-303.
PostScript: volume13/dietterich00a.ps (1682K)
compressed, volume13/dietterich00a.ps.Z (464K)
PDF: volume13/dietterich00a.pdf (967K)
Abstract: This paper presents a new approach to hierarchical
reinforcement learning based on decomposing the target Markov decision
process (MDP) into a hierarchy of smaller MDPs and decomposing the
value function of the target MDP into an additive combination of the
value functions of the smaller MDPs. The decomposition, known as the
MAXQ decomposition, has both a procedural semantics---as a subroutine
hierarchy---and a declarative semantics---as a representation of the
value function of a hierarchical policy. MAXQ unifies and extends
previous work on hierarchical reinforcement learning by Singh,
Kaelbling, and Dayan and Hinton. It is based on the assumption that
the programmer can identify useful subgoals and define subtasks that
achieve these subgoals. By defining such subgoals, the programmer
constrains the set of policies that need to be considered during
reinforcement learning. The MAXQ value function decomposition can
represent the value function of any policy that is consistent with the
given hierarchy. The decomposition also creates opportunities to
exploit state abstractions, so that individual MDPs within the
hierarchy can ignore large parts of the state space. This is
important for the practical application of the method. This paper
defines the MAXQ hierarchy, proves formal results on its
representational power, and establishes five conditions for the safe
use of state abstractions. The paper presents an online model-free
learning algorithm, MAXQ-Q, and proves that it converges with
probability 1 to a kind of locally-optimal policy known as a
recursively optimal policy, even in the presence of the five kinds of
state abstraction. The paper evaluates the MAXQ representation and
MAXQ-Q through a series of experiments in three domains and shows
experimentally that MAXQ-Q (with state abstractions) converges to a
recursively optimal policy much faster than flat Q learning. The fact
that MAXQ learns a representation of the value function has an
important benefit: it makes it possible to compute and execute an
improved, non-hierarchical policy via a procedure similar to the
policy improvement step of policy iteration. The paper demonstrates
the effectiveness of this non-hierarchical execution experimentally.
Finally, the paper concludes with a comparison to related work and a
discussion of the design tradeoffs in hierarchical reinforcement learning.
Cimatti, A. and Roveri, M. (2000)
"Conformant Planning via Symbolic Model Checking",
Volume 13, pages 305-338.
PostScript: volume13/cimatti00a.ps (600K)
compressed, volume13/cimatti00a.ps.Z (251K)
Online Appendix1: volume13/cimatti00a-appendix1.tar.gz (199K) CMBP Planner
PDF: volume13/cimatti00a.pdf (529K)
Abstract: We tackle the problem of planning in nondeterministic
domains, by presenting a new approach to conformant planning.
Conformant planning is the problem of finding a sequence of actions
that is guaranteed to achieve the goal despite the nondeterminism of
the domain. Our approach is based on the representation of the
planning domain as a finite state automaton. We use Symbolic Model
Checking techniques, in particular Binary Decision Diagrams, to
compactly represent and efficiently search the automaton. In this
paper we make the following contributions. First, we present a general
planning algorithm for conformant planning, which applies to fully
nondeterministic domains, with uncertainty in the initial condition
and in action effects. The algorithm is based on a breadth-first,
backward search, and returns conformant plans of minimal length, if a
solution to the planning problem exists, otherwise it terminates
concluding that the problem admits no conformant solution. Second, we
provide a symbolic representation of the search space based on Binary
Decision Diagrams (BDDs), which is the basis for search techniques
derived from symbolic model checking. The symbolic representation
makes it possible to analyze potentially large sets of states and
transitions in a single computation step, thus providing for an
efficient implementation. Third, we present CMBP (Conformant Model
Based Planner), an efficient implementation of the data structures and
algorithm described above, directly based on BDD manipulations, which
allows for a compact representation of the search layers and an
efficient implementation of the search steps. Finally, we present an
experimental comparison of our approach with the state-of-the-art
conformant planners CGP, QBFPLAN and GPT. Our analysis includes all
the planning problems from the distribution packages of these systems,
plus other problems defined to stress a number of specific factors.
Our approach appears to be the most effective: CMBP is strictly more
expressive than QBFPLAN and CGP and, in all the problems where a
comparison is possible, CMBP outperforms its competitors, sometimes by
orders of magnitude.
Volume 14
Brafman, R.I. (2001)
"On Reachability, Relevance, and Resolution in the Planning as Satisfiability Approach",
Volume 14, pages 1-28.
PostScript: volume14/brafman01a.ps (352K)
compressed, volume14/brafman01a.ps.Z (162K)
PDF: volume14/brafman01a.pdf (354K)
Abstract: In recent years, there is a growing awareness of the
importance of reachability and relevance-based pruning techniques for
planning, but little work specifically targets these techniques. In
this paper, we compare the ability of two classes of algorithms to
propagate and discover reachability and relevance constraints in
classical planning problems. The first class of algorithms operates on
SAT encoded planning problems obtained using the linear and Graphplan
encoding schemes. It applies unit-propagation and more general
resolution steps (involving larger clauses) to these plan encodings.
The second class operates at the plan level and contains two families
of pruning algorithms: Reachable-k and Relevant-k. Reachable-k
provides a coherent description of a number of existing forward
pruning techniques used in numerous algorithms, while Relevant-k
captures different grades of backward pruning. Our results shed light
on the ability of different plan-encoding schemes to propagate
information forward and backward and on the relative merit of
plan-level and SAT-level pruning methods.
Zhang, N.L. and Zhang, W. (2001)
"Speeding Up the Convergence of Value Iteration in Partially Observable Markov Decision Processes",
Volume 14, pages 29-51.
PostScript: volume14/zhang01a.ps (291K)
compressed, volume14/zhang01a.ps.Z (138K)
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume14/zhang01a-html/zhang01a.html
PDF: volume14/zhang01a.pdf (299K)
Abstract: Partially observable Markov decision processes (POMDPs) have
recently become popular among many AI researchers because they serve
as a natural model for planning under uncertainty. Value iteration is
a well-known algorithm for finding optimal policies for POMDPs. It
typically takes a large number of iterations to converge. This paper
proposes a method for accelerating the convergence of value iteration.
The method has been evaluated on an array of benchmark problems and
was found to be very effective: It enabled value iteration to converge
after only a few iterations on all the test problems.
Chen, X. and van Beek, P. (2001)
"Conflict-Directed Backjumping Revisited",
Volume 14, pages 53-81.
PostScript: volume14/chen01a.ps (391K)
compressed, volume14/chen01a.ps.Z (156K)
Online Appendix1: volume14/chen01a-appendix1.tar (552K) Source code
PDF: volume14/chen01a.pdf (298K)
Abstract: In recent years, many improvements to backtracking algorithms for
solving constraint satisfaction problems have been proposed.
The techniques for improving backtracking algorithms can
be conveniently classified as look-ahead schemes and
look-back schemes. Unfortunately, look-ahead and look-back
schemes are not entirely orthogonal as it has been observed
empirically that the enhancement of look-ahead techniques
is sometimes counterproductive to the effects of look-back
techniques. In this paper, we focus on the relationship between
the two most important look-ahead techniques---using a variable
ordering heuristic and maintaining a level of local consistency
during the backtracking search---and the look-back technique of
conflict-directed backjumping (CBJ). We show that there exists
a ``perfect'' dynamic variable ordering such that CBJ becomes
redundant. We also show theoretically that as the level of local
consistency that is maintained in the backtracking search is
increased, the less that backjumping will be an improvement.
Our theoretical results partially explain why a backtracking
algorithm doing more in the look-ahead phase cannot benefit
more from the backjumping look-back scheme. Finally, we show
empirically that adding CBJ to a backtracking algorithm that
maintains generalized arc consistency (GAC), an algorithm that
we refer to as GAC-CBJ, can still provide orders of magnitude
speedups. Our empirical results contrast with Bessiere and
Regin's conclusion (1996) that CBJ is useless to an algorithm
that maintains arc consistency.
Lusena, C., Goldsmith, J. and Mundhenk, M. (2001)
"Nonapproximability Results for Partially Observable Markov Decision Processes",
Volume 14, pages 83-103.
PostScript: volume14/lusena01a.ps (800K)
compressed, volume14/lusena01a.ps.Z (247K)
PDF: volume14/lusena01a.pdf (401K)
Abstract: We show that for several variations of partially observable
Markov decision processes, polynomial-time algorithms for finding
control policies are unlikely to or simply don't have guarantees of
finding policies within a constant factor or a constant summand of
optimal. Here ``unlikely'' means ``unless some complexity classes
collapse,'' where the collapses considered are P=NP, P=PSPACE, or
P=EXP. Until or unless these collapses are shown to hold, any
control-policy designer must choose between such performance
guarantees and efficient computation.
Boutilier, C. and Brafman, R.I. (2001)
"Partial-Order Planning with Concurrent Interacting Actions",
Volume 14, pages 105-136.
PostScript: volume14/boutilier01a.ps (405K)
compressed, volume14/boutilier01a.ps.Z (164K)
PDF: volume14/boutilier01a.pdf (337K)
Abstract: In order to generate plans for agents with multiple
actuators, agent teams, or distributed controllers, we must be able to
represent and plan using concurrent actions with interacting
effects. This has historically been considered a challenging task
requiring a temporal planner with the ability to reason explicitly
about time. We show that with simple modifications, the STRIPS action
representation language can be used to represent interacting actions.
Moreover, algorithms for partial-order planning require only small
modifications in order to be applied in such multiagent domains. We
demonstrate this fact by developing a sound and complete partial-order
planner for planning with concurrent interacting actions, POMP, that
extends existing partial-order planners in a straightforward
way. These results open the way to the use of partial-order planners
for the centralized control of cooperative multiagent systems.
Straccia, U. (2001)
"Reasoning within Fuzzy Description Logics",
Volume 14, pages 137-166.
PostScript: volume14/straccia01a.ps (1564K)
compressed, volume14/straccia01a.ps.Z (804K)
PDF: volume14/straccia01a.pdf (272K)
Abstract: Description Logics (DLs) are suitable, well-known, logics
for managing structured knowledge. They allow reasoning about
individuals and well defined concepts, i.e., set of individuals with
common properties. The experience in using DLs in applications has
shown that in many cases we would like to extend their capabilities.
In particular, their use in the context of Multimedia Information
Retrieval (MIR) leads to the convincement that such DLs should allow
the treatment of the inherent imprecision in multimedia object content
representation and retrieval.
In this paper we will present a fuzzy extension of ALC, combining
Zadeh's fuzzy logic with a classical DL. In particular, concepts
becomes fuzzy and, thus, reasoning about imprecise concepts is
supported. We will define its syntax, its semantics, describe its
properties and present a constraint propagation calculus for reasoning
in it.
Kusters, R. and Borgida, A. (2001)
"What's in an Attribute? Consequences for the Least Common Subsumer",
Volume 14, pages 167-203.
PostScript: volume14/kuesters01a.ps (484K)
compressed, volume14/kuesters01a.ps.Z (217K)
PDF: volume14/kuesters01a.pdf (449K)
Abstract: Functional relationships between objects, called
`attributes', are of considerable importance in knowledge
representation languages, including Description Logics (DLs). A study
of the literature indicates that papers have made, often implicitly,
different assumptions about the nature of attributes: whether they are
always required to have a value, or whether they can be partial
functions. The work presented here is the first explicit study of this
difference for subclasses of the CLASSIC DL, involving the same-as
concept constructor. It is shown that although determining
subsumption between concept descriptions has the same complexity
(though requiring different algorithms), the story is different in the
case of determining the least common subsumer (lcs). For attributes
interpreted as partial functions, the lcs exists and can be computed
relatively easily; even in this case our results correct and extend
three previous papers about the lcs of DLs. In the case where
attributes must have a value, the lcs may not exist, and even if it
exists it may be of exponential size. Interestingly, it is possible
to decide in polynomial time if the lcs exists.
Debruyne, R. and Bessiere, C. (2001)
"Domain Filtering Consistencies",
Volume 14, pages 205-230.
PostScript: volume14/debruyne01a.ps (489K)
compressed, volume14/debruyne01a.ps.Z (203K)
PDF: volume14/debruyne01a.pdf (338K)
Abstract: Enforcing local consistencies is one of the main features
of constraint reasoning. Which level of local consistency should be
used when searching for solutions in a constraint network is a basic
question. Arc consistency and partial forms of arc consistency have
been widely studied, and have been known for sometime through the
forward checking or the MAC search algorithms. Until recently,
stronger forms of local consistency remained limited to those that
change the structure of the constraint graph, and thus, could not be
used in practice, especially on large networks. This paper focuses on
the local consistencies that are stronger than arc consistency,
without changing the structure of the network, i.e., only removing
inconsistent values from the domains. In the last five years, several
such local consistencies have been proposed by us or by others. We
make an overview of all of them, and highlight some relations between
them. We compare them both theoretically and experimentally,
considering their pruning efficiency and the time required to enforce
them.
Basu, C., Hirsh, H., Cohen, W.W., Nevill-Manning, C. (2001)
"Technical Paper Recommendation: A Study in Combining Multiple Information Sources",
Volume 14, pages 231-252.
PostScript: volume14/basu01a.ps (301K)
compressed, volume14/basu01a.ps.Z (110K)
PDF: volume14/basu01a.pdf (267K)
Abstract: The growing need to manage and exploit the proliferation of
online data sources is opening up new opportunities for bringing
people closer to the resources they need. For instance, consider a
recommendation service through which researchers can receive daily
pointers to journal papers in their fields of interest. We survey some
of the known approaches to the problem of technical paper
recommendation and ask how they can be extended to deal with multiple
information sources. More specifically, we focus on a variant of this
problem -- recommending conference paper submissions to reviewing
committee members -- which offers us a testbed to try different
approaches. Using WHIRL -- an information integration system -- we
are able to implement different recommendation algorithms derived from
information retrieval principles. We also use a novel autonomous
procedure for gathering reviewer interest information from the Web. We
evaluate our approach and compare it to other methods using preference
data provided by members of the AAAI-98 conference reviewing committee
along with data about the actual submissions.
Hoffmann, J. and Nebel, B. (2001)
"The FF Planning System: Fast Plan Generation Through Heuristic Search",
Volume 14, pages 253-302.
PostScript: volume14/hoffmann01a.ps (532K)
compressed, volume14/hoffmann01a.ps.Z (234K)
Online Appendix1: volume14/hoffmann01a-appendix1.tar (379K) C code for FF-v2.2 as used in AIPS-2000 competition
Online Appendix3: volume14/hoffmann01a-appendix3.tar (901) PDDL files, raw data and experimental results (gzipped)
PDF: volume14/hoffmann01a.pdf (531K)
Abstract: We describe and evaluate the algorithmic techniques that are
used in the FF planning system. Like the HSP system, FF relies on
forward state space search, using a heuristic that estimates goal
distances by ignoring delete lists. Unlike HSP's heuristic, our method
does not assume facts to be independent. We introduce a novel search
strategy that combines hill-climbing with systematic search, and we
show how other powerful heuristic information can be extracted and
used to prune the search space. FF was the most successful automatic
planner at the recent AIPS-2000 planning competition. We review the
results of the competition, give data for other benchmark domains, and
investigate the reasons for the runtime performance of FF compared to
HSP.
Ginsberg, M.L. (2001)
"GIB: Imperfect Information in a Computationally Challenging Game",
Volume 14, pages 303-358.
PostScript: volume14/ginsberg01a.ps (561K)
compressed, volume14/ginsberg01a.ps.Z (252K)
PDF: volume14/ginsberg01a.pdf (414K)
Abstract: This paper investigates the problems arising in the
construction of a program to play the game of contract bridge. These
problems include both the difficulty of solving the game's perfect
information variant, and techniques needed to address the fact that
bridge is not, in fact, a perfect information game. GIB, the program
being described, involves five separate technical advances: partition
search, the practical application of Monte Carlo techniques to
realistic problems, a focus on achievable sets to solve problems
inherent in the Monte Carlo approach, an extension of alpha-beta
pruning from total orders to arbitrary distributive lattices, and the
use of squeaky wheel optimization to find approximately optimal
solutions to cardplay problems.
GIB is currently believed to be of approximately expert caliber, and
is currently the strongest computer bridge program in the world.
Halpern, J.Y. (2001)
"Conditional Plausibility Measures and Bayesian Networks",
Volume 14, pages 359-389.
PostScript: volume14/halpern01a.ps (699K)
compressed, volume14/halpern01a.ps.Z (207K)
PDF: volume14/halpern01a.pdf (386K)
Abstract: A general notion of algebraic conditional plausibility
measures is defined. Probability measures, ranking functions,
possibility measures, and (under the appropriate definitions) sets of
probability measures can all be viewed as defining algebraic
conditional plausibility measures. It is shown that algebraic
conditional plausibility measures can be represented using Bayesian
networks.
Volume 15
Hong, J. (2001)
"Goal Recognition through Goal Graph Analysis",
Volume 15, pages 1-30.
PostScript: volume15/hong01a.ps (439K)
compressed, volume15/hong01a.ps.Z (218K)
PDF: volume15/hong01a.pdf (252K)
Abstract: We present a novel approach to goal recognition based on a
two-stage paradigm of graph construction and analysis. First, a graph
structure called a Goal Graph is constructed to represent the observed
actions, the state of the world, and the achieved goals as well as
various connections between these nodes at consecutive time
steps. Then, the Goal Graph is analysed at each time step to recognise
those partially or fully achieved goals that are consistent with the
actions observed so far. The Goal Graph analysis also reveals valid
plans for the recognised goals or part of these goals.
Our approach to goal recognition does not need a plan library. It
does not suffer from the problems in the acquisition and hand-coding
of large plan libraries, neither does it have the problems in
searching the plan space of exponential size. We describe two
algorithms for Goal Graph construction and analysis in this
paradigm. These algorithms are both provably sound, polynomial-time,
and polynomial-space. The number of goals recognised by our algorithms
is usually very small after a sequence of observed actions has been
processed. Thus the sequence of observed actions is well explained by
the recognised goals with little ambiguity. We have evaluated these
algorithms in the UNIX domain, in which excellent performance has been
achieved in terms of accuracy, efficiency, and scalability.
Siskind, J.M. (2001)
"Grounding the Lexical Semantics of Verbs in Visual Perception using Force Dynamics and Event Logic",
Volume 15, pages 31-90.
PostScript: volume15/siskind01a.ps.gz (18M)
or compressed, volume15/siskind01a.ps.Z (35M)
Online Appendix1: volume15/siskind01a-appendix1.tar.gz (25M) Software and video sequences
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume15/siskind01a-html/siskind2001a-html.html
PDF: volume15/siskind01a.pdf (16M)
Abstract: This paper presents an implemented system for recognizing
the occurrence of events described by simple spatial-motion verbs in
short image sequences. The semantics of these verbs is specified with
event-logic expressions that describe changes in the state of
force-dynamic relations between the participants of the event. An
efficient finite representation is introduced for the infinite sets of
intervals that occur when describing liquid and semi-liquid events.
Additionally, an efficient procedure using this representation is
presented for inferring occurrences of compound events, described with
event-logic expressions, from occurrences of primitive events. Using
force dynamics and event logic to specify the lexical semantics of
events allows the system to be more robust than prior systems based on
motion profile.
Bhattacharyya, C. and Keerthi, S.S. (2001)
"Mean Field Methods for a Special Class of Belief Networks",
Volume 15, pages 91-114.
PostScript: volume15/bhattacharyya01a.ps (421K)
compressed, volume15/bhattacharyya01a.ps.Z (159K)
PDF: volume15/bhattacharyya01a.pdf (298K)
Abstract: The chief aim of this paper is to propose mean-field
approximations for a broad class of Belief networks, of which sigmoid
and noisy-or networks can be seen as special cases. The
approximations are based on a powerful mean-field theory suggested by
Plefka. We show that Saul, Jaakkola and Jordan' s approach is the
first order approximation in Plefka's approach, via a variational
derivation. The application of Plefka's theory to belief networks is
not computationally tractable. To tackle this problem we propose new
approximations based on Taylor series. Small scale experiments show
that the proposed schemes are attractive.
Refanidis, I. and Vlahavas, I. (2001)
"The GRT Planning System: Backward Heuristic Construction in Forward State-Space Planning",
Volume 15, pages 115-161.
PostScript: volume15/refanidis01a.ps (2.9M)
compressed, volume15/refanidis01a.ps.Z (626K)
Online Appendix1: volume15/refanidis01a-appendix1.tar.gz (4.6M) Source code and experimental data
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume15/refanidis01a-html/refanidis01a.html
PDF: volume15/refanidis01a.pdf (591K)
Abstract: This paper presents GRT, a domain-independent heuristic
planning system for STRIPS worlds. GRT solves problems in two
phases. In the pre-processing phase, it estimates the distance between
each fact and the goals of the problem, in a backward direction. Then,
in the search phase, these estimates are used in order to further
estimate the distance between each intermediate state and the goals,
guiding so the search process in a forward direction and on a
best-first basis. The paper presents the benefits from the adoption of
opposite directions between the preprocessing and the search phases,
discusses some difficulties that arise in the pre-processing phase and
introduces techniques to cope with them. Moreover, it presents several
methods of improving the efficiency of the heuristic, by enriching the
representation and by reducing the size of the problem. Finally, a
method of overcoming local optimal states, based on domain axioms, is
proposed. According to it, difficult problems are decomposed into
easier sub-problems that have to be solved sequentially. The
performance results from various domains, including those of the
recent planning competitions, show that GRT is among the fastest
planners.
Elomaa, T. and Kaariainen, M. (2001)
"An Analysis of Reduced Error Pruning",
Volume 15, pages 163-187.
PostScript: volume15/elomaa01a.ps (356K)
compressed, volume15/elomaa01a.ps.Z (166K)
PDF: volume15/elomaa01a.pdf (387K)
Abstract: Top-down induction of decision trees has been observed to
suffer from the inadequate functioning of the pruning phase. In
particular, it is known that the size of the resulting tree grows
linearly with the sample size, even though the accuracy of the tree
does not improve. Reduced Error Pruning is an algorithm that has been
used as a representative technique in attempts to explain the problems
of decision tree learning.
In this paper we present analyses of Reduced Error Pruning in three
different settings. First we study the basic algorithmic properties
of the method, properties that hold independent of the input decision
tree and pruning examples. Then we examine a situation that
intuitively should lead to the subtree under consideration to be
replaced by a leaf node, one in which the class label and attribute
values of the pruning examples are independent of each other. This
analysis is conducted under two different assumptions. The general
analysis shows that the pruning probability of a node fitting pure
noise is bounded by a function that decreases exponentially as the
size of the tree grows. In a specific analysis we assume that the
examples are distributed uniformly to the tree. This assumption lets
us approximate the number of subtrees that are pruned because they do
not receive any pruning examples.
This paper clarifies the different variants of the Reduced Error
Pruning algorithm, brings new insight to its algorithmic properties,
analyses the algorithm with less imposed assumptions than before, and
includes the previously overlooked empty subtrees to the analysis.
Stone, P., Littman, M.L., Singh, S. and Kearns, M. (2001)
"ATTac-2000: An Adaptive Autonomous Bidding Agent",
Volume 15, pages 189-206.
PostScript: volume15/stone01a.ps (256K)
compressed, volume15/stone01a.ps.Z (106K)
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume15/stone01a-html/stone01a.html
PDF: volume15/stone01a.pdf (266K)
Abstract: The First Trading Agent Competition (TAC) was held from June
22nd to July 8th, 2000. TAC was designed to create a benchmark
problem in the complex domain of e-marketplaces and to motivate
researchers to apply unique approaches to a common task. This article
describes ATTac-2000, the first-place finisher in TAC. ATTac-2000
uses a principled bidding strategy that includes several elements of
adaptivity. In addition to the success at the competition, isolated
empirical results are presented indicating the robustness and
effectiveness of ATTac-2000's adaptive strategy.
Ambite, J.L. and Knoblock, C.A. (2001)
"Planning by Rewriting",
Volume 15, pages 207-261.
PostScript: volume15/ambite01a.ps (849K)
compressed, volume15/ambite01a.ps.Z (354K)
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume15/ambite01a-html/ambite01a.html
PDF: volume15/ambite01a.pdf (321K)
Abstract: Domain-independent planning is a hard combinatorial
problem. Taking into account plan quality makes the task even more
difficult. This article introduces Planning by Rewriting (PbR), a new
paradigm for efficient high-quality domain-independent planning. PbR
exploits declarative plan-rewriting rules and efficient local search
techniques to transform an easy-to-generate, but possibly suboptimal,
initial plan into a high-quality plan. In addition to addressing the
issues of planning efficiency and plan quality, this framework offers
a new anytime planning algorithm. We have implemented this planner and
applied it to several existing domains. The experimental results show
that the PbR approach provides significant savings in planning effort
while generating high-quality plans.
Palomar, M. and Martinez-Barco, P. (2001)
"Computational Approach to Anaphora Resolution in Spanish Dialogues",
Volume 15, pages 263-287.
PostScript: volume15/palomar01a.ps (747K)
compressed, volume15/palomar01a.ps.Z (249K)
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume15/palomar01a-html/palomar01a.html
PDF: volume15/palomar01a.pdf (344K)
Abstract: This paper presents an algorithm for identifying noun-phrase
antecedents of pronouns and adjectival anaphors in Spanish
dialogues. We believe that anaphora resolution requires numerous
sources of information in order to find the correct antecedent of the
anaphor. These sources can be of different kinds, e.g., linguistic
information, discourse/dialogue structure information, or topic
information. For this reason, our algorithm uses various different
kinds of information (hybrid information). The algorithm is based on
linguistic constraints and preferences and uses an anaphoric
accessibility space within which the algorithm finds the noun
phrase. We present some experiments related to this algorithm and this
space using a corpus of 204 dialogues. The algorithm is implemented in
Prolog. According to this study, 95.9% of antecedents were located in
the proposed space, a precision of 81.3% was obtained for pronominal
anaphora resolution, and 81.5% for adjectival anaphora.
Renz, J. and Nebel, B. (2001)
"Efficient Methods for Qualitative Spatial Reasoning",
Volume 15, pages 289-318.
PostScript: volume15/renz01a.ps (1.1M)
compressed, volume15/renz01a.ps.Z (350K)
Online Appendix1: volume15/renz01a-appendix.tar ( 204K) Source code
PDF: volume15/renz01a.pdf (1.0M)
Abstract: The theoretical properties of qualitative spatial reasoning
in the RCC8 framework have been analyzed extensively. However, no
empirical investigation has been made yet. Our experiments show that
the adaption of the algorithms used for qualitative temporal reasoning
can solve large RCC8 instances, even if they are in the phase
transition region -- provided that one uses the maximal tractable
subsets of RCC8 that have been identified by us. In particular, we
demonstrate that the orthogonal combination of heuristic methods is
successful in solving almost all apparently hard instances in the
phase transition region up to a certain size in reasonable time.
Baxter, J. and Bartlett, P.L. (2001)
"Infinite-Horizon Policy-Gradient Estimation",
Volume 15, pages 319-350.
PostScript: volume15/baxter01a.ps (333K)
compressed, volume15/baxter01a.ps.Z (159K)
PDF: volume15/baxter01a.pdf (266K)
Abstract: Gradient-based approaches to direct policy search in
reinforcement learning have received much recent attention as a means
to solve problems of partial observability and to avoid some of the
problems associated with policy degradation in value-function methods.
In this paper we introduce GPOMDP, a simulation-based algorithm for
generating a biased estimate of the gradient of the average reward in
Partially Observable Markov Decision Processes POMDPs controlled by
parameterized stochastic policies. A similar algorithm was proposed by
(Kimura et al. 1995). The algorithm's chief advantages are that it
requires storage of only twice the number of policy parameters, uses
one free beta (which has a natural interpretation in terms of
bias-variance trade-off), and requires no knowledge of the underlying
state. We prove convergence of GPOMDP, and show how the correct choice
of the parameter beta is related to the mixing time of the
controlled POMDP. We briefly describe extensions of GPOMDP to
controlled Markov chains, continuous state, observation and control
spaces, multiple-agents, higher-order derivatives, and a version for
training stochastic policies with internal states. In a companion
paper (Baxter et al., this volume) we show how the gradient estimates
generated by GPOMDP can be used in both a traditional stochastic
gradient algorithm and a conjugate-gradient procedure to find local
optima of the average reward.
Baxter, J., Bartlett, P.L. and Weaver, L. (2001)
"Experiments with Infinite-Horizon, Policy-Gradient Estimation",
Volume 15, pages 351-381.
PostScript: volume15/baxter01b.ps (1.4M)
compressed, volume15/baxter01b.ps.Z (301K)
PDF: volume15/baxter01b.pdf (288K)
Abstract: In this paper, we present algorithms that perform gradient
ascent of the average reward in a partially observable Markov decision
process (POMDP). These algorithms are based on GPOMDP, an algorithm
introduced in a companion paper (Baxter & Bartlett, this volume),
which computes biased estimates of the performance gradient in POMDPs.
The algorithm's chief advantages are that it uses only one free
parameter beta, which has a natural interpretation in terms of
bias-variance trade-off, it requires no knowledge of the underlying
state, and it can be applied to infinite state, control and
observation spaces. We show how the gradient estimates produced by
GPOMDP can be used to perform gradient ascent, both with a traditional
stochastic-gradient algorithm, and with an algorithm based on
conjugate-gradients that utilizes gradient information to bracket
maxima in line searches. Experimental results are presented
illustrating both the theoretical results of (Baxter & Bartlett, this
volume) on a toy problem, and practical aspects of the algorithms on a
number of more realistic problems.
Meek, C. (2001)
"Finding a Path is Harder than Finding a Tree",
Volume 15, pages 383-389.
PostScript: volume15/meek01a.ps (139K)
compressed, volume15/meek01a.ps.Z (70K)
PDF: volume15/meek01a.pdf (175K)
Abstract: I consider the problem of learning an optimal path graphical
model from data and show the problem to be NP-hard for the maximum
likelihood and minimum description length approaches and a Bayesian
approach. This hardness result holds despite the fact that the problem
is a restriction of the polynomially solvable problem of finding the
optimal tree graphical model.
Sato, T. and Kameya, Y. (2001)
"Parameter Learning of Logic Programs for Symbolic-Statistical Modeling",
Volume 15, pages 391-454.
PostScript: volume15/sato01a.ps (1.1M)
compressed, volume15/sato01a.ps.Z (323K)
PDF: volume15/sato01a.pdf (697K)
Abstract: We propose a logical/mathematical framework for statistical
parameter learning of parameterized logic programs, i.e. definite
clause programs containing probabilistic facts with a parameterized
distribution. It extends the traditional least Herbrand model
semantics in logic programming to distribution semantics, possible
world semantics with a probability distribution which is
unconditionally applicable to arbitrary logic programs including ones
for HMMs, PCFGs and Bayesian networks.
We also propose a new EM algorithm, the graphical EM algorithm, that
runs for a class of parameterized logic programs representing
sequential decision processes where each decision is exclusive and
independent. It runs on a new data structure called support graphs
describing the logical relationship between observations and their
explanations, and learns parameters by computing inside and outside
probability generalized for logic programs.
The complexity analysis shows that when combined with OLDT search for
all explanations for observations, the graphical EM algorithm, despite
its generality, has the same time complexity as existing EM
algorithms, i.e. the Baum-Welch algorithm for HMMs, the Inside-Outside
algorithm for PCFGs, and the one for singly connected Bayesian
networks that have been developed independently in each research
field. Learning experiments with PCFGs using two corpora of moderate
size indicate that the graphical EM algorithm can significantly
outperform the Inside-Outside algorithm.
Volume 16
Baader, F., Lutz, C., Sturm, H., Wolter, F. (2002)
"Fusions of Description Logics and Abstract Description Systems",
Volume 16, pages 1-58.
PostScript: volume16/baader02a.ps (684K)
compressed, volume16/baader02a.ps.Z (294K)
PDF: volume16/baader02a.pdf (473K)
Abstract: Fusions are a simple way of combining logics. For normal
modal logics, fusions have been investigated in detail. In particular,
it is known that, under certain conditions, decidability transfers
from the component logics to their fusion. Though description logics
are closely related to modal logics, they are not necessarily
normal. In addition, ABox reasoning in description logics is not
covered by the results from modal logics.
In this paper, we extend the decidability transfer results from normal
modal logics to a large class of description logics. To cover
different description logics in a uniform way, we introduce abstract
description systems, which can be seen as a common generalization of
description and modal logics, and show the transfer results in this
general setting.
Drummond, C. (2002)
"Accelerating Reinforcement Learning by Composing Solutions of Automatically Identified Subtasks",
Volume 16, pages 59-104.
PostScript: volume16/drummond02a.ps (1.3M)
compressed, volume16/drummond02a.ps.Z (475K)
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume16/drummond02a-html/drummond02a-html.html
PDF: volume16/drummond02a.pdf (714K)
Abstract: This paper discusses a system that accelerates reinforcement
learning by using transfer from related tasks. Without such
transfer, even if two tasks are very similar at some abstract level,
an extensive re-learning effort is required. The system achieves
much of its power by transferring parts of previously learned
solutions rather than a single complete solution. The system
exploits strong features in the multi-dimensional function produced
by reinforcement learning in solving a particular task. These
features are stable and easy to recognize early in the learning
process. They generate a partitioning of the state space and thus
the function. The partition is represented as a graph. This is
used to index and compose functions stored in a case base to form a
close approximation to the solution of the new task. Experiments
demonstrate that function composition often produces more than an
order of magnitude increase in learning rate compared to a basic
reinforcement learning algorithm.
Singh, S., Litman, D., Kearns, M., Walker, M. (2002)
"Optimizing Dialogue Management with Reinforcement Learning: Experiments with the NJFun System",
Volume 16, pages 105-133.
PostScript: volume16/singh02a.ps (1.1M)
compressed, volume16/singh02a.ps.Z (238K)
PDF: volume16/singh02a.pdf (345K)
Abstract: Designing the dialogue policy of a spoken dialogue system
involves many nontrivial choices. This paper presents a reinforcement
learning approach for automatically optimizing a dialogue policy,
which addresses the technical challenges in applying reinforcement
learning to a working dialogue system with human users. We report on
the design, construction and empirical evaluation of NJFun, an
experimental spoken dialogue system that provides users with access to
information about fun things to do in New Jersey. Our results show
that by optimizing its performance via reinforcement learning, NJFun
measurably improves system performance.
Blockeel, H., Dehaspe, L., Demoen, B., Janssens, G., Ramon, J., and Vandecasteele, H. (2002)
"Improving the Efficiency of Inductive Logic Programming Through the
Use of Query Packs",
Volume 16, pages 135-166.
PostScript: volume16/blockeel02a.ps (416K)
compressed, volume16/blockeel02a.ps.Z (193K)
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume16/blockeel02a-html/packs-html.html
PDF: volume16/blockeel02a.pdf (470K)
Abstract: Inductive logic programming, or relational learning, is a
powerful paradigm for machine learning or data mining. However, in
order for ILP to become practically useful, the efficiency of ILP
systems must improve substantially. To this end, the notion of a query
pack is introduced: it structures sets of similar
queries. Furthermore, a mechanism is described for executing such
query packs. A complexity analysis shows that considerable efficiency
improvements can be achieved through the use of this query pack
execution mechanism. This claim is supported by empirical results
obtained by incorporating support for query pack execution in two
existing learning systems.
Shatkay, H. and Kaelbling, L.P. (2002)
"Learning Geometrically-Constrained Hidden Markov Models for Robot Navigation: Bridging the Topological-Geometrical Gap",
Volume 16, pages 167-207.
PostScript: volume16/shatkay02a.ps (1.3M)
compressed, volume16/shatkay02a.ps.Z (472K)
PDF: volume16/shatkay02a.pdf (852K)
Abstract: Hidden Markov models (HMMs) and partially observable Markov
decision processes (POMDPs) provide useful tools for modeling
dynamical systems. They are particularly useful for representing the
topology of environments such as road networks and office buildings,
which are typical for robot navigation and planning. The work
presented here describes a formal framework for incorporating readily
available odometric information and geometrical constraints into both
the models and the algorithm that learns them. By taking advantage of
such information, learning HMMs/POMDPs can be made to generate better
solutions and require fewer iterations, while being robust in the face
of data reduction. Experimental results, obtained from both simulated
and real robot data, demonstrate the effectiveness of the approach.
Di Sciascio, E., Donini, F.M. and Mongiello, M. (2002)
"Structured Knowledge Representation for Image Retrieval",
Volume 16, pages 209-257.
PostScript: volume16/disciascio02a.ps (43M)
compressed, volume16/disciascio02a.ps.Z (2M)
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume16/disciascio02a-html/disciascio02a.html
PDF: volume16/disciascio02a.pdf (1M)
Abstract: We propose a structured approach to the problem of retrieval
of images by content and present a description logic that has been
devised for the semantic indexing and retrieval of images containing
complex objects.
As other approaches do, we start from low-level features extracted
with image analysis to detect and characterize regions in an
image. However, in contrast with feature-based approaches, we provide
a syntax to describe segmented regions as basic objects and complex
objects as compositions of basic ones. Then we introduce a companion
extensional semantics for defining reasoning services, such as
retrieval, classification, and subsumption. These services can be
used for both exact and approximate matching, using similarity
measures.
Using our logical approach as a formal specification, we implemented a
complete client-server image retrieval system, which allows a user to
pose both queries by sketch and queries by example. A set of
experiments has been carried out on a testbed of images to assess the
retrieval capabilities of the system in comparison with expert users
ranking. Results are presented adopting a well-established measure of
quality borrowed from textual information retrieval.
Xu, X., He, H. and Hu, D. (2002)
"Efficient Reinforcement Learning Using Recursive Least-Squares Methods",
Volume 16, pages 259-292.
PostScript: volume16/xu02a.ps (1.7M)
compressed, volume16/xu02a.ps.Z (700K)
PDF: volume16/xu02a.pdf (219K)
Abstract: The recursive least-squares (RLS) algorithm is one of the
most well-known algorithms used in adaptive filtering, system
identification and adaptive control. Its popularity is mainly due to
its fast convergence speed, which is considered to be optimal in
practice. In this paper, RLS methods are used to solve reinforcement
learning problems, where two new reinforcement learning algorithms
using linear value function approximators are proposed and
analyzed. The two algorithms are called RLS-TD(lambda) and Fast-AHC
(Fast Adaptive Heuristic Critic), respectively. RLS-TD(lambda) can be
viewed as the extension of RLS-TD(0) from lambda=0 to general lambda
within interval [0,1], so it is a multi-step temporal-difference (TD)
learning algorithm using RLS methods. The convergence with probability
one and the limit of convergence of RLS-TD(lambda) are proved for
ergodic Markov chains. Compared to the existing LS-TD(lambda)
algorithm, RLS-TD(lambda) has advantages in computation and is more
suitable for online learning. The effectiveness of RLS-TD(lambda) is
analyzed and verified by learning prediction experiments of Markov
chains with a wide range of parameter settings.
The Fast-AHC algorithm is derived by applying the proposed
RLS-TD(lambda) algorithm in the critic network of the adaptive
heuristic critic method. Unlike conventional AHC algorithm, Fast-AHC
makes use of RLS methods to improve the learning-prediction efficiency
in the critic. Learning control experiments of the cart-pole balancing
and the acrobot swing-up problems are conducted to compare the data
efficiency of Fast-AHC with conventional AHC. From the experimental
results, it is shown that the data efficiency of learning control can
also be improved by using RLS methods in the learning-prediction
process of the critic. The performance of Fast-AHC is also compared
with that of the AHC method using LS-TD(lambda). Furthermore, it is
demonstrated in the experiments that different initial values of the
variance matrix in RLS-TD(lambda) are required to get better
performance not only in learning prediction but also in learning
control. The experimental results are analyzed based on the existing
theoretical work on the transient phase of forgetting factor RLS
methods.
Walker, M.A., Langkilde-Geary, I., Wright Hastie, H., Wright, J., Gorin, A. (2002)
"Automatically Training a Problematic Dialogue Predictor for a Spoken Dialogue System",
Volume 16, pages 293-319.
PostScript: volume16/walker02a.ps (304K)
compressed, volume16/walker02a.ps.Z (144K)
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume16/walker02a-html/index.html
PDF: volume16/walker02a.pdf (341K)
Abstract: Spoken dialogue systems promise efficient and natural access
to a large variety of information sources and services from any phone.
However, current spoken dialogue systems are deficient in their
strategies for preventing, identifying and repairing problems that
arise in the conversation. This paper reports results on automatically
training a Problematic Dialogue Predictor to predict problematic
human-computer dialogues using a corpus of 4692 dialogues collected
with the 'How May I Help You' (SM) spoken dialogue system. The
Problematic Dialogue Predictor can be immediately applied to the
system's decision of whether to transfer the call to a human customer
care agent, or be used as a cue to the system's dialogue manager to
modify its behavior to repair problems, and even perhaps, to prevent
them. We show that a Problematic Dialogue Predictor using
automatically-obtainable features from the first two exchanges in the
dialogue can predict problematic dialogues 13.2% more accurately than
the baseline.
Chawla, N.V., Bowyer, K.W., Hall, L.O., Kegelmeyer, W.P. (2002)
"SMOTE: Synthetic Minority Over-sampling Technique",
Volume 16, pages 321-357.
PostScript: volume16/chawla02a.ps (812K)
compressed, volume16/chawla02a.ps.Z (330K)
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume16/chawla02a-html/chawla2002.html
PDF: volume16/chawla02a.pdf (490K)
Abstract: An approach to the construction of classifiers from
imbalanced datasets is described. A dataset is imbalanced if the
classification categories are not approximately equally
represented. Often real-world data sets are predominately composed of
``normal'' examples with only a small percentage of ``abnormal'' or
``interesting'' examples. It is also the case that the cost of
misclassifying an abnormal (interesting) example as a normal example
is often much higher than the cost of the reverse
error. Under-sampling of the majority (normal) class has been proposed
as a good means of increasing the sensitivity of a classifier to the
minority class. This paper shows that a combination of our method of
over-sampling the minority (abnormal) class and under-sampling the
majority (normal) class can achieve better classifier performance (in
ROC space) than only under-sampling the majority class. This paper
also shows that a combination of our method of over-sampling the
minority class and under-sampling the majority class can achieve
better classifier performance (in ROC space) than varying the loss
ratios in Ripper or class priors in Naive Bayes. Our method of
over-sampling the minority class involves creating synthetic minority
class examples. Experiments are performed using C4.5, Ripper and a
Naive Bayes classifier. The method is evaluated using the area under
the Receiver Operating Characteristic curve (AUC) and the ROC convex
hull strategy.
Wolpert, D.H. and Tumer, K. (2002)
"Collective Intelligence, Data Routing and Braess' Paradox",
Volume 16, pages 359-387.
PostScript: volume16/wolpert02a.ps (368K)
compressed, volume16/wolpert02a.ps.Z (173K)
PDF: volume16/wolpert02a.pdf (418K)
Abstract: We consider the problem of designing the the utility
functions of the utility-maximizing agents in a multi-agent system so
that they work synergistically to maximize a global utility. The
particular problem domain we explore is the control of network routing
by placing agents on all the routers in the network. Conventional
approaches to this task have the agents all use the Ideal Shortest
Path routing Algorithm (ISPA). We demonstrate that in many cases, due
to the side-effects of one agent's actions on another agent's
performance, having agents use ISPA's is suboptimal as far as global
aggregate cost is concerned, even when they are only used to route
infinitesimally small amounts of traffic. The utility functions of
the individual agents are not ``aligned'' with the global utility,
intuitively speaking. As a particular example of this we present an
instance of Braess' paradox in which adding new links to a network
whose agents all use the ISPA results in a decrease in overall
throughput. We also demonstrate that load-balancing, in which the
agents' decisions are collectively made to optimize the global cost
incurred by all traffic currently being routed, is suboptimal as far
as global cost averaged across time is concerned. This is also due to
`side-effects', in this case of current routing decision on future
traffic. The mathematics of Collective Intelligence (COIN) is
concerned precisely with the issue of avoiding such deleterious
side-effects in multi-agent systems, both over time and space. We
present key concepts from that mathematics and use them to derive an
algorithm whose ideal version should have better performance than that
of having all agents use the ISPA, even in the infinitesimal limit. We
present experiments verifying this, and also showing that a
machine-learning-based version of this COIN algorithm in which costs
are only imprecisely estimated via empirical means (a version
potentially applicable in the real world) also outperforms the ISPA,
despite having access to less information than does the ISPA. In
particular, this COIN algorithm almost always avoids Braess' paradox.
Pynadath, D.V. and Tambe, M. (2002)
"The Communicative Multiagent Team Decision Problem: Analyzing Teamwork Theories and Models",
Volume 16, pages 389-423.
PostScript: volume16/pynadath02a.ps (15M)
compressed, volume16/pynadath02a.ps.Z (455K)
Online Appendix1: volume16/pynadath02a-appendix1.tar (336K) Algorithms and data
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume16/pynadath02a-html/index.html
PDF: volume16/pynadath02a.pdf (519K)
Abstract: Despite the significant progress in multiagent teamwork,
existing research does not address the optimality of its prescriptions
nor the complexity of the teamwork problem. Without a
characterization of the optimality-complexity tradeoffs, it is
impossible to determine whether the assumptions and approximations
made by a particular theory gain enough efficiency to justify the
losses in overall performance. To provide a tool for use by
multiagent researchers in evaluating this tradeoff, we present a
unified framework, the COMmunicative Multiagent Team Decision Problem
(COM-MTDP). The COM-MTDP model combines and extends existing
multiagent theories, such as decentralized partially observable Markov
decision processes and economic team theory. In addition to their
generality of representation, COM-MTDPs also support the analysis of
both the optimality of team performance and the computational
complexity of the agents' decision problem. In analyzing complexity,
we present a breakdown of the computational complexity of constructing
optimal teams under various classes of problem domains, along the
dimensions of observability and communication cost. In analyzing
optimality, we exploit the COM-MTDP's ability to encode existing
teamwork theories and models to encode two instantiations of joint
intentions theory taken from the literature. Furthermore, the
COM-MTDP model provides a basis for the development of novel team
coordination algorithms. We derive a domain-independent criterion for
optimal communication and provide a comparative analysis of the two
joint intentions instantiations with respect to this optimal policy.
We have implemented a reusable, domain-independent software package
based on COM-MTDPs to analyze teamwork coordination strategies, and we
demonstrate its use by encoding and evaluating the two joint
intentions strategies within an example domain.
Baget, J.F. and Mugnier, M.L. (2002)
"Extensions of Simple Conceptual Graphs: the Complexity of Rules and Constraints",
Volume 16, pages 425-465.
PostScript: volume16/baget02a.ps (751K)
compressed, volume16/baget02a.ps.Z (294K)
PDF: volume16/baget02a.pdf (567K)
Abstract: Simple conceptual graphs are considered as the kernel of
most knowledge representation formalisms built upon Sowa's
model. Reasoning in this model can be expressed by a graph
homomorphism called projection, whose semantics is usually given in
terms of positive, conjunctive, existential FOL. We present here a
family of extensions of this model, based on rules and constraints,
keeping graph homomorphism as the basic operation. We focus on the
formal definitions of the different models obtained, including their
operational semantics and relationships with FOL, and we analyze the
decidability and complexity of the associated problems (consistency
and deduction). As soon as rules are involved in reasonings, these
problems are not decidable, but we exhibit a condition under which
they fall in the polynomial hierarchy. These results extend and
complete the ones already published by the authors. Moreover we
systematically study the complexity of some particular cases obtained
by restricting the form of constraints and/or rules.
Volume 17
Howe, A.E. and Dahlman, E. (2002)
"A Critical Assessment of Benchmark Comparison in Planning",
Volume 17, pages 1-33.
PostScript: volume17/howe02a.ps (2M)
compressed, volume17/howe02a.ps.Z (251K)
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume17/howe02a-html/jair-plan-html.html
PDF: volume17/howe02a.pdf (327K)
Abstract: Recent trends in planning research have led to empirical
comparison becoming commonplace. The field has started to settle into
a methodology for such comparisons, which for obvious practical
reasons requires running a subset of planners on a subset of
problems. In this paper, we characterize the methodology and
examine eight implicit assumptions about the problems, planners and
metrics used in many of these comparisons. The problem assumptions
are: PR1) the performance of a general purpose planner should not be
penalized/biased if executed on a sampling of problems and domains,
PR2) minor syntactic differences in representation do not affect
performance, and PR3) problems should be solvable by STRIPS capable
planners unless they require ADL. The planner assumptions are: PL1)
the latest version of a planner is the best one to use, PL2) default
parameter settings approximate good performance, and PL3) time
cut-offs do not unduly bias outcome. The metrics assumptions are:
M1) performance degrades similarly for each planner when run on
degraded runtime environments (e.g., machine platform) and M2) the
number of plan steps distinguishes performance. We find that most of
these assumptions are not supported empirically; in particular, that
planners are affected differently by these assumptions. We conclude
with a call to the community to devote research resources to
improving the state of the practice and especially to enhancing the
available benchmark problems.
Barzilay, R., Elhadad, N., and McKeown K.R. (2002)
"Inferring Strategies for Sentence Ordering in Multidocument News Summarization",
Volume 17, pages 35-55.
PostScript: volume17/barzilay02a.ps (383K)
compressed, volume17/barzilay02a.ps.Z (251K)
PDF: volume17/barzilay02a.pdf (215K)
Abstract: The problem of organizing information for multidocument
summarization so that the generated summary is coherent has received
relatively little attention. While sentence ordering for single
document summarization can be determined from the ordering of
sentences in the input article, this is not the case for multidocument
summarization where summary sentences may be drawn from different
input articles. In this paper, we propose a methodology for studying
the properties of ordering information in the news genre and describe
experiments done on a corpus of multiple acceptable orderings we
developed for the task. Based on these experiments, we implemented a
strategy for ordering information that combines constraints from
chronological order of events and topical relatedness. Evaluation of
our augmented algorithm shows a significant improvement of the
ordering over two baseline strategies.
Halpern, J.Y. and Pucella, R. (2002)
"A Logic for Reasoning about Upper Probabilities",
Volume 17, pages 57-81.
PostScript: volume17/halpern02a.ps (371K)
compressed, volume17/halpern02a.ps.Z (173K)
PDF: volume17/halpern02a.pdf (295K)
Abstract: We present a propositional logic to reason about the
uncertainty of events, where the uncertainty is modeled by a set of
probability measures assigning an interval of probability to each
event. We give a sound and complete axiomatization for the logic, and
show that the satisfiability problem is NP-complete, no harder than
satisfiability for propositional logic.
Kaminka, G.A., Pynadath, D.V. and Tambe, M. (2002)
"Monitoring Teams by Overhearing: A Multi-Agent Plan-Recognition Approach",
Volume 17, pages 83-135.
PostScript: volume17/kaminka02a.ps (874K)
compressed, volume17/kaminka02a.ps.Z (369K)
Online Appendix1: volume17/kaminka02a-appendix1.tar (99K) sample data
PDF: volume17/kaminka02a.pdf (688K)
Abstract: Recent years are seeing an increasing need for on-line
monitoring of teams of cooperating agents, e.g., for visualization, or
performance tracking. However, in monitoring deployed teams, we often
cannot rely on the agents to always communicate their state to the
monitoring system. This paper presents a non-intrusive approach to
monitoring by 'overhearing', where the monitored team's state is
inferred (via plan-recognition) from team-members' *routine*
communications, exchanged as part of their coordinated task execution,
and observed (overheard) by the monitoring system. Key challenges in
this approach include the demanding run-time requirements of
monitoring, the scarceness of observations (increasing monitoring
uncertainty), and the need to scale-up monitoring to address
potentially large teams. To address these, we present a set of
complementary novel techniques, exploiting knowledge of the social
structures and procedures in the monitored team: (i) an efficient
probabilistic plan-recognition algorithm, well-suited for processing
communications as observations; (ii) an approach to exploiting
knowledge of the team's social behavior to predict future observations
during execution (reducing monitoring uncertainty); and (iii)
monitoring algorithms that trade expressivity for scalability,
representing only certain useful monitoring hypotheses, but allowing
for any number of agents and their different activities to be
represented in a single coherent entity. We present an empirical
evaluation of these techniques, in combination and apart, in
monitoring a deployed team of agents, running on machines physically
distributed across the country, and engaged in complex, dynamic task
execution. We also compare the performance of these techniques to
human expert and novice monitors, and show that the techniques
presented are capable of monitoring at human-expert levels, despite
the difficulty of the task.
Nock, R. (2002)
"Inducing Interpretable Voting Classifiers without Trading Accuracy for Simplicity: Theoretical Results, Approximation Algorithms, and Experiments",
Volume 17, pages 137-170.
PostScript: volume17/nock02a.ps (532K)
compressed, volume17/nock02a.ps.Z (228K)
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume17/nock02a-html/nock02a-html.html
PDF: volume17/nock02a.pdf (403K)
Abstract: Recent advances in the study of voting classification
algorithms have brought empirical and theoretical results clearly
showing the discrimination power of ensemble classifiers. It has been
previously argued that the search of this classification power in the
design of the algorithms has marginalized the need to obtain
interpretable classifiers. Therefore, the question of whether one
might have to dispense with interpretability in order to keep
classification strength is being raised in a growing number of machine
learning or data mining papers. The purpose of this paper is to study
both theoretically and empirically the problem. First, we provide
numerous results giving insight into the hardness of the
simplicity-accuracy tradeoff for voting classifiers. Then we provide
an efficient ``top-down and prune'' induction heuristic, WIDC, mainly
derived from recent results on the weak learning and boosting
frameworks. It is to our knowledge the first attempt to build a
voting classifier as a base formula using the weak learning framework
(the one which was previously highly successful for decision tree
induction), and not the strong learning framework (as usual for such
classifiers with boosting-like approaches). While it uses a well-known
induction scheme previously successful in other classes of concept
representations, thus making it easy to implement and compare, WIDC
also relies on recent or new results we give about particular cases of
boosting known as partition boosting and ranking loss
boosting. Experimental results on thirty-one domains, most of which
readily available, tend to display the ability of WIDC to produce
small, accurate, and interpretable decision committees.
Scerri, P., Pynadath, D.V., Tampe, M. (2002)
"Towards Adjustable Autonomy for the Real World",
Volume 17, pages 171-228.
PostScript: volume17/scerri02a.ps (16M)
compressed, volume17/scerri02a.ps.Z (1.2M)
PDF: volume17/scerri02a.pdf (774K)
Abstract: Adjustable autonomy refers to entities dynamically varying
their own autonomy, transferring decision-making control to other
entities (typically agents transferring control to human users) in key
situations. Determining whether and when such transfers-of-control
should occur is arguably the fundamental research problem in
adjustable autonomy. Previous work has investigated various approaches
to addressing this problem but has often focused on individual
agent-human interactions. Unfortunately, domains requiring
collaboration between teams of agents and humans reveal two key
shortcomings of these previous approaches. First, these approaches use
rigid one-shot transfers of control that can result in unacceptable
coordination failures in multiagent settings. Second, they ignore
costs (e.g., in terms of time delays or effects on actions) to an
agent's team due to such transfers-of-control.
To remedy these problems, this article presents a novel approach to
adjustable autonomy, based on the notion of a transfer-of-control
strategy. A transfer-of-control strategy consists of a conditional
sequence of two types of actions: (i) actions to transfer
decision-making control (e.g., from an agent to a user or vice versa)
and (ii) actions to change an agent's pre-specified coordination
constraints with team members, aimed at minimizing miscoordination
costs. The goal is for high-quality individual decisions to be made
with minimal disruption to the coordination of the team. We present a
mathematical model of transfer-of-control strategies. The model guides
and informs the operationalization of the strategies using Markov
Decision Processes, which select an optimal strategy, given an
uncertain environment and costs to the individuals and teams. The
approach has been carefully evaluated, including via its use in a
real-world, deployed multi-agent system that assists a research group
in its daily activities.
Darwiche, A. and Marquis, P. (2002)
"A Knowledge Compilation Map",
Volume 17, pages 229-264.
PostScript: volume17/darwiche02a.ps (1.1M)
compressed, volume17/darwiche02a.ps.Z (514K)
PDF: volume17/darwiche02a.pdf (444K)
Abstract: We propose a perspective on knowledge compilation which
calls for analyzing different compilation approaches according to two
key dimensions: the succinctness of the target compilation language,
and the class of queries and transformations that the language
supports in polytime. We then provide a knowledge compilation map,
which analyzes a large number of existing target compilation languages
according to their succinctness and their polytime transformations and
queries. We argue that such analysis is necessary for placing new
compilation approaches within the context of existing ones. We also go
beyond classical, flat target compilation languages based on CNF and
DNF, and consider a richer, nested class based on directed acyclic
graphs (such as OBDDs), which we show to include a relatively large
number of target compilation languages.
Chan, H. and Darwiche, A. (2002)
"When do Numbers Really Matter?",
Volume 17, pages 265-287.
PostScript: volume17/chan02a.ps (2.4M)
compressed, volume17/chan02a.ps.Z (519K)
PDF: volume17/chan02a.pdf (311K)
Abstract: Common wisdom has it that small distinctions in the
probabilities (parameters) quantifying a belief network do not matter
much for the results of probabilistic queries. Yet, one can develop
realistic scenarios under which small variations in network parameters
can lead to significant changes in computed queries. A pending
theoretical question is then to analytically characterize parameter
changes that do or do not matter. In this paper, we study the
sensitivity of probabilistic queries to changes in network parameters
and prove some tight bounds on the impact that such parameters can
have on queries. Our analytic results pinpoint some interesting
situations under which parameter changes do or do not matter. These
results are important for knowledge engineers as they help them
identify influential network parameters. They also help explain some
of the previous experimental results and observations with regards to
network robustness against parameter changes.
Bod, R. (2002)
"A Unified Model of Structural Organization in Language and Music",
Volume 17, pages 289-308.
PostScript: volume17/bod02a.ps (1M)
compressed, volume17/bod02a.ps.Z (534K)
PDF: volume17/bod02a.pdf (84K)
Abstract: Is there a general model that can predict the perceived
phrase structure in language and music? While it is usually assumed
that humans have separate faculties for language and music, this work
focuses on the commonalities rather than on the differences between
these modalities, aiming at finding a deeper 'faculty'. Our key idea
is that the perceptual system strives for the simplest structure (the
'simplicity principle'), but in doing so it is biased by the
likelihood of previous structures (the 'likelihood principle'). We
present a series of data-oriented parsing (DOP) models that combine
these two principles and that are tested on the Penn Treebank and the
Essen Folksong Collection. Our experiments show that (1) a combination
of the two principles outperforms the use of either of them, and (2)
exactly the same model with the same parameter setting achieves
maximum accuracy for both language and music. We argue that our
results suggest an interesting parallel between linguistic and musical
structuring.
Gao, Y. and Culberson, J. (2002)
"An Analysis of Phase Transition in NK Landscapes",
Volume 17, pages 309-332.
PostScript: volume17/gao02a.ps (359K)
compressed, volume17/gao02a.ps.Z (166K)
PDF: volume17/gao02a.pdf (527K)
Abstract: In this paper, we analyze the decision version of the NK
landscape model from the perspective of threshold phenomena and phase
transitions under two random distributions, the uniform probability
model and the fixed ratio model. For the uniform probability model, we
prove that the phase transition is easy in the sense that there is a
polynomial algorithm that can solve a random instance of the problem
with the probability asymptotic to 1 as the problem size tends to
infinity. For the fixed ratio model, we establish several upper bounds
for the solubility threshold, and prove that random instances with
parameters above these upper bounds can be solved polynomially. This,
together with our empirical study for random instances generated below
and in the phase transition region, suggests that the phase transition
of the fixed ratio model is also easy.
Al-Ani, A. and Deriche, M. (2002)
"A New Technique for Combining Multiple Classifiers using The Dempster-Shafer Theory of Evidence",
Volume 17, pages 333-361.
PostScript: volume17/alani02a.ps (426K)
compressed, volume17/alani02a.ps.Z (193K)
PDF: volume17/alani02a.pdf (243K)
Abstract: This paper presents a new classifier combination
technique based on the Dempster-Shafer theory of evidence. The
Dempster-Shafer theory of evidence is a powerful method for
combining measures of evidence from different classifiers. However,
since each of the available methods that estimates the evidence of
classifiers has its own limitations, we propose here a new
implementation which adapts to training data so that the overall
mean square error is minimized. The proposed technique is shown to
outperform most available classifier combination methods when
tested on three different classification problems.
Tennenholtz, M. (2002)
"Competitive Safety Analysis: Robust Decision-Making
in Multi-Agent Systems",
Volume 17, pages 363-378.
PostScript: volume17/tennenholtz02a.ps (409K)
compressed, volume17/tennenholtz02a.ps.Z (218K)
PDF: volume17/tennenholtz02a.pdf (250K)
Abstract: Much work in AI deals with the selection of proper actions
in a given (known or unknown) environment. However, the way to select
a proper action when facing other agents is quite unclear. Most work
in AI adopts classical game-theoretic equilibrium analysis to predict
agent behavior in such settings. This approach however does not
provide us with any guarantee for the agent. In this paper we
introduce competitive safety analysis. This approach bridges the gap
between the desired normative AI approach, where a strategy should be
selected in order to guarantee a desired payoff, and equilibrium
analysis. We show that a safety level strategy is able to guarantee
the value obtained in a Nash equilibrium, in several classical
computer science settings. Then, we discuss the concept of
competitive safety strategies, and illustrate its use in a
decentralized load balancing setting, typical to network problems. In
particular, we show that when we have many agents, it is possible to
guarantee an expected payoff which is a factor of 8/9 of the payoff
obtained in a Nash equilibrium. Our discussion of competitive safety
analysis for decentralized load balancing is further developed to deal
with many communication links and arbitrary speeds. Finally, we
discuss the extension of the above concepts to Bayesian games, and
illustrate their use in a basic auctions setup.
Fern, A., Givan, R., and Siskind, J.M. (2002)
"Specific-to-General Learning for Temporal Events with Application to Learning Event Definitions from Video",
Volume 17, pages 379-449.
PostScript: volume17/fern02a.ps (20M)
compressed, volume17/fern02a.ps.Z (6M)
Online Appendix1: volume17/fern02a-appendix1.tar.Z (3M) Source code and data
PDF: volume17/fern02a.pdf (1M)
Abstract: We develop, analyze, and evaluate a novel, supervised,
specific-to-general learner for a simple temporal logic and use the
resulting algorithm to learn visual event definitions from video
sequences. First, we introduce a simple, propositional, temporal,
event-description language called AMA that is sufficiently expressive
to represent many events yet sufficiently restrictive to support
learning. We then give algorithms, along with lower and upper
complexity bounds, for the subsumption and generalization problems for
AMA formulas. We present a positive-examples--only specific-to-general
learning method based on these algorithms. We also present a
polynomial-time--computable ``syntactic'' subsumption test that
implies semantic subsumption without being equivalent to it. A
generalization algorithm based on syntactic subsumption can be used in
place of semantic generalization to improve the asymptotic complexity
of the resulting learning algorithm. Finally, we apply this algorithm
to the task of learning relational event definitions from video and
show that it yields definitions that are competitive with hand-coded
ones.
Bui, H.H., Venkatesh, S., and West, G. (2002)
"Policy Recognition in the Abstract Hidden Markov Model",
Volume 17, pages 451-499.
PostScript: volume17/bui02a.ps (1.8M)
compressed, volume17/bui02a.ps.Z (563K)
PDF: volume17/bui02a.pdf (614K)
Abstract: In this paper, we present a method for recognising an
agent's behaviour in dynamic, noisy, uncertain domains, and across
multiple levels of abstraction. We term this problem on-line plan
recognition under uncertainty and view it generally as probabilistic
inference on the stochastic process representing the execution of the
agent's plan. Our contributions in this paper are twofold. In terms
of probabilistic inference, we introduce the Abstract Hidden Markov
Model (AHMM), a novel type of stochastic processes, provide its
dynamic Bayesian network (DBN) structure and analyse the properties of
this network. We then describe an application of the
Rao-Blackwellised Particle Filter to the AHMM which allows us to
construct an efficient, hybrid inference method for this model. In
terms of plan recognition, we propose a novel plan recognition
framework based on the AHMM as the plan execution model. The
Rao-Blackwellised hybrid inference for AHMM can take advantage of the
independence properties inherent in a model of plan execution, leading
to an algorithm for online probabilistic plan recognition that scales
well with the number of levels in the plan hierarchy. This
illustrates that while stochastic models for plan execution can be
complex, they exhibit special structures which, if exploited, can lead
to efficient plan recognition algorithms. We demonstrate the
usefulness of the AHMM framework via a behaviour recognition system in
a complex spatial environment using distributed video surveillance
data.
Gamberger, D. and Lavrac, N. (2002)
"Expert-Guided Subgroup Discovery: Methodology and Application",
Volume 17, pages 501-527.
PostScript: volume17/gamberger02a.ps (441K)
compressed, volume17/gamberger02a.ps.Z (221K)
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume17/gamberger02a-html/gamberger2002a.html
PDF: volume17/gamberger02a.pdf (356K)
Abstract: This paper presents an approach to expert-guided subgroup
discovery. The main step of the subgroup discovery process, the
induction of subgroup descriptions, is performed by a heuristic beam
search algorithm, using a novel parametrized definition of rule
quality which is analyzed in detail. The other important steps of the
proposed subgroup discovery process are the detection of statistically
significant properties of selected subgroups and subgroup
visualization: statistically significant properties are used to enrich
the descriptions of induced subgroups, while the visualization shows
subgroup properties in the form of distributions of the numbers of
examples in the subgroups. The approach is illustrated by the results
obtained for a medical problem of early detection of patient risk
groups.
Volume 18
Thompson, C.A and Mooney, R.J. (2003)
"Acquiring Word-Meaning Mappings for Natural Language Interfaces",
Volume 18, pages 1-44.
PostScript: volume18/thompson03a.ps (601K)
compressed, volume18/thompson03a.ps.Z (278K)
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume18/thompson03a-html/thompson03a-html.html
PDF: volume18/thompson03a.pdf (454K)
Abstract: This paper focuses on a system, WOLFIE (WOrd Learning From
Interpreted Examples), that acquires a semantic lexicon from a corpus
of sentences paired with semantic representations. The lexicon
learned consists of phrases paired with meaning representations.
WOLFIE is part of an integrated system that learns to transform
sentences into representations such as logical database queries.
Experimental results are presented demonstrating WOLFIE's ability to
learn useful lexicons for a database interface in four different
natural languages. The usefulness of the lexicons learned by WOLFIE
are compared to those acquired by a similar system, with results
favorable to WOLFIE. A second set of experiments demonstrates
WOLFIE's ability to scale to larger and more difficult, albeit
artificially generated, corpora.
In natural language acquisition, it is difficult to gather the
annotated data needed for supervised learning; however, unannotated
data is fairly plentiful. Active learning methods attempt to select
for annotation and training only the most informative examples, and
therefore are potentially very useful in natural language
applications. However, most results to date for active learning have
only considered standard classification tasks. To reduce annotation
effort while maintaining accuracy, we apply active learning to
semantic lexicons. We show that active learning can significantly
reduce the number of annotated examples required to achieve a given
level of performance.
Cemgil, A.T. and Kappen, B. (2003)
"Monte Carlo Methods for Tempo Tracking and Rhythm Quantization",
Volume 18, pages 45-81.
PostScript: volume18/cemgil03a.ps (1M)
compressed, volume18/cemgil03a.ps.Z (278K)
PDF: volume18/cemgil03a.pdf (801K)
Abstract: We present a probabilistic generative model for timing
deviations in expressive music performance. The structure of the
proposed model is equivalent to a switching state space model. The
switch variables correspond to discrete note locations as in a musical
score. The continuous hidden variables denote the tempo. We formulate
two well known music recognition problems, namely tempo tracking and
automatic transcription (rhythm quantization) as filtering and maximum
a posteriori (MAP) state estimation tasks. Exact computation of
posterior features such as the MAP state is intractable in this model
class, so we introduce Monte Carlo methods for integration and
optimization. We compare Markov Chain Monte Carlo (MCMC) methods (such
as Gibbs sampling, simulated annealing and iterative improvement) and
sequential Monte Carlo methods (particle filters). Our simulation
results suggest better results with sequential methods. The methods
can be applied in both online and batch scenarios such as tempo
tracking and transcription and are thus potentially useful in a number
of music applications such as adaptive automatic accompaniment, score
typesetting and music information retrieval.
Grumberg, O., Livne, S. and Markovitch, S. (2003)
"Learning to Order BDD Variables in Verification",
Volume 18, pages 83-116.
PostScript: volume18/grumberg03a.ps (581K)
compressed, volume18/grumberg03a.ps.Z (262K)
PDF: volume18/grumberg03a.pdf (279K)
Abstract: The size and complexity of software and hardware systems
have significantly increased in the past years. As a result, it is
harder to guarantee their correct behavior. One of the most successful
methods for automated verification of finite-state systems is model
checking. Most of the current model-checking systems use binary
decision diagrams (BDDs) for the representation of the tested model
and in the verification process of its properties. Generally, BDDs
allow a canonical compact representation of a boolean function (given
an order of its variables). The more compact the BDD is, the better
performance one gets from the verifier. However, finding an optimal
order for a BDD is an NP-complete problem. Therefore, several
heuristic methods based on expert knowledge have been developed for
variable ordering.
We propose an alternative approach in which the variable ordering
algorithm gains 'ordering experience' from training models and
uses the learned knowledge for finding good orders. Our
methodology is based on offline learning of pair precedence
classifiers from training models, that is, learning which variable
pair permutation is more likely to lead to a good order. For each
training model, a number of training sequences are evaluated. Every
training model variable pair permutation is then tagged based on
its performance on the evaluated orders. The tagged permutations
are then passed through a feature extractor and are given as
examples to a classifier creation algorithm. Given a model for
which an order is requested, the ordering algorithm consults each
precedence classifier and constructs a pair precedence table
which is used to create the order.
Our algorithm was integrated with SMV, which is one of the most
widely used verification systems. Preliminary empirical evaluation of our
methodology, using real benchmark models, shows performance that
is better than random ordering and is competitive with existing
algorithms that use expert knowledge. We believe that in
sub-domains of models (alu, caches, etc.) our system will prove
even more valuable. This is because it features the ability to
learn sub-domain knowledge, something that no other ordering
algorithm does.
Peral, J. and Ferrandez, A. (2003)
"Translation of Pronominal Anaphora between English and Spanish: Discrepancies and Evaluation",
Volume 18, pages 117-147.
PostScript: volume18/peral03a.ps (471K)
compressed, volume18/peral03a.ps.Z (235K)
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume18/peral03a-html/peral03a.html
PDF: volume18/peral03a.pdf (232K)
Abstract: This paper evaluates the different tasks carried out in the
translation of pronominal anaphora in a machine translation (MT)
system. The MT interlingua approach named AGIR (Anaphora Generation
with an Interlingua Representation) improves upon other proposals
presented to date because it is able to translate intersentential
anaphors, detect co-reference chains, and translate Spanish zero
pronouns into English---issues hardly considered by other systems. The
paper presents the resolution and evaluation of these anaphora
problems in AGIR with the use of different kinds of knowledge
(lexical, morphological, syntactic, and semantic). The translation of
English and Spanish anaphoric third-person personal pronouns
(including Spanish zero pronouns) into the target language has been
evaluated on unrestricted corpora. We have obtained a precision of
80.4% and 84.8% in the translation of Spanish and English pronouns,
respectively. Although we have only studied the Spanish and English
languages, our approach can be easily extended to other languages such
as Portuguese, Italian, or Japanese.
Lerman, K., Minton, S.N. and Knoblock, C.A. (2003)
"Wrapper Maintenance: A Machine Learning Approach",
Volume 18, pages 149-181.
PostScript: volume18/lerman03a.ps (1.1M)
compressed, volume18/lerman03a.ps.Z (615K)
PDF: volume18/lerman03a.pdf (377K)
Abstract: The proliferation of online information sources has led to an
increased use of wrappers for extracting data from Web sources. While
most of the previous research has focused on quick and efficient
generation of wrappers, the development of tools for wrapper
maintenance has received less attention. This is an important research
problem because Web sources often change in ways that prevent the
wrappers from extracting data correctly. We present an efficient
algorithm that learns structural information about data from positive
examples alone. We describe how this information can be used for two
wrapper maintenance applications: wrapper verification and
reinduction. The wrapper verification system detects when a wrapper is
not extracting correct data, usually because the Web source has
changed its format. The reinduction algorithm automatically recovers
from changes in the Web source by identifying data on Web pages so
that a new wrapper may be generated for this source. To validate our
approach, we monitored 27 wrappers over a period of a year. The
verification algorithm correctly discovered 35 of the 37 wrapper
changes, and made 16 mistakes, resulting in precision of 0.73 and
recall of 0.95. We validated the reinduction algorithm on ten Web
sources. We were able to successfully reinduce the wrappers, obtaining
precision and recall values of 0.90 and 0.80 on the data
extraction task.
Tan, K.C., Khor, E.F., Lee, T.H. and Sathikannan, R. (2003)
"An Evolutionary Algorithm with Advanced Goal and Priority Specification for Multi-objective Optimization",
Volume 18, pages 183-215.
PostScript: volume18/tan03a.ps (1.9M)
compressed, volume18/tan03a.ps.Z (803K)
PDF: volume18/tan03a.pdf (498K)
Abstract: This paper presents an evolutionary algorithm with a new
goal-sequence domination scheme for better decision support in
multi-objective optimization. The approach allows the inclusion of
advanced hard/soft priority and constraint information on each
objective component, and is capable of incorporating multiple
specifications with overlapping or non-overlapping objective functions
via logical 'OR' and 'AND' connectives to drive the search
towards multiple regions of trade-off. In addition, we propose a
dynamic sharing scheme that is simple and adaptively estimated
according to the on-line population distribution without needing any a
priori parameter setting. Each feature in the proposed algorithm is
examined to show its respective contribution, and the performance of
the algorithm is compared with other evolutionary optimization
methods. It is shown that the proposed algorithm has performed well in
the diversity of evolutionary search and uniform distribution of
non-dominated individuals along the final trade-offs, without
significant computational effort. The algorithm is also applied to the
design optimization of a practical servo control system for hard disk
drives with a single voice-coil-motor actuator. Results of the
evolutionary designed servo control system show a superior closed-loop
performance compared to classical PID or RPT approaches.
Wilkins, D.E., Lee, T.J. and Berry, P. (2003)
"Interactive Execution Monitoring of Agent Teams",
Volume 18, pages 217-261.
PostScript: volume18/wilkins03a.ps (3.2M)
compressed, volume18/wilkins03a.ps.Z (2.3M)
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume18/wilkins03a-html/ex-mon-jair-l2h.html
PDF: volume18/wilkins03a.pdf (210K)
Abstract: There is an increasing need for automated support for humans
monitoring the activity of distributed teams of cooperating agents,
both human and machine. We characterize the domain-independent
challenges posed by this problem, and describe how properties of
domains influence the challenges and their solutions. We will
concentrate on dynamic, data-rich domains where humans are ultimately
responsible for team behavior. Thus, the automated aid should
interactively support effective and timely decision making by the
human. We present a domain-independent categorization of the types of
alerts a plan-based monitoring system might issue to a user, where
each type generally requires different monitoring techniques. We
describe a monitoring framework for integrating many domain-specific
and task-specific monitoring techniques and then using the concept of
value of an alert to avoid operator overload.
We use this framework to describe an execution monitoring approach we
have used to implement Execution Assistants (EAs) in two different
dynamic, data-rich, real-world domains to assist a human in
monitoring team behavior. One domain (Army small unit operations) has
hundreds of mobile, geographically distributed agents, a combination
of humans, robots, and vehicles. The other domain (teams of unmanned
ground and air vehicles) has a handful of cooperating robots. Both
domains involve unpredictable adversaries in the vicinity. Our
approach customizes monitoring behavior for each specific task, plan,
and situation, as well as for user preferences. Our EAs alert the
human controller when reported events threaten plan execution or
physically threaten team members. Alerts were generated in a timely
manner without inundating the user with too many alerts (less than
10 percent of alerts are unwanted, as judged by domain experts).
Poole, D. and Zhang, N.L. (2003)
"Exploiting Contextual Independence In Probabilistic Inference",
Volume 18, pages 263-313.
PostScript: volume18/poole03a.ps (1.6M)
compressed, volume18/poole03a.ps.Z (454K)
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume18/poole03a-html/poole03a.html
PDF: volume18/poole03a.pdf (236K)
Abstract: Bayesian belief networks have grown to prominence because
they provide compact representations for many problems for which
probabilistic inference is appropriate, and there are algorithms to
exploit this compactness. The next step is to allow compact
representations of the conditional probabilities of a variable given
its parents. In this paper we present such a representation that
exploits contextual independence in terms of parent contexts; which
variables act as parents may depend on the value of other
variables. The internal representation is in terms of contextual
factors (confactors) that is simply a pair of a context and a table.
The algorithm, contextual variable elimination, is based on the
standard variable elimination algorithm that eliminates the non-query
variables in turn, but when eliminating a variable, the tables that
need to be multiplied can depend on the context. This algorithm
reduces to standard variable elimination when there is no contextual
independence structure to exploit. We show how this can be much more
efficient than variable elimination when there is structure to
exploit. We explain why this new method can exploit more structure
than previous methods for structured belief network inference and an
analogous algorithm that uses trees.
Brafman, R.I. and Domshlak, C. (2003)
"Structure and Complexity in Planning with Unary Operators",
Volume 18, pages 315-349.
PostScript: volume18/brafman03a.ps (636K)
compressed, volume18/brafman03a.ps.Z (324K)
PDF: volume18/brafman03a.pdf (351K)
Abstract: Unary operator domains -- i.e., domains in which operators
have a single effect -- arise naturally in many control problems. In
its most general form, the problem of STRIPS planning in unary
operator domains is known to be as hard as the general STRIPS planning
problem -- both are PSPACE-complete. However, unary operator domains
induce a natural structure, called the domain's causal graph. This
graph relates between the preconditions and effect of each domain
operator. Causal graphs were exploited by Williams and Nayak in order
to analyze plan generation for one of the controllers in NASA's
Deep-Space One spacecraft. There, they utilized the fact that when
this graph is acyclic, a serialization ordering over any subgoal can
be obtained quickly. In this paper we conduct a comprehensive study of
the relationship between the structure of a domain's causal graph and
the complexity of planning in this domain. On the positive side, we
show that a non-trivial polynomial time plan generation algorithm
exists for domains whose causal graph induces a polytree with a
constant bound on its node indegree. On the negative side, we show
that even plan existence is hard when the graph is a directed-path
singly connected DAG. More generally, we show that the number of
paths in the causal graph is closely related to the complexity of
planning in the associated domain. Finally we relate our results to
the question of complexity of planning with serializable subgoals.
Patel-Schneider, P.F. and Sebastiani, R. (2003)
"A New General Method to Generate Random Modal Formulae for Testing Decision Procedures",
Volume 18, pages 351-389.
PostScript: volume18/patelschneider03a.ps (1.9M)
compressed, volume18/patelschneider03a.ps.Z (422K)
PDF: volume18/patelschneider03a.pdf (691K)
Abstract: The recent emergence of heavily-optimized modal decision
procedures has highlighted the key role of empirical testing in this
domain. Unfortunately, the introduction of extensive empirical tests
for modal logics is recent, and so far none of the proposed test
generators is very satisfactory. To cope with this fact, we present a
new random generation method that provides benefits over previous
methods for generating empirical tests. It fixes and much generalizes
one of the best-known methods, the random CNF_[]m test, allowing for
generating a much wider variety of problems, covering in principle the
whole input space. Our new method produces much more suitable test
sets for the current generation of modal decision procedures. We
analyze the features of the new method by means of an extensive
collection of empirical tests.
Lang, J., Liberatore, P. and Marquis, P. (2003)
"Propositional Independence - Formula-Variable Independence and
Forgetting",
Volume 18, pages 391-443.
PostScript: volume18/lang03a.ps (503K)
compressed, volume18/lang03a.ps.Z (216K)
PDF: volume18/lang03a.pdf (486K)
Abstract: Independence -- the study of what is relevant to a given
problem of reasoning -- has received an increasing attention from the
AI community. In this paper, we consider two basic forms of
independence, namely, a syntactic one and a semantic one. We show
features and drawbacks of them. In particular, while the syntactic
form of independence is computationally easy to check, there are cases
in which things that intuitively are not relevant are not recognized
as such. We also consider the problem of forgetting, i.e., distilling
from a knowledge base only the part that is relevant to the set of
queries constructed from a subset of the alphabet. While such process
is computationally hard, it allows for a simplification of subsequent
reasoning, and can thus be viewed as a form of compilation: once the
relevant part of a knowledge base has been extracted, all reasoning
tasks to be performed can be simplified.
Acid, S. and de Campos. L.M. (2003)
"Searching for Bayesian Network Structures in the Space of Restricted Acyclic Partially Directed Graphs",
Volume 18, pages 445-490.
PostScript: volume18/acid03a.ps (894K)
compressed, volume18/acid03a.ps.Z (306K)
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume18/acid03a-html/index.html
PDF: volume18/acid03a.pdf (544K)
Abstract: Although many algorithms have been designed to construct
Bayesian network structures using different approaches and principles,
they all employ only two methods: those based on independence
criteria, and those based on a scoring function and a search procedure
(although some methods combine the two). Within the score+search
paradigm, the dominant approach uses local search methods in the space
of directed acyclic graphs (DAGs), where the usual choices for
defining the elementary modifications (local changes) that can be
applied are arc addition, arc deletion, and arc reversal. In this
paper, we propose a new local search method that uses a different
search space, and which takes account of the concept of equivalence
between network structures: restricted acyclic partially directed
graphs (RPDAGs). In this way, the number of different configurations
of the search space is reduced, thus improving efficiency. Moreover,
although the final result must necessarily be a local optimum given
the nature of the search method, the topology of the new search space,
which avoids making early decisions about the directions of the arcs,
may help to find better local optima than those obtained by searching
in the DAG space. Detailed results of the evaluation of the proposed
search method on several test problems, including the well-known Alarm
Monitoring System, are also presented.
Reiter, E., Sripada, S.G., and Robertson, R. (2003)
"Acquiring Correct Knowledge for Natural Language Generation",
Volume 18, pages 491-516.
PostScript: volume18/reiter03a.ps (793K)
compressed, volume18/reiter03a.ps.Z (236K)
PDF: volume18/reiter03a.pdf (203K)
Abstract: Natural language generation (NLG) systems are computer software
systems that produce texts in English and other human languages, often
from non-linguistic input data. NLG systems, like most AI systems, need
substantial amounts of knowledge. However, our experience in two
NLG projects suggests that it is difficult to acquire correct knowledge
for NLG systems; indeed, every knowledge acquisition (KA) technique
we tried had significant problems. In general terms, these problems were
due to the complexity, novelty, and poorly understood nature of the
tasks our systems attempted, and were worsened by the fact that people
write so differently. This meant in particular that corpus-based KA
approaches suffered because it was impossible to assemble a sizable
corpus of high-quality consistent manually written texts in our domains;
and structured expert-oriented KA techniques suffered because experts
disagreed and because we could not get enough information about special
and unusual cases to build robust systems. We believe that such problems
are likely to affect many other NLG systems as well. In the long term,
we hope that new KA techniques may emerge to help NLG system builders.
In the shorter term, we believe that understanding how individual KA
techniques can fail, and using a mixture of different KA techniques
with different strengths and weaknesses, can help developers acquire
NLG knowledge that is mostly correct.
Volume 19
Zanuttini, B. (2003)
"New Polynomial Classes for Logic-Based Abduction",
Volume 19, pages 1-10.
PostScript: volume19/zanuttini03a.ps (174K)
compressed, volume19/zanuttini03a.ps.Z (85K)
Online Appendix1: volume19/zanuttini03a-appendix1.pdf (200K) Technical report with proofs and examples
HTML: http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume19/zanuttini03a-html/zanuttini03a-html.html
PDF: volume19/zanuttini03a.pdf (155K)
Abstract: We address the problem of propositional logic-based
abduction, i.e., the problem of searching for a best explanation for a
given propositional observation according to a given propositional
knowledge base. We give a general algorithm, based on the notion of
projection; then we study restrictions over the representations of the
knowledge base and of the query, and find new polynomial classes of
abduction problems.