Next: Experimental Results Up: No Title Previous: Empirical Analysis

The EUREKA System

The empirical and theoretical comparisons presented in the previous section indicate that alternative parallel search strategies perform well under different conditions, and that performance trends do exist which can be used to automatically select strategies and parameter settings for new problems. However, the results of these studies are not sufficient for automatically determining the appropriate set of strategies. One limitation is that information used to generate the formulas and to control the experiments, such as goal position, is not known in advance. Another limitation is that some of the assumptions, such as a constant branching factor, are not realistic for many applications. As a result, we need a method to automatically select optimal strategies for real-world problems given information that is available.

In response to this need, we add a machine learning component to the EUREKA system. EUREKA merges many of the approaches to parallel search discussed the previous section. Parameters can be set that control the task distribution strategy, the load balancing strategies, and the ordering techniques. In particular, the strategies that can be selected include:

Distribution strategy [Kumar and Rao, Breadth-first]
Number of clusters [1 $\ldots$ #processors]
Load balancing [On, Off]
Processor selection [Neighbor, Random]
Percentage of stack distributed upon request [0% $\ldots$ 100%]
Donating strategy [HeadOfList, TailOfList]
Anticipatory load balancing trigger [0..stack_size]
Ordering [Fixed, Local, TOIDA]

The user is allowed to determine the types of strategies to be used for a given problem. However, because such a decision is difficult to make without significant information about the search space, EUREKA also has the capability of making all necessary strategy selections. Based on the characteristics of a search space, EUREKA automatically configures itself to optimize performance on a particular application. Sample problems are fed as training examples to a machine learning system, which in turn learns the optimal strategies to apply to particular classes of search spaces. Applying machine learning to optimize software applications has been pursued in other areas of research. For example, Minton [23] has applied a similar technique to automatically synthesize problem-specific versions of constraint-satisfaction algorithms. Research in other areas of computer science has yielded similar ideas of customizable environments applied to computer networks [4,33] and to interactive human-computer interfaces [15,20]. This work is unique in allowing both problem-specific and architecture-specific features to influence the choice of strategies and in applying adaptive software techniques to parallel search. EUREKA also offers a framework that can potentially automate both static and dynamic software customization.

To perform parallel search, EUREKA executes the following steps:

1.

Timings from sample problem instances are captured, varying each strategy parameter independently. In order to acquire an effective sample set, problems are selected from a variety of domains and parallel architectures.

2.

For each problem instance, EUREKA captures features of the problem space that are known to affect the optimal choice of strategy. These features include:

Branching Factor (b):: The average branching factor of the search tree.
Heuristic Error (herror):: The difference, on average, between the estimated distance to a goal node and the true distance to the closest goal node. This is estimated from the shallow search by computing the difference between the estimated distance to the goal at the beginning of the search and the smallest estimated distance to the goal for all leaf nodes of the shallow search, minus the actual distance from the root node to the leaf node.
Imbalance (imb):: The degree to which nodes are unevenly distributed among subtrees in the search space.
Goal Location (loc):: The left-to-right position of the first optimal solution node. This is estimated from the shallow search by determining which subtree contains nodes with the lowest estimated distances to the goal.
Heuristic Branching Factor (hbf):: The ratio of nodes expanded between the current and previous IDA* iterations.

Features from the non-goal iterations represent attributes for the test case, and the strategy that results in the best performance (shortest run time) represents the correct classification of the problem instance for a given strategy choice.

3.

Problem attributes are combined with the corresponding classes and are fed as training examples to a machine learning system. We use C4.5 [26] to induce a decision tree from the pre-classified cases. A rule base is generated for each concept to be learned, corresponding to each of the strategy decisions listed above that need to be made.

4.

To solve a new problem, EUREKA performs a shallow search through the space until roughly 200,000 nodes are expanded. If a goal is not found during the shallow search, the features of the tree are calculated at this point and used to index appropriate rules from the C4.5 database.

5.

The learned rules recommend strategy choices given the features of the new problem space. EUREKA then initiates a parallel search from the root of the space, employing the selected strategies. For many applications, the initial expansion takes only a few seconds and does not greatly affect the runtime of the search algorithm.

The described set of features and the amount of time to spend on the initial EUREKA iteration are chosen based on our experimental data to yield the most helpful information in the shortest time. Searching enough iterations of the problem space until 200,000 nodes are generated takes less than 10 seconds on the problem domains we tested. Spending less time than this may yield erroneous information because features of the tree do not stabilize until several levels down in the tree. Searching additional iterations in general does not significantly improve the quality of information and the time requirements grow exponentially. Improved approaches may include performing the initial search until the stability of the space reaches a prescribed level, or periodically updating the C4.5 choices and adjusting the parallel search approaches dynamically during execution.

Performing an initial shallow search has also been shown to be effective in other parallel search research. For example, Suttner [34] performs parallel search of the first few IDA* iterations twice during parallel search in order to determine the number of individual tasks that should be distributed to each processor. Cook et al. [10] also perform an initial shallow search in order to obtain more accurate operator ordering information to use in reordering the search space. In each case, the amount of time required to perform the extra initial search is minimal yet greatly improves the performance of the overall search task.

The selected features each demonstrate a significant influence on the optimal search strategy. Although feature values can change dramatically from one problem to the next, each feature remains fairly stable between levels of the same tree. As a result, computing the values of these features at a low level in the tree provides a good indication of the structure of the entire tree. The ``Deviation Within Problems'' entries in Table 1 show the standard deviation of each feature value over all search tree levels. The results are averaged over 98 problem instances in the fifteen puzzle domain¹ and 20 problem instances in the robot arm motion planning domain. Both of these test domains are described in the next section. The low values in these columns indicate that the features provide a good indicator of the structure of the problem space at all levels in the tree. In contrast, the ``Deviation Between Problems'' table entries show the standard deviation of the average feature value (averaged over all levels in each tree) over all problem spaces. The larger values in these columns indicate that the features can effectively distinguish between different problem spaces.

Table 1: Problem Feature Value Deviations

		Deviation Within Problems				Deviation Between Problems
Domain	Stat	Bf	Herror	Imb	Hbf	Bf	Herror	Imb	Hbf
15puzzle	Std Dev	0.000	1.748	0.001	2.839	0.002	2.577	0.008	4.889
	Avg Value	1.509	3.039	0.286	6.938	1.509	3.039	0.286	6.938
RMP	Std Dev	0.050	9.331	0.001	0.002	0.205	43.400	0.002	0.138
	Avg Value	2.643	11.265	0.148	1.105	2.643	11.265	0.148	1.105

EUREKA is written in C and is currently implemented on an nCUBE 2, on Linux workstations using the Parallel Virtual Machine (PVM) communication software, on a DEC Alpha using an implementation of Posix threads, on Windows NT running Java threads and Cilk threads, and as a distributed Java system using RMI. We are also currently investigating means of dynamically switching between strategies during execution as the problem structure or environment changes.

Next: Experimental Results Up: No Title Previous: Empirical Analysis