The Exploring Process and the Evaluation of Candidate Structures

The search method we have described may be applied in combination with any score equivalent function

(for example the AIC, BIC, MDL and BDe scoring functions are score equivalent). An easy (but inefficient) way to integrate our search method with a score equivalent function would be as follows: given an RPDAG

to be evaluated, select any extension

and compute

. We could also use other (non-equivalent) scoring functions, although the score of

would depend on the selected extension.

However, let us consider the case of a decomposable scoring function

: the DAG obtained by adding or removing an arc from the current DAG

can be evaluated by modifying only one local score:

Using decomposable scoring functions, the process of selecting, given an RPDAG, a representative DAG and then evaluating it may be quite inefficient, since we would have to recompute the local scores for all the nodes instead of only one local score. This fact can make a learning algorithm that searches in the space of equivalence classes of DAGs considerably slower than an algorithm that searches in the space of DAGs (this is the case of the algorithm proposed by [16]).

Our search method can be used for decomposable scoring functions so that: (1) it is not necessary to transform the RPDAG into a DAG, the RPDAG can be evaluated directly, and (2) the score of any neighboring RPDAG can be obtained by computing at most two local scores. All the advantages of the search methods on the space of DAGs are therefore retained, but a more reduced and robust search space is used.

Before these assertions are proved, let us examine an example. Consider the RPDAG

in Figure 15 and the three neighboring configurations produced by the inclusion of an edge between

and

(also displayed in Figure 15).

**Figure 15:** An RPDAG and three neighboring configurations , and
$\begin{figure}\centerline{\psfig{figure=./figuras/mievaluating.eps,height=.3\textwidth}}\end{figure}$

The score of each of these RPDAGs is equal to the score of any of their extensions. Figure 16 displays one extension for each neighboring configuration.

**Figure 16:** Extensions , and of the RPDAGs , and in Fig. 15
$\begin{figure}\centerline{\psfig{figure=./figuras/miextensions.eps,height=.3\textwidth}}\end{figure}$

$\displaystyle g(G_1:D)=g(H_1:D)$	$\textstyle =$	$\displaystyle g_D(x,\emptyset)+g_D(a,\{xb\})+g_D(b,c)+g_D(c,y)+g_D(y,x)+g_D(d,y)$
$\displaystyle g(G_2:D)= g(H_2:D)$	$\textstyle =$	$\displaystyle g_D(x,\emptyset)+g_D(a,\{xb\})+g_D(b,c)+g_D(c,y)+g_D(y,\{xd\})+g_D(d,\emptyset)$
$\displaystyle g(G_3:D)= g(H_3:D)$	$\textstyle =$	$\displaystyle g_D(x,\emptyset)+g_D(a,\{xb\})+g_D(b,c)+g_D(c,\emptyset)+g_D(y,\{xc\})+g_D(d,y)$

For each extension

of any neighboring configuration

, it is always possible to find an extension $H_{Gi}$ of the current RPDAG

such that the scores of

and $H_{Gi}$ only differ in one local score (Figure 17 displays these extensions). We can then write:

$\displaystyle g(G:D)=g(H_{G1}:D)$	$\textstyle =$	$\displaystyle g_D(x,\emptyset)+g_D(a,\{xb\})+g_D(b,c)+g_D(c,y)+g_D(y,\emptyset)+g_D(d,y)$
$\displaystyle g(G:D)=g(H_{G2}:D)$	$\textstyle =$	$\displaystyle g_D(x,\emptyset)+g_D(a,\{xb\})+g_D(b,c)+g_D(c,y)+g_D(y,d)+g_D(d,\emptyset)$
$\displaystyle g(G:D)=g(H_{G3}:D)$	$\textstyle =$	$\displaystyle g_D(x,\emptyset)+g_D(a,\{xb\})+g_D(b,c)+g_D(c,\emptyset)+g_D(y,c)+g_D(d,y)$

**Figure 17:** Three different extensions $H_{G1}$ , $H_{G2}$ and $H_{G3}$ of the RPDAG in Fig. 15
$\begin{figure}\centerline{\psfig{figure=./figuras/miextensions2.eps,height=.3\textwidth}}\end{figure}$

Therefore, the score of any neighboring configuration may be obtained from the score of

by computing only two local scores. Note that some of these local scores may have already been computed at previous iterations of the search process: for example, $g_D(y,\emptyset)$ had to be used to score the initial empty RPDAG, and either

could have been computed when the link $y\mbox{-}d$ or $y\mbox{-}c$ was inserted into the structure.

Proposition 6 Let

be an RPDAG and

be any RPDAG obtained by applying one of the operators described in Table 2 to

. Let

be a score equivalent and decomposable function.

(a)

If the operator is $A\_link(x,y)$ then

$\begin{displaymath} g(G':D)=g(G:D)-g_D(y,\emptyset)+g_D(y,\{x\}) \end{displaymath}$

(b)

If the operator is $A\_arc(x,y)$ then

$\begin{displaymath} g(G':D)=g(G:D)-g_D(y,Pa_G(y))+g_D(y,Pa_G(y)\cup\{x\}) \end{displaymath}$

(c)

If the operator is $A\_hh(x,y,z)$ then

$\begin{displaymath} g(G':D)=g(G:D)-g_D(y,\{z\})+g_D(y,\{x,z\}) \end{displaymath}$

(d)

If the operator is $D\_link(x,y)$ then

$\begin{displaymath} g(G':D)=g(G:D)-g_D(y,\{x\})+g_D(y,\emptyset) \end{displaymath}$

(e)

If the operator is $D\_arc(x,y)$ then

$\begin{displaymath} g(G':D)=g(G:D)-g_D(y,Pa_G(y))+g_D(y,Pa_G(y)\setminus\{x\}) \end{displaymath}$

(1) First, we shall prove that we can construct an extension

and another extension

, such that

and

differ in only one arc (this arc being $x\rightarrow y$ ).

$\bullet$ Consider the cases (a), (b), and (c), which correspond to the addition of an edge between

and

: in case (a), $G'=G\cup\{x\mbox{-}y\}$ and let

be an extension of

that contains the arc $x\rightarrow y$ ; in case (b), where $G'=G\cup\{x\rightarrow y\}$ , and in case (c), where $G'=(G\setminus\{y\mbox{-}z\})\cup\{x\rightarrow y\leftarrow z\}$ , let

be any extension of

(which will contain the arc $x\rightarrow y$ ). In all three cases, let $H=H'\setminus\{x\rightarrow y\}$ . We shall prove that

is an extension of

- Secondly, if $u\rightarrow v\in G$ (in either case $u\rightarrow v\neq x\rightarrow y$ ), then $u\rightarrow v\in G'$ . As

is an extension of

, then $u\rightarrow v\in H'$ , and this implies that $u\rightarrow v\in H$ . Therefore, all the arcs in

are also arcs in

. This result also ensures that every h-h pattern in

is also an h-h pattern in

- Thirdly, if $u\rightarrow v\leftarrow w$ is an h-h pattern in

(in either case $u\rightarrow v\leftarrow w\neq x\rightarrow y\leftarrow w$ ), then $u\rightarrow v\leftarrow w\in H'$ . Once again, as

is an extension of

, we can see that $u\rightarrow v\leftarrow w\in G'$ , and then $u\rightarrow v\leftarrow w\in G$ . So,

and

have the same h-h patterns.

is therefore an extension of

, according to Definition 2. Note that $\forall u\neq y \; Pa_H(u)=Pa_{H'}(u)$ and $Pa_H(y)=Pa_{H'}(y)\setminus\{x\}$ .

$\bullet$ Let us now consider cases (d) and (e), which correspond to the deletion of an edge between

and

(either a link or an arc, respectively): in case (d), let

be an extension of

containing the arc $x\rightarrow y$ ; in case (e), let

be any extension of

. In both cases, let $H'=H\setminus\{x\rightarrow y\}$ . We will prove that

is an extension of

- Secondly, if $u\rightarrow v\in G'$ (note that $u\rightarrow v\neq x\rightarrow y$ ), then $u\rightarrow v\in G$ . As

is an extension of

, then $u\rightarrow v\in H$ , and therefore $u\rightarrow v\in H'$ . So, all the arcs in

are also arcs in

. Moreover, every h-h pattern in

is also an h-h pattern in

- Thirdly, if $u\rightarrow v\leftarrow w$ is an h-h pattern in

(and we know that $u\rightarrow v\leftarrow w\neq x\rightarrow y\leftarrow w$ ), then $u\rightarrow v\leftarrow w\in H$ . As

is an extension of

, then $u\rightarrow v\leftarrow w\in G$ . Therefore, $u\rightarrow v\leftarrow w\in G'$ (the removal of the arc $x\rightarrow y$ cannot destroy any h-h pattern where $x\rightarrow y$ is not involved). So,

and

have the same h-h patterns.

In this way,

is an extension of

. Moreover, we can see that $\forall u\neq y \; Pa_{H'}(u)=Pa_H(u)$ and $Pa_{H'}(y)=Pa_H(y)\setminus\{x\}$ .

(2) The scores of

and

are the same as the scores of

and

respectively, since

is score equivalent. Moreover, as

is decomposable, we can write

(a) In this case, we know from Table 2 that $Pa_G(y)=\emptyset$ . Moreover, $Pa_{G'}(y)=\emptyset$ (because we are inserting a link) and $Pa_{H'}(y)\neq\emptyset$ (because

is an extension of

that contains the arc $x\rightarrow y$ ). Then, from Proposition 5 we obtain $\vert Pa_{H'}(y)\vert=1$ , i.e. $Pa_{H'}(y)=\{x\}$ . Moreover, $Pa_H(y)=Pa_{H'}(y)\setminus\{x\}=\emptyset$ . So, Eq. (4) becomes

If $Pa_G(y)\neq\emptyset$ , from Proposition 5 we obtain

. Moreover, $Pa_{H'}(y)=Pa_H(y)\cup\{x\}=Pa_G(y)\cup\{x\}$ .

If $Pa_G(y)=\emptyset$ then $Pa_{G'}(y)=\{x\}$ (because we are adding the arc $x\rightarrow y$ ). From Proposition 5 we obtain $Pa_{H'}(y)=Pa_{G'}(y)=\{x\}=Pa_G(y)\cup\{x\}$ . Moreover, $Pa_H(y)=Pa_{H'}(y)\setminus\{x\}=\emptyset=Pa_G(y)$ .

(c) In this case, $Pa_G(y)=\emptyset$ and $Pa_{G'}(y)=\{x,z\}$ . From Proposition 5 we obtain $Pa_{H'}(y)=\{x,z\}$ . Moreover, $Pa_H(y)=Pa_{H'}(y)\setminus\{x\}=\{z\}$ . Then, Eq. (4) becomes

(d) As $Pa_G(y)=\emptyset$ and

is an extension of

containing the arc $x\rightarrow y$ , from Proposition 5 we get $Pa_H(y)=\{x\}$ . Moreover, $Pa_{H'}(y)=Pa_H(y)\setminus\{x\}=\emptyset$ . In this case Eq. (4) becomes

(e) In this case, as $Pa_G(y)\neq\emptyset$ , Proposition 5 asserts that

. Moreover, $Pa_{H'}(y)=Pa_H(y)\setminus\{x\}=Pa_G(y)\setminus\{x\}$ . Therefore, Eq. (4) becomes

$\displaystyle g(G_1:D)$	$\textstyle =$	$\displaystyle g(G:D)-g_D(y,\emptyset)+g_D(y,x)$
$\displaystyle g(G_2:D)$	$\textstyle =$	$\displaystyle g(G:D)-g_D(y,d)+g_D(y,\{xd\})$
$\displaystyle g(G_3:D)$	$\textstyle =$	$\displaystyle g(G:D)-g_D(y,c)+g_D(y,\{xc\})$

$\displaystyle g(H\cup\{x\rightarrow y\}:D)=g(H:D)-g_D(y, Pa_H(y))+g_D(y, Pa_H(y)\cup\{x\})$
$\displaystyle g(H\setminus\{x\rightarrow y\}:D)=g(H:D)-g_D(y, Pa_H(y))+g_D(y, Pa_H(y)\setminus\{x\})$