next up previous
Next: Lower Envelopes (or Coherent Up: A Brief Introduction to Previous: Important Definitions: Conditional Preferences

Subsections


Lower Expectations and Lower Previsions

Here we look at two theories of decision and inference that are closely related to Quasi-Bayesian theory: lower expectations and lower previsions. The main message is that, as far as mathematical structure is concerned, the three of them refer to the same thing.


Lower Expectations

Suppose you have a loss function $l(\cdot)$. Now you pick a probability distribution $p(\cdot)$ and you calculate the expected loss using that probability distribution:

\begin{displaymath}
E[l] = \int \left( l(x) p(x) \right) dx.
\end{displaymath}

Suppose you repeat this for all probability distributions in a set of distributions. Then you will have a value of E[l] for each probability distribution in the set. Suppose the set of distributions is convex. A little bit of thinking will convince you that in this case you will produce an interval of values of E[l]. Why? Because if two distributions are in the credal set, the ``distributions within'' them are in the set (the set is convex), so an interval of expectations is created. So for every loss function $l(\cdot)$, a convex set of distributions determines an interval of expected losses. The minimum value of expected loss is called lower expectation and the maximum value of expected loss is called upper expectations.

\begin{displaymath}
\underline{E}[l] = \inf_{p \in K} E[l]
\end{displaymath}


\begin{displaymath}
\overline{E}[l] = \sup_{p \in K} E[l]
\end{displaymath}

In fact you do not need to store $\underline{E}[l]$ and $\overline{E}[l]$. We have the following fundamental relation:

\begin{displaymath}
\underline{E}[l] = - \overline{E}[-l],
\end{displaymath}

which indicates that a system of intervals of expected loss can be summarized by the lower expectations.

The Correspondence between Lower Expectations and Convex Sets of Distributions

Based on the definitions of lower and upper expectations, the following question arises:

Suppose you start with a system of intervals for expected loss. For every loss function you can think of, determine an interval on the real line, and say that interval is the interval of of expected losses. You don't have any particular model of probabilities at this point, just the expected loss intervals. You have created a lower expectation model.

Note: before, we started from a convex set of probability distributions and we calculated a system of intervals. Now we consider the reverse process.

You want to use the lower expectation model to create a set of probability distributions. You can do that by defining the set:

\begin{displaymath}
K = \mbox{ All distributions such that } E[l] \geq \underline{E}[l],
\mbox{ for all } l .
\end{displaymath}

Now the following question becomes relevant:

If I create a lower expectation and then I obtain K as in the previous expression, is it possible to recover exactly the original lower expectation by creating intervals with K? Are the lower expectation and the set K representing the same thing?

The answer is this:

If the lower expectation is superadditive:

\begin{displaymath}
\underline{E}[l_1 + l_2] \geq \underline{E}[l_1] + \underline{E}[l_2],
\end{displaymath}

and affinely homogeneous:

\begin{displaymath}
\underline{E}[\alpha l_1 + \beta] = \alpha \underline{E}[l_1] + \beta,
\end{displaymath}

then there is a one-to-one correspondence between convex sets of probability distributions and affinely superadditive lower expectation. This fundamental result is proved without much detail in [16] and with a lot of detail in [30].

(An operator $o(\cdot)$ is superadditive if $o(x+y) \geq o(x) + o(y)$. Notice that $\min$ and $\inf$ are superadditive.)

Note this other fundamental fact: if you have a set of probability distributions that creates an affinely superadditive lower expectation, the convex hull of this set of distributions will create the same lower expectation. Convex sets of probability distributions are convenient because they summarize all the information: you cannot generate a non-convex set K by the process above.

The relationship between credal sets and superadditive, positively affine lower expectations is simple: they are different representations of the same thing.

For lower expectation models that are not superadditive positively affine, the situation is more complicated. Suppose you pick one of those lower expectations, $\underline{E}$, and you generate the set K of all distributions that generate $E[l] \geq \underline{E}[l]$ (for all functions l). Now if you take the try to generate a lower expectation from K, it will not necessarily be equal to $\underline{E}$. But the lower expectation generated by K will be affinely superadditive. And from that point on, there will correspondence between the credal set and the lower expectation generated from it [16].

Lower Previsions

There is another name for affinely superadditive lower expectations, advocated by Walley [30]: lower previsions. The name prevision has some philosophical connotations because it emphasizes that an expected loss is a subjective ``guess'' about the future. But for any practical purposes lower previsions are exactly equal to affinely superadditive lower expectations.


next up previous
Next: Lower Envelopes (or Coherent Up: A Brief Introduction to Previous: Important Definitions: Conditional Preferences
Fabio Gagliardi Cozman
1999-12-30