 
 
 
 
 
   
Let  be the number of classes. Unless otherwise specified, an example
 be the number of classes. Unless otherwise specified, an example  is a couple
 is a couple  where
where  is an observation described over
 is an observation described over  variables, and
 variables, and  its
corresponding class among
 its
corresponding class among 
 ; to each example
 ; to each example  is associated a weight
is associated a weight  , representing its appearance probability
with respect to a learning sample
, representing its appearance probability
with respect to a learning sample  which we dispose of.
 which we dispose of.  is itself a subset of a whole domain which
we denote
 is itself a subset of a whole domain which
we denote  . Obviously, we do not have entire access to
. Obviously, we do not have entire access to  (
 (
 ) : in general, we even have
) : in general, we even have 
 (
 ( denotes the
cardinality; we suppose in all that follows that
 denotes the
cardinality; we suppose in all that follows that  is discrete with
finite cardinality). In the particular
case where
 is discrete with
finite cardinality). In the particular
case where  , the two classes are noted ``-'' (
, the two classes are noted ``-'' ( ) and ``+'' (
) and ``+'' ( ),  and called
respectively the negative and positive class. The learning sample is the union
of two samples, noted
),  and called
respectively the negative and positive class. The learning sample is the union
of two samples, noted  and
 and  , containing respectively the negative
and positive examples. It is worthwhile to think the positive examples as
belonging to a subset of
, containing respectively the negative
and positive examples. It is worthwhile to think the positive examples as
belonging to a subset of  containing all possible positive examples,
usually called the target concept.
 containing all possible positive examples,
usually called the target concept.
As part of our goal in machine learning, is the need to build a reliable
approximation to the true classification of the examples in  , that is, a good approximation of the target concept, by
using only the examples in
, that is, a good approximation of the target concept, by
using only the examples in  . Good approximations shall have a high accuracy over
. Good approximations shall have a high accuracy over  , although we do not have access to this quantity,
but rather to its estimator: a more or less reliable accuracy computable over
, although we do not have access to this quantity,
but rather to its estimator: a more or less reliable accuracy computable over  . We refer
the reader to standard machine learning books [Mitchell1997] for further
considerations about this issue. A DC contains two parts:
. We refer
the reader to standard machine learning books [Mitchell1997] for further
considerations about this issue. A DC contains two parts:
 where each
 where each
   is a monomial (a conjunction of literals) over
 is a monomial (a conjunction of literals) over 
 (
 ( being the number of
  description variables, each
 being the number of
  description variables, each  is a positive literal and each
 is a positive literal and each
  
 is a negative literal), and each
 is a negative literal), and each  is a vector in
 is a vector in
   . For the sake of readability, this vectorial notation shall be kept throughout all the paper,
  even for problems with only two classes. One might choose to add a single
  real rather than a 2-component vector in that case.
. For the sake of readability, this vectorial notation shall be kept throughout all the paper,
  even for problems with only two classes. One might choose to add a single
  real rather than a 2-component vector in that case.
 in
 in ![$[0,1]^c$](img38.png) . Again, in the two-class case, it is sufficient to replace
. Again, in the two-class case, it is sufficient to replace  by a default class in
 by a default class in  .
.
 and any monomial
 and any monomial  , the proposition ``
, the proposition `` satisfies
 satisfies  '' is denoted by
'' is denoted by 
 . The opposite proposition ``
. The opposite proposition `` does not satisfy
 does not satisfy  '' is denoted by ``
'' is denoted by ``
 ''. The classification of any observation
''. The classification of any observation  is made in the following way: define
 is made in the following way: define  as follows
 as follows 
 
 is then:
 is then:
 if
 if 
 , and
, and 
 otherwise.
 otherwise. 
 is unique, then the
index gives the class assigned to
 is unique, then the
index gives the class assigned to  . Otherwise, we take the index of the
maximal component of
. Otherwise, we take the index of the
maximal component of  corresponding to the maximal component of
 corresponding to the maximal component of
 (ties are solved by a random choice among the maximal components).
 (ties are solved by a random choice among the maximal components).
 and each monomial is present
at most once. The values
 and each monomial is present
at most once. The values  ,
,  ,
,  allow natural interpretations of the rules, 
being either in favor of the corresponding class (
 allow natural interpretations of the rules, 
being either in favor of the corresponding class ( ), neutral with respect
to the class (
), neutral with respect
to the class ( ), or in disfavor of the corresponding class (
), or in disfavor of the corresponding class ( ). This subclass, to which we relate as DC
). This subclass, to which we relate as DC
 , is, as we now prove, suffering the same algorithmic drawbacks as DT [Hyafil  Rivest1976] and DL [Nock  Jappy1998]: even without restricting the components of the vectors, or with any restriction to a set containing at least one real value, the construction of small formulas with sufficiently high accuracy is hard. This is a clear motivation for using heuristics in decision committee's induction.
, is, as we now prove, suffering the same algorithmic drawbacks as DT [Hyafil  Rivest1976] and DL [Nock  Jappy1998]: even without restricting the components of the vectors, or with any restriction to a set containing at least one real value, the construction of small formulas with sufficiently high accuracy is hard. This is a clear motivation for using heuristics in decision committee's induction.
 
 
 
 
