<?xml version="1.0"?> 
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "mathml.dtd"> 
<?xml-stylesheet type="text/css" href="thesis.css"?> 
<html  
xmlns="http://www.w3.org/1999/xhtml"  
><head><title>9.1 Introduction</title> 
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1" /> 
<meta name="generator" content="TeX4ht (http://www.cis.ohio-state.edu/~gurari/TeX4ht/mn.html)" /> 
<meta name="originator" content="TeX4ht (http://www.cis.ohio-state.edu/~gurari/TeX4ht/mn.html)" /> 
<!-- 3,early_,early^,xhtml,mozilla --> 
<meta name="src" content="thesis.tex" /> 
<meta name="date" content="2002-08-28 13:56:00" /> 
<link rel="stylesheet" type="text/css" href="thesis.css" /> 
</head><body 
>
   <div class="crosslinks"><p class="noindent">[<a 
href="thesisse40.xml" >next</a>] [<a 
href="#tailthesisse39.xml">tail</a>] [<a 
href="thesisch9.xml#thesisse39.xml" >up</a>] </p></div>
   <h3 class="sectionHead"><span class="titlemark">9.1. </span> <a 
  name="x56-810009.1"></a>Introduction</h3>
<!--l. 3474--><p class="noindent">Covering number bounds are used to bound the true error rate
of classifiers chosen from an infinite hypothesis space using <!--l. 3475--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>m</mi></mrow></math>
examples <span class="cite">[<a 
href="thesisli2.xml#XHaussler"><span 
class="ecbx-1000">20</span></a>]</span>. A cover is a finite set of hypotheses which satisfies the following property:
every hypothesis in the infinite space is &#x201C;near&#x201D; some element in the finite cover. When a
Lipschitz condition holds on the hypothesis space, it is generally possible to construct
these covers and the existence of a cover is required for learnability <span class="cite">[<a 
href="thesisli2.xml#XBI"><span 
class="ecbx-1000">2</span></a>]</span>. Alternatively,
Sauer&#x2019;s lemma (see <span class="cite">[<a 
href="thesisli2.xml#XPollard"><span 
class="ecbx-1000">43</span></a>]</span> or <span class="cite">[<a 
href="thesisli2.xml#XVapnik"><span 
class="ecbx-1000">51</span></a>]</span>) bounds the size of the cover in terms of the VC
dimension which is defined combinatorially: the VC dimension is the largest
number of examples which the hypothesis space can classify in an arbitrary
manner.
</p><!--l. 3485--><p class="indent">   The principal disadvantage of covering number results is that they are notoriously
loose, to the point that they are often useless when applied in practice (see &#x201C;criticisms&#x201D; in
<span class="cite">[<a 
href="thesisli2.xml#XHaussler"><span 
class="ecbx-1000">20</span></a>]</span>). Here, &#x201C;useless&#x201D; means that the bound on the true error rate is &#x201C;always wrong&#x201D;. The
amount of &#x201C;looseness&#x201D; can be quantified by comparison with other bounds in
the regimes where other bounds hold. On a finite hypothesis space we have
near-perfect agreement between the upper bound  <a 
href="thesisse16.xml#x23-32001r1">4.2.1<!--tex4ht:ref: th-DHSCP --></a> and the lower upper
bound  <a 
href="thesisse18.xml#x25-34006r2">4.4.2<!--tex4ht:ref: th-dhlub --></a> for independent hypotheses. In fact, as the number of examples goes
to infinity, the agreement is perfect, regardless of the size of the hypothesis
space. When we apply covering number bounds to this problem such properties
do not arise. Since part of the argument involves splitting the examples into <!--l. 3495--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mn>2</mn></mrow></math> sets,
the difference between a covering number based upper bound and the lower upper bound
can be large even when the number of examples goes to infinity. In practice, the covering
number bound at least squares the discrete hypothesis size. Obviously, further
loosening of covering number bounds with Sauer&#x2019;s lemma result in even worse
bounds.
</p><!--l. 3502--><p class="indent">   Can we construct a calculable true error upper bound for continuous hypothesis
spaces which is at least asymptotically tight? In a sense, this has already been done with
PAC-Bayes bounds in chapter  <a 
href="thesisch6.xml#x37-570006">6<!--tex4ht:ref: sec-PB --></a>, but there are drawbacks in applicability to that
approach since PAC-Bayes bounds do not apply in a meaningful way to a single
hypothesis drawn from an infinite hypothesis space. A covering number argument would
hopefully apply in a meaningful way to a singly hypothesis. A covering number bound
                                                                     

                                                                     
which is asymptotically tight on <span 
class="ecti-1000">some </span>learning problems does exist and is covered
next.
</p><!--l. 3512--><p class="indent">
                                                                     

                                                                     
</p>
   <div class="crosslinks"><p class="noindent">[<a 
href="thesisse40.xml" >next</a>] [<a 
href="thesisse39.xml" >front</a>] [<a 
href="thesisch9.xml#thesisse39.xml" >up</a>] </p></div><a 
  name="tailthesisse39.xml"></a>  
</body> 
</html> 
