<?xml version="1.0"?> 
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "mathml.dtd"> 
<?xml-stylesheet type="text/css" href="thesis.css"?> 
<html  
xmlns="http://www.w3.org/1999/xhtml"  
><head><title>9.2 The Setting and Prior Results</title> 
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1" /> 
<meta name="generator" content="TeX4ht (http://www.cis.ohio-state.edu/~gurari/TeX4ht/mn.html)" /> 
<meta name="originator" content="TeX4ht (http://www.cis.ohio-state.edu/~gurari/TeX4ht/mn.html)" /> 
<!-- 3,early_,early^,xhtml,mozilla --> 
<meta name="src" content="thesis.tex" /> 
<meta name="date" content="2002-08-28 13:56:00" /> 
<link rel="stylesheet" type="text/css" href="thesis.css" /> 
</head><body 
>
   <div class="crosslinks"><p class="noindent">[<a 
href="thesisse41.xml" >next</a>] [<a 
href="thesisse39.xml" >prev</a>] [<a 
href="thesisse39.xml#tailthesisse39.xml" >prev-tail</a>] [<a 
href="#tailthesisse40.xml">tail</a>] [<a 
href="thesisch9.xml#thesisse40.xml" >up</a>] </p></div>
   <h3 class="sectionHead"><span class="titlemark">9.2. </span> <a 
  name="x57-820009.2"></a>The Setting and Prior Results</h3>
<!--l. 3514--><p class="noindent">We first review standard covering number bounds. Define a
&#x201C;distance&#x201D; between two hypotheses as the probability that they disagree: <!--l. 3516--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">
<mrow 
>
                       <msub><mrow 
><mi 
>d</mi></mrow><mrow 
><mi 
>D</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi><mo 
class="MathClass-punc">,</mo><mi 
>f</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo><msub><mrow 
><mo 
> Pr</mo></mrow><mrow 
><mi 
>D</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-rel">&#x2260;</mo><mi 
>f</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow>
</mrow></math> Next, consider the size of the smallest epsilon
net, defined by: <!--l. 3519--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">       <mrow 
>
                   <mi 
>N</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>H</mi><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo><msub><mrow 
><mi 
>d</mi></mrow><mrow 
><mi 
>D</mi></mrow></msub 
></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo><msub><mrow 
> <!--mstyle 
class="text"--><mtext class="textrm">inf</mtext><!--/mstyle--></mrow><mrow 
><mi 
>F</mi></mrow></msub 
> <mfenced separators="" 
open="|"  close="|" ><mrow><mi 
>F</mi> <mo 
class="MathClass-punc">:</mo>  <mi 
>&#x2200;</mi><mi 
>h</mi> <mo 
class="MathClass-rel">&#x2208;</mo> <mi 
>H</mi><mi 
>&#x2203;</mi><mi 
>f</mi> <mo 
class="MathClass-rel">&#x2208;</mo> <mi 
>F</mi> <mo 
class="MathClass-punc">:</mo>  <msub><mrow 
><mi 
>d</mi></mrow><mrow 
><mi 
>D</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi><mo 
class="MathClass-punc">,</mo><mi 
>f</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B5;</mi></mrow></mfenced>
</mrow></math>
This quantity is the minimum size of an epsilon net, that is, a set which contains an element &#x201C;near&#x201D; (within distance &#x03B5; of) every element
in <!--l. 3522--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>H</mi></mrow></math>.
</p><!--l. 3524--><p class="indent">   Then a covering number is defined as: <!--l. 3525--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">
<mrow 
>
                       <mi 
>C</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>H</mi><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B5;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo><msub><mrow 
> <!--mstyle 
class="text"--><mtext class="textrm">sup</mtext><!--/mstyle--></mrow><mrow 
><mi 
>D</mi></mrow></msub 
><mi 
>N</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>H</mi><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo><msub><mrow 
><mi 
>d</mi></mrow><mrow 
><mi 
>D</mi></mrow></msub 
></mrow><mo 
class="MathClass-close">)</mo></mrow>
</mrow></math> The
covering number is the size of the smallest epsilon net under the worst-case distribution.
</p>
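<p class="indent">   The definitions above can be made concrete with a minimal sketch, assuming a finite hypothesis set and a distribution supported on finitely many points. The names <code>disagreement</code> and <code>greedy_cover</code>, and the toy threshold classifiers, are hypothetical and chosen only for illustration. The greedy procedure returns a valid epsilon net, so its size is an upper bound on the minimum size and need not equal it.</p>
<pre>
```python
def disagreement(h, f, points, probs):
    """d_D(h, f): probability mass of the points where h and f disagree."""
    return sum(p for x, p in zip(points, probs) if h(x) != f(x))


def greedy_cover(hypotheses, eps, points, probs):
    """Greedily pick centres until every hypothesis is within eps of one.

    The result is a valid epsilon net, so its size upper-bounds the
    minimum net size; it is not guaranteed to attain the minimum.
    """
    cover = []
    for h in hypotheses:
        if not any(eps >= disagreement(h, f, points, probs) for f in cover):
            cover.append(h)
    return cover


# Hypothetical toy setting: threshold classifiers on five equally likely points.
points = [0, 1, 2, 3, 4]
probs = [0.2] * 5
hypotheses = [lambda x, t=t: x >= t for t in range(6)]

cover = greedy_cover(hypotheses, 0.25, points, probs)
print(len(cover))
```
</pre>
<p class="indent">   In this toy setting two thresholds disagree on 0.2 of the mass per unit of separation, so the thresholds 1 and 4 alone would already form a net at scale 0.25, while the greedy pass keeps three centres. Taking the supremum over distributions, as the covering number requires, is not attempted here.</p>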
   <div class="newtheorem">
<!--l. 3529--><p class="noindent"><span class="head">
<a 
  name="x57-82001r1"></a>
  <span 
class="eccc-1000">T<small 
class="small-caps">H</small><small 
class="small-caps">E</small><small 
class="small-caps">O</small><small 
class="small-caps">R</small><small 
class="small-caps">E</small><small 
class="small-caps">M</small> </span>9.2.1<span 
class="eccc-1000">.</span></span>
</p><!--l. 3530--><p class="indent">   <span 
class="ecti-1000">(Covering number bound) For all </span><!--l. 3530--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>&#x03B4;</mi> <mo 
class="MathClass-rel">&#x2208;</mo> <mrow><mo 
class="MathClass-open">(</mo><mrow><mn>0</mn><mo 
class="MathClass-punc">,</mo><mn>1</mn></mrow><mo 
class="MathClass-close">]</mo></mrow></mrow></math><span 
class="ecti-1000">:</span>
<!--l. 3531--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">    <mrow 
>
                   <mi 
>&#x2200;</mi><mi 
>H</mi> <msub><mrow 
><mo 
>Pr</mo></mrow><mrow 
><msup><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>m</mi></mrow></msup 
></mrow></msub 
> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>e</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2265;</mo><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo>&#x0302;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-bin">+</mo> <mn>4</mn><msqrt><mi 
></mi>
 <mrow><mfrac><mrow 
><mo 
> ln</mo> <!--nolimits--> <mn>4</mn><mi 
>C</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>H</mi><mo 
class="MathClass-punc">,</mo> <mi 
>&#x03B5;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-bin">+</mo><mo 
> ln</mo> <!--nolimits--> <mfrac> <mrow 
> <mn>1</mn></mrow> 
<mrow 
><mi 
>&#x03B4;</mi></mrow></mfrac></mrow>
           <mrow 
><mi 
>m</mi></mrow></mfrac></mrow></msqrt>         </mrow></mfenced> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B4;</mi>
</mrow></math>
</p>
   </div>
   <div class="proof">
<!--l. 3536--><p class="indent">   <span class="head">
   <span 
class="eccc-1000">P<small 
class="small-caps">R</small><small 
class="small-caps">O</small><small 
class="small-caps">O</small><small 
class="small-caps">F</small>.</span> </span>In <span class="cite">[<a 
href="thesisli2.xml#XHaussler"><span 
class="ecbx-1000">20</span></a>]</span>. <span class="qed"><span 
class="msam-10">&#x25AB;</span></span>
</p>
   </div>
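<p class="indent">   To get a feel for the penalty term in Theorem 9.2.1, the following sketch evaluates it and inverts it for the sample size. Requiring the penalty to be at most a target value t and solving for m gives m at least 16 (ln 4C + ln 1/&#x03B4;) / t squared. The function names and the numeric inputs are hypothetical, chosen only for illustration.</p>
<pre>
```python
import math


def covering_penalty(cover_size, delta, m):
    """Penalty term 4 * sqrt((ln(4 C) + ln(1 / delta)) / m) from Theorem 9.2.1."""
    return 4.0 * math.sqrt((math.log(4.0 * cover_size) + math.log(1.0 / delta)) / m)


def samples_needed(cover_size, delta, target):
    """Smallest integer m for which the penalty term is at most target."""
    numerator = math.log(4.0 * cover_size) + math.log(1.0 / delta)
    return math.ceil(16.0 * numerator / target ** 2)


# Hypothetical inputs: covering number 1000, confidence 0.95, target penalty 0.1.
m = samples_needed(cover_size=1000, delta=0.05, target=0.1)
print(m, covering_penalty(1000, 0.05, m))
```
</pre>
<p class="indent">   The leading constant 4 becomes a factor of 16 in the sample size, which is why the constants matter in practice.</p>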
<!--l. 3538--><p class="indent">   How tight is this bound when applied to a finite independent hypothesis
space? In the discrete case we can improve the constants by using an argument with fewer
triangle inequalities, which gives the following result: <!--l. 3541--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">
<mrow 
>
                 <msub><mrow 
><mo 
>Pr</mo></mrow><mrow 
><msup><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>m</mi></mrow></msup 
></mrow></msub 
> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>e</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-bin">&#x2212;</mo><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo>&#x0302;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2265;</mo> <mn>2</mn><msqrt><mi 
></mi>
 <mrow><mfrac><mrow 
><mo 
> ln</mo> <!--nolimits--> <mn>4</mn><mo 
class="MathClass-rel">&#x2223;</mo><mi 
>H</mi><mo 
class="MathClass-rel">&#x2223;</mo> <mo 
class="MathClass-bin">+</mo><mo 
> ln</mo> <!--nolimits--> <mfrac> <mrow 
> <mn>1</mn></mrow> 
<mrow 
><mi 
>&#x03B4;</mi></mrow></mfrac></mrow>
      <mrow 
><mi 
>m</mi></mrow></mfrac></mrow></msqrt>       </mrow></mfenced> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B4;</mi>
</mrow></math>
Comparing this with a very loose application of the discrete hypothesis bound <a 
href="thesisse16.xml#x23-32007r3">4.2.3<!--tex4ht:ref: th-adhscb --></a>, we
see that the penalty term in the covering number bound is worse by a factor of <!--l. 3545--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mn>2</mn><msqrt><mi 
></mi>
 <mrow><mn>2</mn></mrow></msqrt></mrow></math>.
Put another way, dividing the number of samples by <!--l. 3546--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mn>8</mn></mrow></math> or increasing the hypothesis
space size to <!--l. 3546--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mo 
class="MathClass-rel">&#x2223;</mo><mi 
>H</mi><msup><mrow 
><mo 
class="MathClass-rel">&#x2223;</mo></mrow><mrow 
><mn>8</mn></mrow></msup 
></mrow></math>
and then applying a sloppy discrete hypothesis bound is about equivalent to
applying a very specialized covering number bound. We seek a covering
number bound which does not divide the effective value of a hypothesis by <!--l. 3549--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mn>8</mn></mrow></math>.
</p>
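<p class="indent">   The comparison can be checked numerically. The sketch below is hedged in two ways: it reads the confidence term of the discrete covering bound as ln 1/&#x03B4;, matching Theorem 9.2.1, and it assumes the loose discrete hypothesis bound has the Hoeffding-style penalty sqrt((ln |H| + ln 1/&#x03B4;) / (2m)), a form not reproduced in this section. Function names and numeric inputs are hypothetical.</p>
<pre>
```python
import math


def discrete_covering_penalty(h_size, delta, m):
    """2 * sqrt((ln(4 |H|) + ln(1 / delta)) / m), the discrete covering penalty."""
    return 2.0 * math.sqrt((math.log(4.0 * h_size) + math.log(1.0 / delta)) / m)


def loose_discrete_penalty(h_size, delta, m):
    """Assumed loose bound: sqrt((ln |H| + ln(1 / delta)) / (2 m))."""
    return math.sqrt((math.log(h_size) + math.log(1.0 / delta)) / (2.0 * m))


# Hypothetical inputs for illustration.
h_size, delta, m = 10 ** 6, 0.05, 8000

# Same sample size for both bounds: the ratio is slightly above 2 * sqrt(2).
ratio = discrete_covering_penalty(h_size, delta, m) / loose_discrete_penalty(h_size, delta, m)

# Giving the loose bound one eighth of the samples removes the 2 * sqrt(2) gap,
# leaving only the effect of the extra factor 4 inside the logarithm.
gap = discrete_covering_penalty(h_size, delta, m) / loose_discrete_penalty(h_size, delta, m // 8)

print(ratio, 2.0 * math.sqrt(2.0), gap)
```
</pre>
<p class="indent">   Under these assumptions the ratio approaches 2 sqrt(2) from above as |H| grows, since the extra ln 4 in the numerator becomes negligible.</p>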
   <a 
  name="tailthesisse40.xml"></a>  
</body> 
</html> 
