<?xml version="1.0"?> 
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "mathml.dtd"> 
<?xml-stylesheet type="text/css" href="thesis.css"?> 
<html  
xmlns="http://www.w3.org/1999/xhtml"  
><head><title>7.1 Earlier Results</title> 
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1" /> 
<meta name="generator" content="TeX4ht (http://www.cis.ohio-state.edu/~gurari/TeX4ht/mn.html)" /> 
<meta name="originator" content="TeX4ht (http://www.cis.ohio-state.edu/~gurari/TeX4ht/mn.html)" /> 
<!-- 3,early_,early^,xhtml,mozilla --> 
<meta name="src" content="thesis.tex" /> 
<meta name="date" content="2002-08-28 13:56:00" /> 
<link rel="stylesheet" type="text/css" href="thesis.css" /> 
</head><body 
>
   <div class="crosslinks"><p class="noindent">[<a 
href="thesisse30.xml" >next</a>] [<a 
href="#tailthesisse29.xml">tail</a>] [<a 
href="thesisch7.xml#thesisse29.xml" >up</a>] </p></div>
   <h3 class="sectionHead"><span class="titlemark">7.1. </span> <a 
  name="x44-650007.1"></a>Earlier Results</h3>
<!--l. 2607--><p class="noindent">The improved averaging bound arises from improving one critical step in the proof of the
original margin bound <span class="cite">[<a 
href="thesisli2.xml#XMargin"><span 
class="ecbx-1000">46</span></a>]</span> which is stated next.
</p>
   <div class="newtheorem">
<!--l. 2610--><p class="noindent"><span class="head">
                                                                     

                                                                     
<a 
  name="x44-65001r1"></a>
  <span 
class="eccc-1000">T<small 
class="small-caps">H</small><small 
class="small-caps">E</small><small 
class="small-caps">O</small><small 
class="small-caps">R</small><small 
class="small-caps">E</small><small 
class="small-caps">M</small> </span>7.1.1<span 
class="eccc-1000">.</span></span>
</p><!--l. 2611--><p class="indent">   <span 
class="ecti-1000">(Margin Bound </span><span class="cite">[<a 
href="thesisli2.xml#XMargin"><span 
class="ecbx-1000">46</span></a>]</span><span 
class="ecti-1000">) For all </span><!--l. 2611--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>&#x03B4;</mi> <mo 
class="MathClass-rel">&#x2208;</mo> <mrow><mo 
class="MathClass-open">(</mo><mrow><mn>0</mn><mo 
class="MathClass-punc">,</mo><mn>1</mn></mrow><mo 
class="MathClass-close">]</mo></mrow></mrow></math><span 
class="ecti-1000">,</span>
<span 
class="ecti-1000">for all base hypothesis spaces, </span><!--l. 2612--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>H</mi></mrow></math><span 
class="ecti-1000">,</span>
<!--l. 2613--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">    <mrow 
>
            <msub><mrow 
><mo 
>Pr</mo></mrow><mrow 
><msup><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>m</mi></mrow></msup 
></mrow></msub 
> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>&#x2203;</mi><mi 
>c</mi><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B8;</mi> <mo 
class="MathClass-rel">&#x2208;</mo> <mrow><mo 
class="MathClass-open">(</mo><mrow><mn>0</mn><mo 
class="MathClass-punc">,</mo><mn>1</mn></mrow><mo 
class="MathClass-close">]</mo></mrow> <mo 
class="MathClass-punc">:</mo>  <mi 
>e</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>c</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x003E;</mo><msub><mrow 
> <mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo>&#x0302;</mo></mover></mrow><mrow 
><mi 
>&#x03B8;</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>c</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-bin">+</mo> <mi 
>O</mi> <mfenced separators="" 
open="("  close=")" ><mrow><msqrt><mi 
></mi>
 <mrow><mfrac><mrow 
> <mfrac> <mrow 
> <mo 
>ln</mo> <!--nolimits--> <mo 
class="MathClass-rel">&#x2223;</mo><mi 
>H</mi><mo 
class="MathClass-rel">&#x2223;</mo></mrow> 
 <mrow 
><msup><mrow 
><mi 
>&#x03B8;</mi></mrow><mrow 
><mn>2</mn></mrow></msup 
></mrow></mfrac>   <mo 
> ln</mo><!--nolimits--><mi 
>m</mi> <mo 
class="MathClass-bin">+</mo><mo 
> ln</mo><!--nolimits--> <mfrac><mrow 
><mn>1</mn></mrow> 
<mrow 
><mi 
>&#x03B4;</mi></mrow></mfrac></mrow>
          <mrow 
><mi 
>m</mi></mrow></mfrac></mrow></msqrt>        </mrow></mfenced></mrow></mfenced> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B4;</mi>
</mrow></math>
</p>
   </div>
   <div class="proof">
<!--l. 2618--><p class="indent">   <span class="head">
   <span 
class="eccc-1000">P<small 
class="small-caps">R</small><small 
class="small-caps">O</small><small 
class="small-caps">O</small><small 
class="small-caps">F</small>.</span> </span>Given in <span class="cite">[<a 
href="thesisli2.xml#XMargin"><span 
class="ecbx-1000">46</span></a>]</span>. A simplification of the improved averaging bound proof.
<span class="qed"><span 
class="msam-10">&#x25AB;</span></span>
</p>
   </div>
<!--l. 2620--><p class="indent">   Here, the notation <!--l. 2620--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>b</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>m</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo> <mi 
>O</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>a</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>m</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math> means
there exists a constant <!--l. 2620--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mi 
>C</mi></mrow></math>
such that <!--l. 2621--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>b</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>m</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>C</mi> <mo 
class="MathClass-punc">&#x22C5;</mo> <mi 
>a</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>m</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
for all <!--l. 2621--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>m</mi></mrow></math>.
This margin bound implies that if most training examples have a large margin <!--l. 2622--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>&#x03B8;</mi></mrow></math> (i.e. <!--l. 2622--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>t</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi><mo 
class="MathClass-punc">,</mo><mi 
>y</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x003E;</mo> <mi 
>&#x03B8;</mi></mrow></math> for most <!--l. 2623--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi><mo 
class="MathClass-punc">,</mo><mi 
>y</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2208;</mo> <mi 
>S</mi></mrow></math>) and
the hypothesis space is not too large, then the generalization error cannot be large. This
theorem can only be non-vacuous when the base hypothesis space is finite.
There are various extensions (see <span class="cite">[<a 
href="thesisli2.xml#XMargin"><span 
class="ecbx-1000">46</span></a>]</span>) of this bound for continuous hypothesis
                                                                     

                                                                     
spaces based upon VC dimension and covering number techniques. However, the
extensions tend to result in extremely loose guarantees and are not relevant to
the discussion here. One of the advantages of the improved averaging bound
is that it <span 
class="ecti-1000">can </span>apply in a non-vacuous way to infinite hypothesis spaces. This
generalization comes about with essentially zero loosening of the underlying
bound.
</p><!--l. 2634--><p class="indent">
                                                                     

                                                                     
</p>
   <div class="crosslinks"><p class="noindent">[<a 
href="thesisse30.xml" >next</a>] [<a 
href="thesisse29.xml" >front</a>] [<a 
href="thesisch7.xml#thesisse29.xml" >up</a>] </p></div><a 
  name="tailthesisse29.xml"></a>  
</body> 
</html> 
