<?xml version="1.0"?> 
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "mathml.dtd"> 
<?xml-stylesheet type="text/css" href="thesis.css"?> 
<html  
xmlns="http://www.w3.org/1999/xhtml"  
><head><title>13.3 Conclusion</title> 
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1" /> 
<meta name="generator" content="TeX4ht (http://www.cis.ohio-state.edu/~gurari/TeX4ht/mn.html)" /> 
<meta name="originator" content="TeX4ht (http://www.cis.ohio-state.edu/~gurari/TeX4ht/mn.html)" /> 
<!-- 3,early_,early^,xhtml,mozilla --> 
<meta name="src" content="thesis.tex" /> 
<meta name="date" content="2002-08-28 13:56:00" /> 
<link rel="stylesheet" type="text/css" href="thesis.css" /> 
</head><body 
>
   <div class="crosslinks"><p class="noindent">[<a 
href="thesisse58.xml" >prev</a>] [<a 
href="thesisse58.xml#tailthesisse58.xml" >prev-tail</a>] [<a 
href="#tailthesisse59.xml">tail</a>] [<a 
href="thesisch13.xml#thesisse59.xml" >up</a>] </p></div>
   <h3 class="sectionHead"><span class="titlemark">13.3. </span> <a 
  name="x81-12400013.3"></a>Conclusion</h3>
<!--l. 5268--><p class="noindent">PAC-Bayes bounds give excellent results on a stochastic neural
network. The stochastic neural network bound is radically tighter (<!--l. 5269--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mn>2</mn> <mo 
class="MathClass-bin">&#x2212;</mo> <mn>3</mn></mrow></math> orders
of magnitude) bound on the true error rate of a classifier while increasing the empirical
and true error rates only a small amount.
</p><!--l. 5273--><p class="indent">   Although, the stochastic neural net bound is not completely tight, it is not vacuous with
just <!--l. 5274--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mn>1</mn><mn>0</mn><mn>0</mn></mrow></math>
examples and the minima of the bound weakly predicts the point where overtraining
occurs.
</p>
   <div class="newtheorem">
<!--l. 5277--><p class="noindent"><span class="head">
<a 
  name="x81-124001r1"></a>
  <span 
class="eccc-1000">P<small 
class="small-caps">R</small><small 
class="small-caps">O</small><small 
class="small-caps">B</small><small 
class="small-caps">L</small><small 
class="small-caps">E</small><small 
class="small-caps">M</small> </span>13.3.1<span 
class="eccc-1000">.</span></span>
</p><!--l. 5278--><p class="indent">   (Open) To what extent do these results extend to other learning problems and
other continuous learning algorithms? Work is under-way (most notably in <span class="cite">[<a 
href="thesisli2.xml#XSeeger"><span 
class="ecbx-1000">48</span></a>]</span>) to
evaluate both of these questions.
</p>
   </div>
<!--l. 5283--><p class="indent">
                                                                     

                                                                     
</p>
   <div class="crosslinks"><p class="noindent">[<a 
href="thesisse58.xml" >prev</a>] [<a 
href="thesisse58.xml#tailthesisse58.xml" >prev-tail</a>] [<a 
href="thesisse59.xml" >front</a>] [<a 
href="thesisch13.xml#thesisse59.xml" >up</a>] </p></div><a 
  name="tailthesisse59.xml"></a>  
</body> 
</html> 
