<?xml version="1.0"?> 
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "mathml.dtd"> 
<?xml-stylesheet type="text/css" href="thesis.css"?> 
<html  
xmlns="http://www.w3.org/1999/xhtml"  
><head><title>8.2 Sampling Shell Bound</title> 
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1" /> 
<meta name="generator" content="TeX4ht (http://www.cis.ohio-state.edu/~gurari/TeX4ht/mn.html)" /> 
<meta name="originator" content="TeX4ht (http://www.cis.ohio-state.edu/~gurari/TeX4ht/mn.html)" /> 
<!-- 3,early_,early^,xhtml,mozilla --> 
<meta name="src" content="thesis.tex" /> 
<meta name="date" content="2002-08-28 13:56:00" /> 
<link rel="stylesheet" type="text/css" href="thesis.css" /> 
</head><body 
>
   <div class="crosslinks"><p class="noindent">[<a 
href="thesisse36.xml" >next</a>] [<a 
href="thesisse34.xml" >prev</a>] [<a 
href="thesisse34.xml#tailthesisse34.xml" >prev-tail</a>] [<a 
href="#tailthesisse35.xml">tail</a>] [<a 
href="thesisch8.xml#thesisse35.xml" >up</a>] </p></div>
   <h3 class="sectionHead"><span class="titlemark">8.2. </span> <a 
  name="x51-760008.2"></a>Sampling Shell Bound</h3>
<!--l. 3201--><p class="noindent">The Shell bound relies upon the distribution of empirical errors across the entire hypothesis space.
Calculating <!--l. 3202--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mi 
>c</mi> <mfenced separators="" 
open="("  close=")" ><mrow> <mfrac><mrow 
><mi 
>i</mi></mrow>
<mrow 
><mi 
>m</mi></mrow></mfrac></mrow></mfenced></mrow></math>, while
theoretically possible, is often practically intractable. For example, the space of all binary functions on <!--l. 3204--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>n</mi></mrow></math> features has size <!--l. 3204--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msup><mrow 
><mn>2</mn></mrow><mrow 
><msup><mrow 
><mn>2</mn></mrow><mrow 
><mi 
>n</mi></mrow></msup 
>
   </mrow></msup 
></mrow></math>and for a decision tree with
a number of nodes <!--l. 3205--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mi 
>k</mi> <mo 
class="MathClass-rel">=</mo> <mi 
>O</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>m</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
there are more than <!--l. 3206--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><msup><mrow 
><mn>2</mn></mrow><mrow 
><mi 
>k</mi></mrow></msup 
></mrow></math>
hypotheses with the same number of nodes. In order to avoid this difficulty,
we will use sampling which is made possible by noticing that the Shell bound
does not require exact knowledge of the empirical error distribution. Instead,
we can safely count a hypothesis twice because over-counting monotonically
worsens the bound. Assume that we have an oracle which can be used to sample
uniformly from the set of all hypotheses. Then, we can bin the sampled
hypotheses according to their error rate on the example set. After repeating <!--l. 3212--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>k</mi></mrow></math> times
we will arrive at an empirical distribution over error rates which can be altered into a
bound on the true distribution of error rates by upper bounding the count <!--l. 3215--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>c</mi> <mfenced separators="" 
open="("  close=")" ><mrow> <mfrac><mrow 
><mi 
>i</mi></mrow>
<mrow 
><mi 
>m</mi></mrow></mfrac></mrow></mfenced></mrow></math>. Let <!--l. 3215--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mover 
accent="true"><mrow 
><mi 
>c</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover> <mfenced separators="" 
open="("  close=")" ><mrow> <mfrac><mrow 
><mi 
>i</mi></mrow>
<mrow 
><mi 
>m</mi></mrow></mfrac></mrow></mfenced></mrow></math>
be the observed number of hypotheses out of <!--l. 3216--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>k</mi></mrow></math> uniform choices with
empirical error <!--l. 3217--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
> <mfrac><mrow 
><mi 
>i</mi></mrow>
<mrow 
><mi 
>m</mi></mrow></mfrac></mrow></math>
and define: <!--l. 3218--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">        <mrow 
>
                      <mover 
accent="true"><mrow 
><mi 
>c</mi></mrow><mo 
class="MathClass-op">&#x0304;</mo></mover> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>k</mi><mo 
class="MathClass-punc">,</mo> <mfrac><mrow 
><mover 
accent="true"><mrow 
><mi 
>c</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow> 
<mrow 
><mi 
>k</mi></mrow></mfrac><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B4;</mi><mo 
class="MathClass-punc">,</mo><mo 
class="MathClass-rel">&#x2223;</mo><mi 
>H</mi><mo 
class="MathClass-rel">&#x2223;</mo></mrow></mfenced><mo 
class="MathClass-rel">&#x2261;</mo><mo 
class="MathClass-rel">&#x2223;</mo><mi 
>H</mi><mo 
class="MathClass-rel">&#x2223;</mo><mo 
class="MathClass-bin">&#x2217;</mo><msub><mrow 
><mo 
> max</mo></mrow><mrow 
><mi 
>p</mi></mrow></msub 
> <mfenced separators="" 
open="{"  close="}" ><mrow><mi 
>p</mi> <mo 
class="MathClass-punc">:</mo>  <!--mstyle 
class="text"--><mtext class="textrm">Bin</mtext><!--/mstyle--><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>k</mi><mo 
class="MathClass-punc">,</mo><mover 
accent="true"><mrow 
><mi 
>c</mi></mrow><mo>&#x0302;</mo></mover><mo 
class="MathClass-punc">,</mo><mi 
>p</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo>  <mfrac><mrow 
><mi 
>&#x03B4;</mi></mrow> 
<mrow 
><mi 
>m</mi></mrow></mfrac></mrow></mfenced>
</mrow></math>
                                                                     

                                                                     
Intuitively, <!--l. 3220--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mover 
accent="true"><mrow 
><mi 
>c</mi></mrow><mo 
class="MathClass-op">&#x0304;</mo></mover></mrow></math>
will be a high confidence upper bound on the number of hypotheses with empirical error rate
<!--l. 3221--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">     <mrow 
><mover 
accent="true"><mrow 
><mi 
>c</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow></math>.
Given these quantities, we can calculate an approximate <!--l. 3222--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mover 
accent="true"><mrow 
><mi 
>P</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow></math> according to the
formula: <!--l. 3223--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">        <mrow 
>
            <mover 
accent="true"><mrow 
><mover 
accent="true"><mrow 
><mi 
>P</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2261;</mo> <mn>2</mn><msubsup><mrow 
><mo 
class="MathClass-op">&#x2211;</mo>
  </mrow><mrow 
><mi 
>i</mi><mo 
class="MathClass-rel">=</mo><mn>0</mn></mrow><mrow 
><mi 
>m</mi></mrow></msubsup 
><mover 
accent="true"><mrow 
><mi 
>c</mi></mrow><mo 
class="MathClass-op">&#x0304;</mo></mover> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>k</mi><mo 
class="MathClass-punc">,</mo> <mfrac><mrow 
><mover 
accent="true"><mrow 
><mi 
>c</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover> <mfenced separators="" 
open="("  close=")" ><mrow> <mfrac><mrow 
><mi 
>i</mi></mrow>
<mrow 
><mi 
>m</mi></mrow></mfrac></mrow></mfenced> </mrow> 
  <mrow 
><mi 
>k</mi></mrow></mfrac>  <mo 
class="MathClass-punc">,</mo>  <mfrac><mrow 
><mi 
>&#x03B4;</mi></mrow> 
<mrow 
><mn>2</mn><mi 
>m</mi></mrow></mfrac><mo 
class="MathClass-punc">,</mo><mo 
class="MathClass-rel">&#x2223;</mo><mi 
>H</mi><mo 
class="MathClass-rel">&#x2223;</mo></mrow></mfenced><mi 
>B</mi> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>m</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi><mo 
class="MathClass-punc">,</mo><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0304;</mo></mover> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi><mo 
class="MathClass-punc">,</mo> <mfrac><mrow 
><mi 
>&#x03B4;</mi></mrow> 
<mrow 
><mn>2</mn></mrow></mfrac><mo 
class="MathClass-punc">,</mo> <mfrac><mrow 
><mi 
>i</mi></mrow> 
<mrow 
><mi 
>m</mi></mrow></mfrac></mrow></mfenced></mrow></mfenced>
</mrow></math>
</p>
   <div class="newtheorem">
<!--l. 3227--><p class="noindent"><span class="head">
<a 
  name="x51-76001r1"></a>
  <span 
class="eccc-1000">T<small 
class="small-caps">H</small><small 
class="small-caps">E</small><small 
class="small-caps">O</small><small 
class="small-caps">R</small><small 
class="small-caps">E</small><small 
class="small-caps">M</small> </span>8.2.1<span 
class="eccc-1000">.</span></span>
</p><!--l. 3228--><p class="indent">   <span 
class="ecti-1000">(Sampling Shell Bound) For all </span><!--l. 3228--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>&#x03B4;</mi> <mo 
class="MathClass-rel">&#x003E;</mo> <mn>0</mn></mrow></math><span 
class="ecti-1000">:</span>
<!--l. 3229--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">    <mrow 
>
                       <msub><mrow 
><mo 
>Pr</mo></mrow><mrow 
><msup><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>m</mi></mrow></msup 
></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x2203;</mi><mi 
>h</mi> <mo 
class="MathClass-rel">&#x2208;</mo> <mi 
>H</mi> <mi 
>e</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo>&#x0302;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-bin">+</mo> <mi 
>&#x03B5;</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi><mo 
class="MathClass-punc">,</mo><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo>&#x0302;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B4;</mi>
</mrow></math>
<span 
class="ecti-1000">where </span><!--l. 3231--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>&#x03B5;</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo><mo 
> min</mo> <mfenced separators="" 
open="{"  close="}" ><mrow><mi 
>&#x03B5;</mi> <mo 
class="MathClass-punc">:</mo>  <mover 
accent="true"><mrow 
><mover 
accent="true"><mrow 
><mi 
>P</mi></mrow><mo>&#x0302;</mo></mover></mrow><mo>&#x0302;</mo></mover> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi><mo 
class="MathClass-punc">,</mo> <mfrac><mrow 
><mi 
>&#x03B4;</mi></mrow> 
<mrow 
><mn>2</mn></mrow></mfrac></mrow></mfenced> <mo 
class="MathClass-rel">&#x2264;</mo> <mfrac><mrow 
><mi 
>&#x03B4;</mi></mrow> 
<mrow 
><mn>2</mn><mi 
>m</mi></mrow></mfrac></mrow></mfenced></mrow></math>
</p>
                                                                     

                                                                     
   </div>
   <div class="proof">
<!--l. 3234--><p class="indent">   <span class="head">
   <span 
class="eccc-1000">P<small 
class="small-caps">R</small><small 
class="small-caps">O</small><small 
class="small-caps">O</small><small 
class="small-caps">F</small>.</span> </span>For every <!--l. 3234--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>i</mi></mrow></math>,
we know that <!--l. 3234--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mover 
accent="true"><mrow 
><mover 
accent="true"><mrow 
><mi 
>P</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B4;</mi></mrow></mfenced> <mo 
class="MathClass-rel">&#x2265;</mo><mover 
accent="true"><mrow 
><mi 
>P</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi><mo 
class="MathClass-punc">,</mo> <mfrac><mrow 
><mi 
>&#x03B4;</mi></mrow> 
<mrow 
><mn>2</mn></mrow></mfrac></mrow></mfenced></mrow></math>
with probability at least <!--l. 3235--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mn>1</mn> <mo 
class="MathClass-bin">&#x2212;</mo><mfrac><mrow 
><mi 
>&#x03B4;</mi></mrow> 
<mrow 
><mn>2</mn></mrow></mfrac></mrow></math>.
This implies that <!--l. 3235--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mover 
accent="true"><mrow 
><mover 
accent="true"><mrow 
><mi 
>P</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B4;</mi></mrow></mfenced> <mo 
class="MathClass-rel">&#x2265;</mo> <mi 
>P</mi> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi></mrow></mfenced></mrow></math>
with probability at least <!--l. 3236--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mn>1</mn> <mo 
class="MathClass-bin">&#x2212;</mo> <mi 
>&#x03B4;</mi></mrow></math>.
Applying the union bound with the full knowledge theorem gives us the result. <span class="qed"><span 
class="msam-10">&#x25AB;</span></span>
</p>
   </div>
<!--l. 3239--><p class="indent">   The Sampling Shell bound is still relatively fast to calculate given the sampling results, but it
is worth noting that <span 
class="ecti-1000">many </span>samples are required&#x2014;the bound will typically tighten linearly with <!--l. 3241--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mo 
>ln</mo><!--nolimits--><mi 
>k</mi></mrow></math>. In other words, an
exponentially large <!--l. 3242--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mi 
>k</mi></mrow></math>
is required to realize all of the gains of the shell bound. Thus, the sampling shell bound
will interpolate between the discrete hypothesis bound  <a 
href="thesisse16.xml#x23-32001r1">4.2.1<!--tex4ht:ref: th-DHSCP --></a> and the shell bound as <!--l. 3244--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mo 
>ln</mo><!--nolimits--><mi 
>k</mi></mrow></math> moves
from <!--l. 3245--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mn>1</mn></mrow></math>
to <!--l. 3245--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mo 
>ln</mo><!--nolimits--><mo 
class="MathClass-rel">&#x2223;</mo><mi 
>H</mi><mo 
class="MathClass-rel">&#x2223;</mo></mrow></math>.
</p><!--l. 3248--><p class="indent">
                                                                     

                                                                     
</p>
   <div class="crosslinks"><p class="noindent">[<a 
href="thesisse36.xml" >next</a>] [<a 
href="thesisse34.xml" >prev</a>] [<a 
href="thesisse34.xml#tailthesisse34.xml" >prev-tail</a>] [<a 
href="thesisse35.xml" >front</a>] [<a 
href="thesisch8.xml#thesisse35.xml" >up</a>] </p></div><a 
  name="tailthesisse35.xml"></a>  
</body> 
</html> 
