<?xml version="1.0"?> 
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "mathml.dtd"> 
<?xml-stylesheet type="text/css" href="thesis.css"?> 
<html  
xmlns="http://www.w3.org/1999/xhtml"  
><head><title>8.4 Shell Bounds for Continuous Spaces</title> 
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1" /> 
<meta name="generator" content="TeX4ht (http://www.cis.ohio-state.edu/~gurari/TeX4ht/mn.html)" /> 
<meta name="originator" content="TeX4ht (http://www.cis.ohio-state.edu/~gurari/TeX4ht/mn.html)" /> 
<!-- 3,early_,early^,xhtml,mozilla --> 
<meta name="src" content="thesis.tex" /> 
<meta name="date" content="2002-08-28 13:56:00" /> 
<link rel="stylesheet" type="text/css" href="thesis.css" /> 
</head><body 
>
   <div class="crosslinks"><p class="noindent">[<a 
href="thesisse38.xml" >next</a>] [<a 
href="thesisse36.xml" >prev</a>] [<a 
href="thesisse36.xml#tailthesisse36.xml" >prev-tail</a>] [<a 
href="#tailthesisse37.xml">tail</a>] [<a 
href="thesisch8.xml#thesisse37.xml" >up</a>] </p></div>
   <h3 class="sectionHead"><span class="titlemark">8.4. </span> <a 
  name="x53-780008.4"></a>Shell Bounds for Continuous Spaces</h3>
<!--l. 3323--><p class="noindent">Applying Shell bounds to continuous hypothesis spaces is not easy. In fact, upon first
inspection, this appears to be impossible since shell bounds require knowledge of the
number of hypotheses with a particular empirical error. We can avoid these difficulties,
while introducing a small amount of slack, by always being concerned with the <span 
class="ecti-1000">measure</span>
rather than the count of hypotheses. In particular, we will keep track of the <span 
class="ecti-1000">measure </span>of
hypotheses with a confusingly small error and the <span 
class="ecti-1000">measure </span>of the hypotheses that we
pick. The approach here is similar to the approach in section  <a 
href="thesisch6.xml#x37-570006">6<!--tex4ht:ref: sec-PB --></a> although more
simplistic.
</p><!--l. 3333--><p class="indent">   First assume that there is some measure <!--l. 3333--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>Q</mi></mrow></math> over the hypothesis space <!--l. 3333--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>H</mi></mrow></math>. Suppose that we choose
some subset, <!--l. 3334--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mi 
>U</mi></mrow></math>, of the
hypotheses with measure <!--l. 3335--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mi 
>Q</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>U</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>.
We will be concerned with the <span 
class="ecti-1000">largest </span>empirical error rate <!--l. 3336--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>U</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo><msub><mrow 
><mo 
> max</mo></mrow><mrow 
><mi 
>h</mi><mo 
class="MathClass-rel">&#x2208;</mo><mi 
>U</mi></mrow></msub 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo>&#x0302;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math> and the <span 
class="ecti-1000">average </span>true
error rate, <!--l. 3337--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><msub><mrow 
><mi 
>e</mi></mrow><mrow 
><mi 
>U</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo> <msub><mrow 
><mi 
>E</mi></mrow><mrow 
><msub><mrow 
><mi 
>Q</mi></mrow><mrow 
><mi 
>U</mi></mrow></msub 
></mrow></msub 
><mi 
>e</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>.
A bound on the gap between the largest empirical error rate and the average true error
rate will imply a true error bound for the stochastic classifier which chooses randomly
and evaluates.
</p><!--l. 3341--><p class="indent">   We will need a different definition of <!--l. 3341--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>P</mi></mrow></math>.
Let:
</p><!--l. 3343--><p class="indent">   <!--l. 3343--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">        <mrow 
>
                      <msub><mrow 
><mi 
>P</mi></mrow><mrow 
><mi 
>s</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo> <mi 
>K</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2261;</mo><msub><mrow 
><mo 
class="MathClass-op">&#x222B;</mo>
  </mrow><mrow 
><mi 
>h</mi><mo 
class="MathClass-punc">:</mo><mi 
>e</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-rel">&#x003E;</mo><mi 
>K</mi><mo 
class="MathClass-bin">+</mo><mi 
>&#x03B5;</mi></mrow></msub 
><mi 
>Q</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><!--mstyle 
class="text"--><mtext class="textrm">Bin</mtext><!--/mstyle--><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>K</mi><mo 
class="MathClass-punc">,</mo><mi 
>e</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-punc">,</mo><mi 
>m</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mi 
>d</mi><mi 
>h</mi>
</mrow></math>
We will also need the concept of rounding. Choose <!--l. 3345--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
                                                                     

                                                                     
<mrow 
><mi 
>i</mi></mrow></math> such that <!--l. 3345--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
> <mfrac><mrow 
><mn>1</mn></mrow>
<mrow 
><msup><mrow 
><mi 
>m</mi></mrow><mrow 
><mi 
>i</mi></mrow></msup 
></mrow></mfrac> <mo 
class="MathClass-rel">&#x2265;</mo> <mi 
>Q</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>U</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2265;</mo> <mfrac><mrow 
><mn>1</mn></mrow> 
<mrow 
><msup><mrow 
><mi 
>m</mi></mrow><mrow 
><mi 
>i</mi><mo 
class="MathClass-bin">+</mo><mn>1</mn></mrow></msup 
></mrow></mfrac> </mrow></math> then, define:
<!--l. 3347--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">     <mrow 
>
                                             <mfenced separators="" 
open="\delimiter "4262304 "  close="\delimiter "5263305 " ><mrow><mi 
>Q</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>U</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></mfenced> <mo 
class="MathClass-rel">=</mo>    <mfrac><mrow 
><mn>1</mn></mrow> 
<mrow 
><msup><mrow 
><mi 
>m</mi></mrow><mrow 
><mi 
>i</mi><mo 
class="MathClass-bin">+</mo><mn>1</mn></mrow></msup 
></mrow></mfrac>
</mrow></math>
Now, the following theorem holds:
</p>
   <div class="newtheorem">
<!--l. 3351--><p class="noindent"><span class="head">
<a 
  name="x53-78001r1"></a>
  <span 
class="eccc-1000">T<small 
class="small-caps">H</small><small 
class="small-caps">E</small><small 
class="small-caps">O</small><small 
class="small-caps">R</small><small 
class="small-caps">E</small><small 
class="small-caps">M</small> </span>8.4.1<span 
class="eccc-1000">.</span></span>
</p><!--l. 3352--><p class="indent">   <span 
class="ecti-1000">(Stochastic Full knowledge) For all distributions </span><!--l. 3353--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>Q</mi></mrow></math><span 
class="ecti-1000">,</span>
<span 
class="ecti-1000">For all </span><!--l. 3353--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>&#x03B4;</mi> <mo 
class="MathClass-rel">&#x2208;</mo> <mrow><mo 
class="MathClass-open">(</mo><mrow><mn>0</mn><mo 
class="MathClass-punc">,</mo><mn>1</mn></mrow><mo 
class="MathClass-close">]</mo></mrow></mrow></math><span 
class="ecti-1000">:</span>
<!--l. 3354--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">    <mrow 
>
                        <msub><mrow 
><mo 
>Pr</mo></mrow><mrow 
><msup><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>m</mi></mrow></msup 
></mrow></msub 
> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>&#x2203;</mi><mi 
>U</mi> <msub><mrow 
><mi 
>e</mi></mrow><mrow 
><mi 
>U</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo>&#x0302;</mo></mover></mrow><mrow 
><mi 
>U</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-bin">+</mo>  <mfrac><mrow 
><mn>1</mn></mrow> 
<mrow 
><mi 
>m</mi></mrow></mfrac> <mo 
class="MathClass-bin">+</mo> <mi 
>&#x03B5;</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi><mo 
class="MathClass-punc">,</mo><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo>&#x0302;</mo></mover><mo 
class="MathClass-punc">,</mo><mi 
>U</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></mfenced>
</mrow></math>
<span 
class="ecti-1000">where </span><!--l. 3356--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>&#x03B5;</mi> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>&#x03B4;</mi><mo 
class="MathClass-punc">,</mo> <mfrac><mrow 
><mi 
>K</mi></mrow> 
<mrow 
><mi 
>m</mi></mrow></mfrac><mo 
class="MathClass-punc">,</mo><mi 
>U</mi></mrow></mfenced> <mo 
class="MathClass-rel">=</mo><mo 
> min</mo> <mfenced separators="" 
open="{"  close="}" ><mrow><mi 
>&#x03B5;</mi> <mo 
class="MathClass-punc">:</mo>  <msub><mrow 
><mi 
>P</mi></mrow><mrow 
><mi 
>s</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mfrac><mrow 
><mi 
>&#x03B4;</mi><mfenced separators="" 
open="\delimiter "4262304 "  close="\delimiter "5263305 " ><mrow><mi 
>Q</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>U</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></mfenced></mrow> 
   <mrow 
><msup><mrow 
><mi 
>m</mi></mrow><mrow 
><mn>3</mn></mrow></msup 
></mrow></mfrac>    </mrow></mfenced></mrow></math>
</p>
   </div>
                                                                     

                                                                     
   <div class="proof">
<!--l. 3359--><p class="indent">   <span class="head">
   <span 
class="eccc-1000">P<small 
class="small-caps">R</small><small 
class="small-caps">O</small><small 
class="small-caps">O</small><small 
class="small-caps">F</small>.</span> </span>Call a hypothesis with a large true error (<!--l. 3359--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>e</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x003E;</mo> <mfrac><mrow 
><mi 
>K</mi></mrow> 
<mrow 
><mi 
>m</mi></mrow></mfrac> <mo 
class="MathClass-bin">+</mo> <mi 
>&#x03B5;</mi></mrow></math>)
and small empirical error (<!--l. 3360--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mfrac><mrow 
><mi 
>K</mi></mrow> 
<mrow 
><mi 
>m</mi></mrow></mfrac></mrow></math>)
a &#x201C;bad&#x201D; hypothesis. <!--l. 3361--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><msub><mrow 
><mi 
>P</mi></mrow><mrow 
><mi 
>s</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
is the expected measure of the bad hypotheses. We will use Markov&#x2019;s inequality
to bound the actual measure of bad hypotheses. Then, given that the quantity is
bounded, we can bound the expected true error by assuming that we included every
bad hypothesis in the set <!--l. 3364--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>U</mi></mrow></math>.
</p><!--l. 3366--><p class="indent">   Let <!--l. 3367--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">      <mrow 
>
                                 <msub><mrow 
><mi 
>&#x03B5;</mi></mrow><mrow 
><mi 
>K</mi><mi 
>i</mi></mrow></msub 
> <mo 
class="MathClass-rel">=</mo><mo 
> min</mo> <mfenced separators="" 
open="{"  close="}" ><mrow><mi 
>&#x03B5;</mi> <mo 
class="MathClass-punc">:</mo>  <msub><mrow 
><mi 
>P</mi></mrow><mrow 
><mi 
>s</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo>  <mfrac><mrow 
><mi 
>&#x03B4;</mi></mrow> 
<mrow 
><msup><mrow 
><mi 
>m</mi></mrow><mrow 
><mn>3</mn><mo 
class="MathClass-bin">+</mo><mi 
>i</mi></mrow></msup 
></mrow></mfrac></mrow></mfenced>
</mrow></math>
Intuitively, <!--l. 3369--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><msub><mrow 
><mi 
>&#x03B5;</mi></mrow><mrow 
><mi 
>K</mi><mi 
>i</mi></mrow></msub 
></mrow></math>
is the value we will use when <!--l. 3369--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover> <mo 
class="MathClass-rel">=</mo> <mfrac><mrow 
><mi 
>K</mi></mrow> 
<mrow 
><mi 
>m</mi></mrow></mfrac></mrow></math>
and <!--l. 3370--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mfenced separators="" 
open="\delimiter "4262304 "  close="\delimiter "5263305 " ><mrow><mi 
>Q</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></mfenced> <mo 
class="MathClass-rel">=</mo>  <mfrac><mrow 
><mn>1</mn></mrow> 
<mrow 
><msup><mrow 
><mi 
>m</mi></mrow><mrow 
><mi 
>i</mi></mrow></msup 
></mrow></mfrac></mrow></math>.
Also let <!--l. 3371--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">      <mrow 
><msub><mrow 
>
                            <mover 
accent="true"><mrow 
><mi 
>P</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>s</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo><msub><mrow 
> <mo 
class="MathClass-op">&#x222B;</mo>
  </mrow><mrow 
><mi 
>h</mi><mo 
class="MathClass-punc">:</mo><mi 
>e</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-rel">&#x003E;</mo><mi 
>K</mi><mo 
class="MathClass-bin">+</mo><mi 
>&#x03B5;</mi><mo 
class="MathClass-bin">&#x2227;</mo><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-rel">&#x2264;</mo><mi 
>K</mi></mrow></msub 
><mi 
>p</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mi 
>d</mi><mi 
>h</mi>
</mrow></math>
                                                                     

                                                                     
be  the  actual  measure  of  bad  hypotheses.  Then,  Markov&#x2019;s  inequality  tells  us:  <!--l. 3374--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">
<mrow 
>
              <mi 
>&#x2200;</mi><mi 
>&#x03B4;</mi> <mo 
class="MathClass-rel">&#x003E;</mo> <mn>0</mn> <msub><mrow 
><mo 
>Pr</mo></mrow><mrow 
><mi 
>D</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>P</mi></mrow><mo>&#x0302;</mo></mover></mrow><mrow 
><mi 
>s</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mi 
>&#x03B5;</mi></mrow><mrow 
><mi 
>K</mi><mi 
>i</mi></mrow></msub 
><mo 
class="MathClass-punc">,</mo><mi 
>K</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2265;</mo> <mfrac><mrow 
><msup><mrow 
><mi 
>m</mi></mrow><mrow 
><mn>2</mn></mrow></msup 
></mrow> 
  <mrow 
><mi 
>&#x03B4;</mi></mrow></mfrac> <msub><mrow 
><mi 
>P</mi></mrow><mrow 
><mi 
>s</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mi 
>&#x03B5;</mi></mrow><mrow 
><mi 
>K</mi><mi 
>i</mi></mrow></msub 
><mo 
class="MathClass-punc">,</mo><mi 
>K</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mfrac><mrow 
><mi 
>&#x03B4;</mi></mrow> 
<mrow 
><msup><mrow 
><mi 
>m</mi></mrow><mrow 
><mn>2</mn></mrow></msup 
></mrow></mfrac>
</mrow></math>
Taking the union bound over all values of <!--l. 3376--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><msub><mrow 
><mi 
>&#x03B5;</mi></mrow><mrow 
><mi 
>K</mi><mi 
>i</mi></mrow></msub 
></mrow></math>,
we get: <!--l. 3377--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">      <mrow 
>
                   <mi 
>&#x2200;</mi><mi 
>&#x03B4;</mi> <mo 
class="MathClass-rel">&#x003E;</mo> <mn>0</mn> <msub><mrow 
><mo 
>Pr</mo></mrow><mrow 
><mi 
>D</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x2200;</mi><mi 
>K</mi><mo 
class="MathClass-punc">,</mo><mi 
>i</mi><msub><mrow 
> <mover 
accent="true"><mrow 
><mi 
>P</mi></mrow><mo>&#x0302;</mo></mover></mrow><mrow 
><mi 
>s</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mi 
>&#x03B5;</mi></mrow><mrow 
><mi 
>K</mi><mi 
>i</mi></mrow></msub 
><mo 
class="MathClass-punc">,</mo><mi 
>K</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2265;</mo> <mfrac><mrow 
><msup><mrow 
><mi 
>m</mi></mrow><mrow 
><mn>2</mn></mrow></msup 
></mrow> 
  <mrow 
><mi 
>&#x03B4;</mi></mrow></mfrac> <msub><mrow 
><mi 
>P</mi></mrow><mrow 
><mi 
>s</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mi 
>&#x03B5;</mi></mrow><mrow 
><mi 
>K</mi><mi 
>i</mi></mrow></msub 
><mo 
class="MathClass-punc">,</mo><mi 
>K</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B4;</mi>
</mrow></math>
So, with probability <!--l. 3379--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mn>1</mn> <mo 
class="MathClass-bin">&#x2212;</mo> <mi 
>&#x03B4;</mi></mrow></math>
for all values of <!--l. 3379--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>K</mi></mrow></math>
and <!--l. 3379--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>i</mi></mrow></math>,
we have: <!--l. 3380--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>P</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>s</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mi 
>&#x03B5;</mi></mrow><mrow 
><mi 
>K</mi><mi 
>i</mi></mrow></msub 
><mo 
class="MathClass-punc">,</mo><mi 
>K</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mfrac><mrow 
><mn>1</mn></mrow> 
<mrow 
><msup><mrow 
><mi 
>m</mi></mrow><mrow 
><mn>1</mn><mo 
class="MathClass-bin">+</mo><mi 
>i</mi></mrow></msup 
></mrow></mfrac></mrow></math>.
Therefore, if <!--l. 3381--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>Q</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>U</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2265;</mo> <mfrac><mrow 
><mn>1</mn></mrow> 
<mrow 
><msup><mrow 
><mi 
>m</mi></mrow><mrow 
><mi 
>i</mi></mrow></msup 
></mrow></mfrac></mrow></math>,
we know that <!--l. 3381--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
>    <mfrac><mrow 
><mi 
>Q</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>U</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow>
<mrow 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>P</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>s</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mi 
>&#x03B5;</mi></mrow><mrow 
><mi 
>K</mi><mi 
>i</mi></mrow></msub 
><mo 
class="MathClass-punc">,</mo><mi 
>K</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></mfrac> <mo 
class="MathClass-rel">&#x2265;</mo> <mi 
>m</mi></mrow></math>.
Assuming that all of the bad hypotheses have true error <!--l. 3382--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mn>1</mn></mrow></math>
and all the rest have true error at most <!--l. 3383--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mfrac><mrow 
><mi 
>K</mi></mrow>
<mrow 
><mi 
>m</mi></mrow></mfrac> <mo 
class="MathClass-bin">+</mo> <msub><mrow 
><mi 
>&#x03B5;</mi></mrow><mrow 
><mi 
>K</mi><mi 
>i</mi></mrow></msub 
></mrow></math>,
                                                                     

                                                                     
we get the following true error bound: <!--l. 3385--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">      <mrow 
>
                                     <msub><mrow 
><mi 
>e</mi></mrow><mrow 
><mi 
>U</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>U</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-bin">+</mo>  <mfrac><mrow 
><mn>1</mn></mrow> 
<mrow 
><mi 
>m</mi></mrow></mfrac> <mo 
class="MathClass-bin">+</mo> <msub><mrow 
><mi 
>&#x03B5;</mi></mrow><mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover><mi 
>i</mi></mrow></msub 
>
</mrow></math>
   <span class="qed"><span 
class="msam-10">&#x25AB;</span></span>
</p>
   </div>
<!--l. 3389--><p class="indent">   The stochastic full knowledge bound is loose when applied to the full knowledge setting by <!--l. 3390--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>&#x03B4;</mi> <mo 
class="MathClass-rel">&#x2192;</mo> <mfrac><mrow 
><mi 
>&#x03B4;</mi></mrow> 
<mrow 
><msup><mrow 
><mi 
>m</mi></mrow><mrow 
><mn>2</mn></mrow></msup 
></mrow></mfrac> </mrow></math>. Typically, this results
in only a <!--l. 3391--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mfrac><mrow 
><mn>2</mn><mo 
> ln</mo><!--nolimits--> <mi 
>m</mi></mrow>
  <mrow 
><mi 
>m</mi></mrow></mfrac>  </mrow></math> increase
in the size of <!--l. 3391--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>&#x03B5;</mi></mrow></math>,
although the increase can sometimes be much larger near phase transitions. These factors of <!--l. 3393--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mfrac><mrow 
><mo 
>ln</mo><!--nolimits--><mi 
>m</mi></mrow>
 <mrow 
><mi 
>m</mi></mrow></mfrac> </mrow></math> may
be removable with improved argumentation. Naturally, the stochastic full knowledge
bound can be used to prove a stochastic observable bound.
</p><!--l. 3397--><p class="indent">   The next theorem is the observable analog of the stochastic full knowledge bound.
Here, we eliminate the unobservable quantities to produce a stochastic observable shell
bound. The observable quantity we will use is:
</p><!--l. 3402--><p class="indent">   <!--l. 3402--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">        <mrow 
><msub><mrow 
>
                    <mover 
accent="true"><mrow 
><mi 
>P</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>s</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2261;</mo> <mn>2</mn><msub><mrow 
><mo 
class="MathClass-op">&#x2211;</mo>
   </mrow><mrow 
><mi 
>h</mi></mrow></msub 
><mi 
>Q</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><!--mstyle 
class="text"--><mtext class="textrm">Bin</mtext><!--/mstyle--><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>K</mi><mo 
class="MathClass-punc">,</mo><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0304;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B4;</mi><mo 
class="MathClass-punc">,</mo><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-punc">,</mo><mi 
>m</mi></mrow><mo 
class="MathClass-close">)</mo></mrow>
</mrow></math>
                                                                     

                                                                     
where <!--l. 3405--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">        <mrow 
>
               <mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0304;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B4;</mi><mo 
class="MathClass-punc">,</mo> <mfrac><mrow 
><mi 
>i</mi></mrow> 
<mrow 
><mi 
>m</mi></mrow></mfrac></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo> <!--mstyle 
class="text"--><mtext class="textrm">max&#x000A0;</mtext><!--/mstyle--> <mfenced separators="" 
open="{"  close="}" ><mrow><mi 
>K</mi> <mo 
class="MathClass-bin">+</mo> <mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo> <!--mstyle 
class="text"--><mtext class="textrm">min</mtext><!--/mstyle--> <mfenced separators="" 
open="{"  close="}" ><mrow><mi 
>p</mi> <mo 
class="MathClass-punc">:</mo>  <mn>1</mn> <mo 
class="MathClass-bin">&#x2212;</mo><!--mstyle 
class="text"--><mtext class="textrm">Bin</mtext><!--/mstyle--><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>m</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi><mo 
class="MathClass-punc">,</mo><mi 
>p</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2265;</mo> <mi 
>&#x03B4;</mi></mrow></mfenced></mrow></mfenced>
</mrow></math>
is a slightly pessimal estimate of the true error rate given the empirical error rate as
before.
</p>
   <div class="newtheorem">
<!--l. 3410--><p class="noindent"><span class="head">
<a 
  name="x53-78002r2"></a>
  <span 
class="eccc-1000">T<small 
class="small-caps">H</small><small 
class="small-caps">E</small><small 
class="small-caps">O</small><small 
class="small-caps">R</small><small 
class="small-caps">E</small><small 
class="small-caps">M</small> </span>8.4.2<span 
class="eccc-1000">.</span></span>
</p><!--l. 3411--><p class="indent">   <span 
class="ecti-1000">(Stochastic Observable Shell Bound) For all distributions </span><!--l. 3411--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>Q</mi></mrow></math><span 
class="ecti-1000">,</span>
<span 
class="ecti-1000">For all </span><!--l. 3412--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>&#x03B4;</mi> <mo 
class="MathClass-rel">&#x2208;</mo> <mrow><mo 
class="MathClass-open">(</mo><mrow><mn>0</mn><mo 
class="MathClass-punc">,</mo><mn>1</mn></mrow><mo 
class="MathClass-close">]</mo></mrow></mrow></math><span 
class="ecti-1000">:</span>
<!--l. 3413--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">    <mrow 
>
                        <msub><mrow 
><mo 
>Pr</mo></mrow><mrow 
><msup><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>m</mi></mrow></msup 
></mrow></msub 
> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>&#x2203;</mi><mi 
>U</mi> <msub><mrow 
><mi 
>e</mi></mrow><mrow 
><mi 
>U</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo>&#x0302;</mo></mover></mrow><mrow 
><mi 
>U</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-bin">+</mo>  <mfrac><mrow 
><mn>1</mn></mrow> 
<mrow 
><mi 
>m</mi></mrow></mfrac> <mo 
class="MathClass-bin">+</mo> <mi 
>&#x03B5;</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi><mo 
class="MathClass-punc">,</mo><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo>&#x0302;</mo></mover><mo 
class="MathClass-punc">,</mo><mi 
>U</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></mfenced>
</mrow></math>
<span 
class="ecti-1000">where </span><!--l. 3415--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>&#x03B5;</mi> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>&#x03B4;</mi><mo 
class="MathClass-punc">,</mo> <mfrac><mrow 
><mi 
>K</mi></mrow> 
<mrow 
><mi 
>m</mi></mrow></mfrac><mo 
class="MathClass-punc">,</mo><mi 
>U</mi></mrow></mfenced> <mo 
class="MathClass-rel">=</mo><mo 
> min</mo> <mfenced separators="" 
open="{"  close="}" ><mrow><mi 
>&#x03B5;</mi> <mo 
class="MathClass-punc">:</mo><msub><mrow 
>  <mover 
accent="true"><mrow 
><mi 
>P</mi></mrow><mo>&#x0302;</mo></mover></mrow><mrow 
><mi 
>s</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi><mo 
class="MathClass-punc">,</mo> <mfrac><mrow 
><mi 
>&#x03B4;</mi></mrow> 
<mrow 
><mn>2</mn></mrow></mfrac></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mfrac><mrow 
><mi 
>&#x03B4;</mi><mfenced separators="" 
open="\delimiter "4262304 "  close="\delimiter "5263305 " ><mrow><mi 
>Q</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>U</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></mfenced></mrow> 
  <mrow 
><mn>2</mn><msup><mrow 
><mi 
>m</mi></mrow><mrow 
><mn>3</mn></mrow></msup 
></mrow></mfrac>    </mrow></mfenced></mrow></math>
</p>
   </div>
   <div class="proof">
<!--l. 3418--><p class="indent">   <span class="head">
   <span 
class="eccc-1000">P<small 
class="small-caps">R</small><small 
class="small-caps">O</small><small 
class="small-caps">O</small><small 
class="small-caps">F</small>.</span> </span>First note that the unobservable bound lemma  <a 
href="thesisse34.xml#x50-75002r3">8.1.3<!--tex4ht:ref: unobservable bound --></a> implies that with
probability <!--l. 3419--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mn>1</mn> <mo 
class="MathClass-bin">&#x2212;</mo><mfrac><mrow 
><mi 
>&#x03B4;</mi></mrow> 
<mrow 
><mn>2</mn></mrow></mfrac></mrow></math>,
we have <!--l. 3419--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>P</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>s</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi><mo 
class="MathClass-punc">,</mo> <mfrac><mrow 
><mi 
>&#x03B4;</mi></mrow> 
<mrow 
><mn>2</mn></mrow></mfrac></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2265;</mo> <msub><mrow 
><mi 
>P</mi></mrow><mrow 
><mi 
>s</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>.
                                                                     

                                                                     
Given that this is the case, our choice of <!--l. 3420--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>&#x03B5;</mi></mrow></math>
will be at least as pessimistic as the choice defined by the Stochastic Full Knowledge
bound  <a 
href="#x53-78001r1">8.4.1<!--tex4ht:ref: Stochastic Full Knowledge --></a>. We thus have two sources of failure: the unobservable bound lemma
will fail with probability at most <!--l. 3423--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mfrac><mrow 
><mi 
>&#x03B4;</mi></mrow>
<mrow 
><mn>2</mn></mrow></mfrac></mrow></math>
and the stochastic full knowledge bound will fail with probability <!--l. 3424--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mfrac><mrow 
><mi 
>&#x03B4;</mi></mrow>
<mrow 
><mn>2</mn></mrow></mfrac></mrow></math>.
The union bound then implies that the Stochastic Observable Shell Bound holds
with probability <!--l. 3426--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mn>1</mn> <mo 
class="MathClass-bin">&#x2212;</mo><mfrac><mrow 
><mi 
>&#x03B4;</mi></mrow> 
<mrow 
><mn>2</mn></mrow></mfrac></mrow></math>.
<span class="qed"><span 
class="msam-10">&#x25AB;</span></span>
</p>
   </div>
<!--l. 3428--><p class="indent">   The information requirements for the continuous space shell bound are
even more severe than the requirements for the discrete space shell bound. In
particular, we need to know the exact measure of the hypotheses with any
particular empirical error. Clearly, this is unrealistic. We can relax this requirement
to observations computable in a finite amount of time using the sampling
techniques of theorem  <a 
href="thesisse35.xml#x51-76001r1">8.2.1<!--tex4ht:ref: th-sample_shell --></a>. In particular, given a well-behaved distribution <!--l. 3433--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>Q</mi></mrow></math>, we
can make a bounded estimate of the measure of hypotheses with some empirical error
rate. Given this bounded estimate, we can then calculate a pessimistic shell bound for
the continuous space.
</p><!--l. 3439--><p class="indent">
                                                                     

                                                                     
</p>
   <div class="crosslinks"><p class="noindent">[<a 
href="thesisse38.xml" >next</a>] [<a 
href="thesisse36.xml" >prev</a>] [<a 
href="thesisse36.xml#tailthesisse36.xml" >prev-tail</a>] [<a 
href="thesisse37.xml" >front</a>] [<a 
href="thesisch8.xml#thesisse37.xml" >up</a>] </p></div><a 
  name="tailthesisse37.xml"></a>  
</body> 
</html> 
