<?xml version="1.0"?> 
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "mathml.dtd"> 
<?xml-stylesheet type="text/css" href="thesis.css"?> 
<html  
xmlns="http://www.w3.org/1999/xhtml"  
><head><title>8.1 The Discrete Shell Bound</title> 
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1" /> 
<meta name="generator" content="TeX4ht (http://www.cis.ohio-state.edu/~gurari/TeX4ht/mn.html)" /> 
<meta name="originator" content="TeX4ht (http://www.cis.ohio-state.edu/~gurari/TeX4ht/mn.html)" /> 
<!-- 3,early_,early^,xhtml,mozilla --> 
<meta name="src" content="thesis.tex" /> 
<meta name="date" content="2002-08-28 13:56:00" /> 
<link rel="stylesheet" type="text/css" href="thesis.css" /> 
</head><body 
>
   <div class="crosslinks"><p class="noindent">[<a 
href="thesisse35.xml" >next</a>] [<a 
href="#tailthesisse34.xml">tail</a>] [<a 
href="thesisch8.xml#thesisse34.xml" >up</a>] </p></div>
   <h3 class="sectionHead"><span class="titlemark">8.1. </span> <a 
  name="x50-730008.1"></a>The Discrete Shell Bound</h3>
   <h4 class="subsectionHead"><span class="titlemark">8.1.1. </span> <a 
  name="x50-740008.1.1"></a>Knowledge of learning distribution</h4>
<!--l. 3043--><p class="noindent">Let <!--l. 3043--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>B</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>m</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi><mo 
class="MathClass-punc">,</mo><mi 
>p</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo><mfenced separators="" 
open="(" close=")"><mfrac linethickness="0.0pt"><mrow> <mi 
>m</mi></mrow> 
<mrow><mi 
>K</mi></mrow></mfrac></mfenced> <mi 
>e</mi><msup><mrow 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mrow 
><mi 
>K</mi></mrow></msup 
><msup><mrow 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mn>1</mn> <mo 
class="MathClass-bin">&#x2212;</mo> <mi 
>e</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mrow 
><mi 
>m</mi><mo 
class="MathClass-bin">&#x2212;</mo><mi 
>K</mi></mrow></msup 
></mrow></math>
be the probability that hypothesis with true error <!--l. 3044--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>p</mi></mrow></math> produces <!--l. 3044--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>K</mi></mrow></math> errors on
<!--l. 3044--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">     <mrow 
><mi 
>m</mi></mrow></math>
independent examples.
</p><!--l. 3047--><p class="indent">   The discrete shell bound works directly with the probability
that there will be a confusingly small empirical error. Let <!--l. 3049--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">
                                                                     

                                                                     
<mrow 
>
                  <mi 
>P</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2261;</mo><msub><mrow 
><mo 
class="MathClass-op">&#x2211;</mo>
   </mrow><mrow 
><mi 
>h</mi><mo 
class="MathClass-punc">:</mo><mi 
>e</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-rel">&#x003E;</mo><mi 
>K</mi><mo 
class="MathClass-bin">+</mo><mi 
>&#x03B5;</mi></mrow></msub 
><mi 
>B</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>m</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi><mo 
class="MathClass-punc">,</mo><mi 
>e</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow>
</mrow></math>
</p><!--l. 3053--><p class="indent">   Intuitively, <!--l. 3053--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mi 
>P</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
is a bound on the probability that a hypotheses with a true error rate larger than <!--l. 3054--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mfrac><mrow 
><mi 
>K</mi></mrow>
<mrow 
><mi 
>m</mi></mrow></mfrac> <mo 
class="MathClass-bin">+</mo> <mi 
>&#x03B5;</mi></mrow></math> will have an empirical
error rate of <!--l. 3055--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mfrac><mrow 
><mi 
>K</mi></mrow>
<mrow 
><mi 
>m</mi></mrow></mfrac></mrow></math>.
The contribution to the sum will fall off exponentially as the true error, <!--l. 3056--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>e</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>,
increases.
</p><!--l. 3058--><p class="indent">   Our first step is stating a shell bound which requires unknown information. The
purpose of this bound is motivational - it provides incite into why we can expect a large
improvement. Later, we will remove the unknown information requirements and recover a
useful bound.
</p>
   <div class="newtheorem">
<!--l. 3063--><p class="noindent"><span class="head">
<a 
  name="x50-74001r1"></a>
  <span 
class="eccc-1000">T<small 
class="small-caps">H</small><small 
class="small-caps">E</small><small 
class="small-caps">O</small><small 
class="small-caps">R</small><small 
class="small-caps">E</small><small 
class="small-caps">M</small> </span>8.1.1<span 
class="eccc-1000">.</span></span>
</p><!--l. 3064--><p class="indent">   <span 
class="ecti-1000">(Full knowledge theorem) For all </span><!--l. 3064--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>&#x03B4;</mi> <mo 
class="MathClass-rel">&#x2208;</mo> <mrow><mo 
class="MathClass-open">(</mo><mrow><mn>0</mn><mo 
class="MathClass-punc">,</mo><mn>1</mn></mrow><mo 
class="MathClass-close">]</mo></mrow></mrow></math><span 
class="ecti-1000">:</span>
<!--l. 3065--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">    <mrow 
>
                       <msub><mrow 
><mo 
>Pr</mo></mrow><mrow 
><msup><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>m</mi></mrow></msup 
></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x2203;</mi><mi 
>h</mi> <mo 
class="MathClass-rel">&#x2208;</mo> <mi 
>H</mi> <mi 
>e</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2265;</mo><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo>&#x0302;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-bin">+</mo> <mi 
>&#x03B5;</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi><mo 
class="MathClass-punc">,</mo><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo>&#x0302;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B4;</mi>
</mrow></math>
<span 
class="ecti-1000">where </span><!--l. 3067--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>&#x03B5;</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi><mo 
class="MathClass-punc">,</mo> <mfrac><mrow 
><mi 
>K</mi></mrow> 
<mrow 
><mi 
>m</mi></mrow></mfrac></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo><mo 
> min</mo> <mfenced separators="" 
open="{"  close="}" ><mrow><mi 
>&#x03B5;</mi> <mo 
class="MathClass-punc">:</mo>  <mi 
>P</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo>  <mfrac><mrow 
><mi 
>&#x03B4;</mi></mrow> 
<mrow 
><mi 
>m</mi></mrow></mfrac></mrow></mfenced></mrow></math>
</p>
   </div>
                                                                     

                                                                     
<!--l. 3069--><p class="indent">   The full knowledge theorem relies on unobservable information&#x2014;the true error rates
of all hypotheses. This theorem is not (quite) trivial because it does not rely upon
information about <span 
class="ecti-1000">which </span>hypothesis has a particular true error.
</p>
   <div class="proof">
<!--l. 3075--><p class="indent">   <span class="head">
   <span 
class="eccc-1000">P<small 
class="small-caps">R</small><small 
class="small-caps">O</small><small 
class="small-caps">O</small><small 
class="small-caps">F</small>.</span> </span>For every hypothesis with a true error rate of <!--l. 3076--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">      <mrow 
>
                                         <mi 
>e</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x003E;</mo> <mfrac><mrow 
><mi 
>K</mi></mrow> 
<mrow 
><mi 
>m</mi></mrow></mfrac> <mo 
class="MathClass-bin">+</mo> <mi 
>&#x03B5;</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi></mrow><mo 
class="MathClass-close">)</mo></mrow>
</mrow></math>
the probability of producing an empirical error of <!--l. 3079--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">      <mrow 
>
                                               <mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo> <mfrac><mrow 
><mi 
>K</mi></mrow> 
<mrow 
><mi 
>m</mi></mrow></mfrac>
</mrow></math>
is <!--l. 3081--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>B</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>K</mi><mo 
class="MathClass-punc">,</mo><mi 
>e</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-punc">,</mo><mi 
>m</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>.
Applying the union bound over all hypotheses and all <!--l. 3082--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>m</mi></mrow></math>
possible nontrivial values of <!--l. 3082--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>K</mi></mrow></math>
completes the proof. <span class="qed"><span 
class="msam-10">&#x25AB;</span></span>
</p>
   </div>
                                                                     

                                                                     
<!--l. 3084--><p class="indent">   There are a couple things to note about this theorem. First, for most balanced
machine learning problems most hypotheses typically have a true error rate near to <!--l. 3086--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mn>0</mn><mo 
class="MathClass-punc">.</mo><mn>5</mn></mrow></math>.
Given this, and noticing that Binomial tails fall of exponentially, dramatic improvements
in the bound are feasible.
</p><!--l. 3089--><p class="indent">   Second, we must use <!--l. 3089--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
> <mfrac><mrow 
><mi 
>&#x03B4;</mi></mrow>
<mrow 
><mi 
>m</mi></mrow></mfrac></mrow></math>
rather than <!--l. 3089--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>&#x03B4;</mi></mrow></math>
in order to make the proof work. It is possible that theorem holds without splitting <!--l. 3091--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>&#x03B4;</mi></mrow></math> &#x201C;<!--l. 3091--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>m</mi></mrow></math>-ways&#x201D;. Removing
the factor of <!--l. 3091--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>m</mi></mrow></math>
is an open problem. For the special case of empirical risk minimization algorithms, McDiarmid&#x2019;s
inequality <span class="cite">[<a 
href="thesisli2.xml#XMcD"><span 
class="ecbx-1000">40</span></a>]</span> implies that the range of hypotheses with minimum empirical error is of size <!--l. 3094--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>O</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><msqrt><mi 
></mi><mrow><mi 
>m</mi></mrow></msqrt></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
with high probability. Therefore, we need only apply the union bound to <!--l. 3095--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>O</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><msqrt><mi 
></mi><mrow><mi 
>m</mi></mrow></msqrt></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
possible minimum empirical error rates.
</p>
   <h4 class="subsectionHead"><span class="titlemark">8.1.2. </span> <a 
  name="x50-750008.1.2"></a>No knowledge of learning distribution</h4>
<!--l. 3101--><p class="noindent">Applying the full knowledge theorem ( <a 
href="#x50-74001r1">8.1.1<!--tex4ht:ref: Full_knowledge --></a>) is not practical in almost all learning
settings because we do not know the distribution of true error rates. Therefore, it is
necessary to construct a slightly looser theorem which relies upon only observable
quantities. Surprisingly, this is possible while introducing only slightly more
slack.
</p><!--l. 3107--><p class="indent">   First, we need a couple of definitions.
</p><!--l. 3110--><p class="indent">   <!--l. 3110--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">        <mrow 
>
                <mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0304;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B4;</mi><mo 
class="MathClass-punc">,</mo> <mfrac><mrow 
><mi 
>i</mi></mrow> 
<mrow 
><mi 
>m</mi></mrow></mfrac></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo> <!--mstyle 
class="text"--><mtext class="textrm">max&#x000A0;</mtext><!--/mstyle--> <mfenced separators="" 
open="{"  close="}" ><mrow><mfrac><mrow 
><mi 
>K</mi></mrow>
<mrow 
><mi 
>m</mi></mrow></mfrac> <mo 
class="MathClass-bin">+</mo> <mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo> <!--mstyle 
class="text"--><mtext class="textrm">min</mtext><!--/mstyle--> <mfenced separators="" 
open="{"  close="}" ><mrow><mi 
>p</mi> <mo 
class="MathClass-punc">:</mo>  <mn>1</mn> <mo 
class="MathClass-bin">&#x2212;</mo><!--mstyle 
class="text"--><mtext class="textrm">Bin</mtext><!--/mstyle--><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>m</mi><mo 
class="MathClass-punc">,</mo><mi 
>i</mi><mo 
class="MathClass-punc">,</mo><mi 
>p</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2265;</mo> <mi 
>&#x03B4;</mi></mrow></mfenced></mrow></mfenced>
</mrow></math>
Intuitively, <!--l. 3112--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0304;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B4;</mi><mo 
class="MathClass-punc">,</mo> <mfrac><mrow 
><mi 
>i</mi></mrow> 
<mrow 
><mi 
>m</mi></mrow></mfrac></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
is a <span 
class="ecti-1000">lower </span>bound on the true error rate of the hypothesis with an empirical error of <!--l. 3113--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
                                                                     

                                                                     
<mrow 
> <mfrac><mrow 
><mi 
>i</mi></mrow>
<mrow 
><mi 
>m</mi></mrow></mfrac></mrow></math>.
</p><!--l. 3115--><p class="indent">   <!--l. 3115--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">        <mrow 
>
                         <mover 
accent="true"><mrow 
><mi 
>P</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2261;</mo> <mn>2</mn><msub><mrow 
><mo 
class="MathClass-op">&#x2211;</mo>
   </mrow><mrow 
><mi 
>h</mi></mrow></msub 
><mi 
>B</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>m</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi><mo 
class="MathClass-punc">,</mo><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0304;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B4;</mi><mo 
class="MathClass-punc">,</mo><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow>
</mrow></math> The
quantity <!--l. 3117--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mover 
accent="true"><mrow 
><mi 
>P</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
is an upper bound on the probability that one of the hypotheses with a true error rate of <!--l. 3118--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0304;</mo></mover></mrow></math> or more could
produce <!--l. 3119--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>K</mi></mrow></math>
empirical errors.
</p><!--l. 3121--><p class="indent">   Noting that there are only <!--l. 3121--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>m</mi> <mo 
class="MathClass-bin">+</mo> <mn>1</mn></mrow></math>
possible empirical errors, we can first let <!--l. 3123--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">
<mrow 
>
                         <mi 
>c</mi> <mfenced separators="" 
open="("  close=")" ><mrow> <mfrac><mrow 
><mi 
>i</mi></mrow>
<mrow 
><mi 
>m</mi></mrow></mfrac></mrow></mfenced> <mo 
class="MathClass-rel">=</mo> <mfenced separators="" 
open="|"  close="|" ><mrow><mfenced separators="" 
open="{"  close="}" ><mrow><mi 
>h</mi> <mo 
class="MathClass-punc">:</mo> <mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo>  <mfrac><mrow 
><mi 
>i</mi></mrow> 
<mrow 
><mi 
>m</mi></mrow></mfrac></mrow></mfenced></mrow></mfenced>
</mrow></math> be the count of empirical
errors at <!--l. 3125--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
> <mfrac><mrow 
><mi 
>i</mi></mrow>
<mrow 
><mi 
>m</mi></mrow></mfrac></mrow></math>. Then
                                                                     

                                                                     
we can redefine: <!--l. 3126--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">       <mrow 
>
                     <mover 
accent="true"><mrow 
><mi 
>P</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2261;</mo> <mn>2</mn><msubsup><mrow 
><mo 
class="MathClass-op">&#x2211;</mo>
  </mrow><mrow 
><mi 
>i</mi><mo 
class="MathClass-rel">=</mo><mn>0</mn></mrow><mrow 
><mi 
>m</mi></mrow></msubsup 
><mi 
>c</mi> <mfenced separators="" 
open="("  close=")" ><mrow> <mfrac><mrow 
><mi 
>i</mi></mrow>
<mrow 
><mi 
>m</mi></mrow></mfrac></mrow></mfenced><mi 
>B</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>m</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi><mo 
class="MathClass-punc">,</mo><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0304;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B4;</mi><mo 
class="MathClass-punc">,</mo> <mfrac><mrow 
><mi 
>i</mi></mrow> 
<mrow 
><mi 
>m</mi></mrow></mfrac></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow>
</mrow></math>
</p><!--l. 3130--><p class="indent">   Later, we will prove that with high probability, <!--l. 3130--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mover 
accent="true"><mrow 
><mi 
>P</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2265;</mo> <mi 
>P</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>.
Given that this is so, we can prove a theorem which <span 
class="ecti-1000">only </span>relies on observable
quantities.
</p>
   <div class="newtheorem">
<!--l. 3134--><p class="noindent"><span class="head">
<a 
  name="x50-75001r2"></a>
  <span 
class="eccc-1000">T<small 
class="small-caps">H</small><small 
class="small-caps">E</small><small 
class="small-caps">O</small><small 
class="small-caps">R</small><small 
class="small-caps">E</small><small 
class="small-caps">M</small> </span>8.1.2<span 
class="eccc-1000">.</span></span>
</p><!--l. 3135--><p class="indent">   <span 
class="ecti-1000">(Observable Shell Bound) For all </span><!--l. 3135--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>&#x03B4;</mi> <mo 
class="MathClass-rel">&#x003E;</mo> <mn>0</mn></mrow></math><span 
class="ecti-1000">:</span>
<!--l. 3136--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">    <mrow 
>
                       <msub><mrow 
><mo 
>Pr</mo></mrow><mrow 
><msup><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>m</mi></mrow></msup 
></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x2203;</mi><mi 
>h</mi> <mo 
class="MathClass-rel">&#x2208;</mo> <mi 
>H</mi> <mi 
>e</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2265;</mo><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo>&#x0302;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-bin">+</mo> <mi 
>&#x03B5;</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi><mo 
class="MathClass-punc">,</mo><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo>&#x0302;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B4;</mi>
</mrow></math>
<span 
class="ecti-1000">where </span><!--l. 3138--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>&#x03B5;</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi><mo 
class="MathClass-punc">,</mo> <mfrac><mrow 
><mi 
>K</mi></mrow> 
<mrow 
><mi 
>m</mi></mrow></mfrac></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo><mo 
> min</mo> <mfenced separators="" 
open="{"  close="}" ><mrow><mi 
>&#x03B5;</mi> <mo 
class="MathClass-punc">:</mo>  <mover 
accent="true"><mrow 
><mi 
>P</mi></mrow><mo>&#x0302;</mo></mover> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi><mo 
class="MathClass-punc">,</mo> <mfrac><mrow 
><mi 
>&#x03B4;</mi></mrow> 
<mrow 
><mn>2</mn></mrow></mfrac></mrow></mfenced> <mo 
class="MathClass-rel">&#x2264;</mo> <mfrac><mrow 
><mi 
>&#x03B4;</mi></mrow> 
<mrow 
><mn>2</mn><mi 
>m</mi></mrow></mfrac></mrow></mfenced></mrow></math>
</p>
   </div>
<!--l. 3140--><p class="indent">   The observable shell bound preserves the important locality property of the full knowledge
shell bound. In particular, when most of the true error rates are &#x201C;far&#x201D; from the empirical error rate
(and <!--l. 3142--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>m</mi></mrow></math>
is large enough), we expect to make large (functional) improvements on the discrete
hypothesis bound  <a 
href="thesisse16.xml#x23-32001r1">4.2.1<!--tex4ht:ref: th-DHSCP --></a>.
</p><!--l. 3146--><p class="indent">   The proof rests upon a technical lemma which allows us to bound the unobservable
&#x201C;probability of a misleading event&#x201D; with an observable event.
                                                                     

                                                                     
</p>
   <div class="newtheorem">
<!--l. 3149--><p class="noindent"><span class="head">
<a 
  name="x50-75002r3"></a>
  <span 
class="eccc-1000">L<small 
class="small-caps">E</small><small 
class="small-caps">M</small><small 
class="small-caps">M</small><small 
class="small-caps">A</small> </span>8.1.3<span 
class="eccc-1000">.</span></span>
</p><!--l. 3150--><p class="indent">   <span 
class="ecti-1000">(Unobservable bound) For all empirical errors, </span><!--l. 3150--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>K</mi></mrow></math><span 
class="ecti-1000">,</span>
<span 
class="ecti-1000">for all distributions </span><!--l. 3151--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>Q</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math><span 
class="ecti-1000">,</span>
<span 
class="ecti-1000">for all </span><!--l. 3151--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>&#x03B4;</mi> <mo 
class="MathClass-rel">&#x2208;</mo> <mrow><mo 
class="MathClass-open">(</mo><mrow><mn>0</mn><mo 
class="MathClass-punc">,</mo><mn>1</mn></mrow><mo 
class="MathClass-close">]</mo></mrow></mrow></math><span 
class="ecti-1000">:</span>
<!--l. 3152--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">    <mrow 
>
<msub><mrow 
><mo 
>Pr</mo></mrow><mrow 
><msup><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>m</mi></mrow></msup 
></mrow></msub 
> <mfenced separators="" 
open="("  close=")" ><mrow><mn>2</mn><msub><mrow 
><mo 
class="MathClass-op">&#x2211;</mo>
   </mrow><mrow 
><mi 
>h</mi></mrow></msub 
><mi 
>Q</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mi 
>B</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>m</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi><mo 
class="MathClass-punc">,</mo><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo>&#x0304;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B4;</mi><mo 
class="MathClass-punc">,</mo><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo>&#x0302;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2265;</mo><msub><mrow 
><mo 
class="MathClass-op">&#x2211;</mo>
   </mrow><mrow 
><mi 
>h</mi><mo 
class="MathClass-punc">:</mo><mi 
>e</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-rel">&#x003E;</mo><mi 
>K</mi><mo 
class="MathClass-bin">+</mo><mi 
>&#x03B5;</mi></mrow></msub 
><mi 
>Q</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mi 
>B</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>m</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi><mo 
class="MathClass-punc">,</mo><mi 
>e</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></mfenced> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B4;</mi>
</mrow></math>
</p>
   </div>
<!--l. 3156--><p class="indent">   This lemma is powerful because it bounds the unobservable right hand side in terms
of the observable left hand side.
</p>
   <div class="proof">
<!--l. 3160--><p class="indent">   <span class="head">
   <span 
class="eccc-1000">P<small 
class="small-caps">R</small><small 
class="small-caps">O</small><small 
class="small-caps">O</small><small 
class="small-caps">F</small>.</span> </span>Let <!--l. 3160--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0304;</mo></mover> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>m</mi><mo 
class="MathClass-punc">,</mo> <mfrac><mrow 
><mi 
>k</mi></mrow> 
<mrow 
><mi 
>m</mi></mrow></mfrac><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B4;</mi></mrow></mfenced> <mo 
class="MathClass-rel">&#x2261;</mo><msub><mrow 
><mo 
> min</mo></mrow><mrow 
><mi 
>p</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">{</mo><mrow><mi 
>p</mi> <mo 
class="MathClass-punc">:</mo>  <mn>1</mn> <mo 
class="MathClass-bin">&#x2212;</mo><!--mstyle 
class="text"--><mtext class="textrm">Bin</mtext><!--/mstyle--><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>m</mi><mo 
class="MathClass-punc">,</mo><mi 
>k</mi><mo 
class="MathClass-punc">,</mo><mi 
>p</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2265;</mo> <mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">}</mo></mrow></mrow></math>
                                                                     

                                                                     
<!--l. 3161--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">    <mrow 
>
         <mi 
>&#x2200;</mi><mi 
>&#x03B1;</mi> <mo 
class="MathClass-rel">&#x2208;</mo> <mrow><mo 
class="MathClass-open">(</mo><mrow><mn>0</mn><mo 
class="MathClass-punc">,</mo><mn>1</mn></mrow><mo 
class="MathClass-close">]</mo></mrow> <mi 
>&#x2200;</mi><mi 
>h</mi> <mo 
class="MathClass-punc">:</mo>  <msub><mrow 
><mi 
>E</mi></mrow><mrow 
><msup><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>m</mi></mrow></msup 
></mrow></msub 
><mi 
>I</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>e</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x003C;</mo> <mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0304;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>m</mi><mo 
class="MathClass-punc">,</mo><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B1;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B1;</mi>
</mrow></math>
<!--l. 3163--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">    <mrow 
>
                    <mo 
class="MathClass-rel">&#x21D2;</mo><mi 
>&#x2200;</mi><mi 
>P</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <msub><mrow 
><mi 
>E</mi></mrow><mrow 
><mi 
>P</mi></mrow></msub 
><msub><mrow 
><mi 
>E</mi></mrow><mrow 
><msup><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>m</mi></mrow></msup 
></mrow></msub 
><mi 
>I</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>e</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x003C;</mo> <mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0304;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>m</mi><mo 
class="MathClass-punc">,</mo><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B1;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B1;</mi>
</mrow></math>
<!--l. 3165--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">    <mrow 
>
                    <mo 
class="MathClass-rel">&#x21D2;</mo><mi 
>&#x2200;</mi><mi 
>P</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <msub><mrow 
><mi 
>E</mi></mrow><mrow 
><msup><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>m</mi></mrow></msup 
></mrow></msub 
><msub><mrow 
><mi 
>E</mi></mrow><mrow 
><mi 
>P</mi></mrow></msub 
><mi 
>I</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>e</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x003C;</mo> <mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0304;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>m</mi><mo 
class="MathClass-punc">,</mo><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B1;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B1;</mi>
</mrow></math>
                                                                     

                                                                     
<!--l. 3167--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">    <mrow 
>
                <mo 
class="MathClass-rel">&#x21D2;</mo><mi 
>&#x2200;</mi><mi 
>P</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <msub><mrow 
><mo 
>Pr</mo></mrow><mrow 
><msup><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>m</mi></mrow></msup 
></mrow></msub 
> <mfenced separators="" 
open="("  close=")" ><mrow><msub><mrow 
><mi 
>E</mi></mrow><mrow 
><mi 
>P</mi></mrow></msub 
><mi 
>I</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>e</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x003C;</mo> <mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo>&#x0304;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>m</mi><mo 
class="MathClass-punc">,</mo><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo>&#x0302;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B1;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2265;</mo> <mfrac><mrow 
><mi 
>&#x03B1;</mi></mrow> 
<mrow 
><mi 
>&#x03B4;</mi></mrow></mfrac> </mrow></mfenced> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B4;</mi>
</mrow></math>
<!--l. 3169--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">    <mrow 
>
                 <mo 
class="MathClass-rel">&#x21D2;</mo><mi 
>&#x2200;</mi><mi 
>P</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <msub><mrow 
><mo 
>Pr</mo></mrow><mrow 
><msup><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>m</mi></mrow></msup 
></mrow></msub 
> <mfenced separators="" 
open="("  close=")" ><mrow><msub><mrow 
><mo 
>Pr</mo></mrow><mrow 
><mi 
>P</mi></mrow></msub 
> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>e</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x003C;</mo> <mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo>&#x0304;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>m</mi><mo 
class="MathClass-punc">,</mo><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo>&#x0302;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B1;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></mfenced> <mo 
class="MathClass-rel">&#x2265;</mo> <mfrac><mrow 
><mi 
>&#x03B1;</mi></mrow> 
<mrow 
><mi 
>&#x03B4;</mi></mrow></mfrac> </mrow></mfenced> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B4;</mi>
</mrow></math>
Let <!--l. 3171--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>P</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x221D;</mo> <mi 
>Q</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mi 
>B</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>m</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi><mo 
class="MathClass-punc">,</mo><mi 
>e</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>.
Then: <!--l. 3172--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">      <mrow 
>
    <mo 
class="MathClass-rel">&#x21D2;</mo><mi 
>&#x2200;</mi><mi 
>Q</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <msub><mrow 
><mo 
>Pr</mo></mrow><mrow 
><msup><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>m</mi></mrow></msup 
></mrow></msub 
> <mfenced separators="" 
open="("  close=")" ><mrow><mfrac><mrow 
><msub><mrow 
> <mo 
class="MathClass-op">&#x2211;</mo>
  </mrow><mrow 
><mi 
>h</mi><mo 
class="MathClass-punc">:</mo><mi 
>e</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-rel">&#x003E;</mo><mfrac><mrow 
><mi 
>K</mi></mrow>
<mrow 
><mi 
>m</mi></mrow></mfrac><mo 
class="MathClass-bin">+</mo><mi 
>&#x03B5;</mi><mo 
class="MathClass-bin">&#x2227;</mo><mi 
>e</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-rel">&#x003E;</mo><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo>&#x0304;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>m</mi><mo 
class="MathClass-punc">,</mo><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo>&#x0302;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B1;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></msub 
><mi 
>Q</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mi 
>B</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>m</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi><mo 
class="MathClass-punc">,</mo><mi 
>e</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow>
           <mrow 
><msub><mrow 
><mo 
class="MathClass-op">&#x2211;</mo>
       </mrow><mrow 
><mi 
>h</mi><mo 
class="MathClass-punc">:</mo><mi 
>e</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-rel">&#x003E;</mo><mfrac><mrow 
><mi 
>K</mi></mrow>
<mrow 
><mi 
>m</mi></mrow></mfrac><mo 
class="MathClass-bin">+</mo><mi 
>&#x03B5;</mi></mrow></msub 
><mi 
>Q</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mi 
>B</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>m</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi><mo 
class="MathClass-punc">,</mo><mi 
>e</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></mfrac>            <mo 
class="MathClass-rel">&#x2265;</mo> <mfrac><mrow 
><mi 
>&#x03B1;</mi></mrow> 
<mrow 
><mi 
>&#x03B4;</mi></mrow></mfrac> </mrow></mfenced> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B4;</mi>
</mrow></math>
set <!--l. 3174--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>&#x03B1;</mi> <mo 
class="MathClass-rel">=</mo> <mfrac><mrow 
><mi 
>&#x03B4;</mi></mrow> 
<mrow 
><mn>2</mn></mrow></mfrac></mrow></math>and
replace <!--l. 3174--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>e</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
with <!--l. 3174--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0304;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>m</mi><mo 
class="MathClass-punc">,</mo><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B1;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
to achieve the result. <span class="qed"><span 
class="msam-10">&#x25AB;</span></span>
</p>
   </div>
<!--l. 3177--><p class="indent">   We now have all the tools required to prove the theorem.
                                                                     

                                                                     
</p>
   <div class="proof">
<!--l. 3180--><p class="indent">   <span class="head">
   <span 
class="eccc-1000">P<small 
class="small-caps">R</small><small 
class="small-caps">O</small><small 
class="small-caps">O</small><small 
class="small-caps">F</small>.</span> </span>(of theorem <a 
href="#x50-75001r2">8.1.2<!--tex4ht:ref: Observable --></a>) Choose <!--l. 3180--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>Q</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
= the uniform distribution on a our hypothesis space. Then, we know that with
probability <!--l. 3181--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mn>1</mn> <mo 
class="MathClass-bin">&#x2212;</mo> <mi 
>&#x03B4;</mi></mrow></math>,
<!--l. 3182--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">    <mrow 
><mover 
accent="true"><mrow 
><mi 
>P</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2265;</mo> <mi 
>P</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>.
Therefore, we can (arbitrarily) allocate a <!--l. 3183--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mfrac><mrow 
><mi 
>&#x03B4;</mi></mrow>
<mrow 
><mn>2</mn></mrow></mfrac></mrow></math>
probability of failure to the unobservable bound <a 
href="#x50-75002r3">8.1.3<!--tex4ht:ref: unobservable bound --></a> and a <!--l. 3184--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mfrac><mrow 
><mi 
>&#x03B4;</mi></mrow>
<mrow 
><mn>2</mn></mrow></mfrac></mrow></math>
probability of failure to the full knowledge bound <a 
href="#x50-74001r1">8.1.1<!--tex4ht:ref: Full_knowledge --></a>. Assuming <!--l. 3185--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mover 
accent="true"><mrow 
><mi 
>P</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2265;</mo> <mi 
>P</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B5;</mi><mo 
class="MathClass-punc">,</mo><mi 
>K</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>,
the observable bound will be more pessimistic than the full knowledge bound. <span class="qed"><span 
class="msam-10">&#x25AB;</span></span>
</p>
   </div>
<!--l. 3188--><p class="indent">   The Observable Shell bound behaves in a strange manner which is unlike other true
error bounds. In particular, the true error bound can be discontinuous in the value of <!--l. 3190--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>&#x03B4;</mi></mrow></math>. This
discontinuity implies that relatively small improvements in the shell bounds can result in
dramatic improvements in the value of the true error bound. While dramatic
improvements can happen with small improvements in the shell bound, we expect that in
practice, such large improvements will not be too common, simply because a small
improvement is unlikely (amongst all learning problems) to shift the bound across one of
these discontinuous transitions.
</p><!--l. 3199--><p class="indent">
                                                                     

                                                                     
</p>
   <div class="crosslinks"><p class="noindent">[<a 
href="thesisse35.xml" >next</a>] [<a 
href="thesisse34.xml" >front</a>] [<a 
href="thesisch8.xml#thesisse34.xml" >up</a>] </p></div><a 
  name="tailthesisse34.xml"></a>   
</body> 
</html> 
