<?xml version="1.0"?> 
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "mathml.dtd"> 
<?xml-stylesheet type="text/css" href="thesis.css"?> 
<html  
xmlns="http://www.w3.org/1999/xhtml"  
><head><title>7.3 Proof of main theorem</title> 
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1" /> 
<meta name="generator" content="TeX4ht (http://www.cis.ohio-state.edu/~gurari/TeX4ht/mn.html)" /> 
<meta name="originator" content="TeX4ht (http://www.cis.ohio-state.edu/~gurari/TeX4ht/mn.html)" /> 
<!-- 3,early_,early^,xhtml,mozilla --> 
<meta name="src" content="thesis.tex" /> 
<meta name="date" content="2002-08-28 13:56:00" /> 
<link rel="stylesheet" type="text/css" href="thesis.css" /> 
</head><body 
>
   <div class="crosslinks"><p class="noindent">[<a 
href="thesisse32.xml" >next</a>] [<a 
href="thesisse30.xml" >prev</a>] [<a 
href="thesisse30.xml#tailthesisse30.xml" >prev-tail</a>] [<a 
href="#tailthesisse31.xml">tail</a>] [<a 
href="thesisch7.xml#thesisse31.xml" >up</a>] </p></div>
   <h3 class="sectionHead"><span class="titlemark">7.3. </span> <a 
  name="x46-670007.3"></a>Proof of main theorem</h3>
   <h4 class="subsectionHead"><span class="titlemark">7.3.1. </span> <a 
  name="x46-680007.3.1"></a>Definitions and observations</h4>
<!--l. 2708--><p class="noindent">The proof has the same structure as the original margin bound ( <a 
href="thesisse29.xml#x44-65001r1">7.1.1<!--tex4ht:ref: th-margin --></a>) proof with one
step replaced by the application of the relative entropy PAC-Bayes theorem (
<a 
href="thesisse26.xml#x39-59001r1">6.2.1<!--tex4ht:ref: th-repbb --></a>).
</p><!--l. 2712--><p class="indent">   Let <!--l. 2712--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mi 
>N</mi></mrow></math> be any natural
number; later, the choice of <!--l. 2712--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mi 
>N</mi></mrow></math>
will be optimized. In the first part of the proof, we regard <!--l. 2713--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>&#x03B8;</mi></mrow></math> and <!--l. 2713--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>N</mi></mrow></math> as
fixed. Later we generalize this so that they may depend on the sample <!--l. 2714--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>S</mi></mrow></math>.
</p><!--l. 2716--><p class="indent">   We construct the distribution <!--l. 2716--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><msub><mrow 
><mi 
>q</mi></mrow><mrow 
><mi 
>N</mi></mrow></msub 
></mrow></math>
as follows. Draw <!--l. 2716--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mi 
>N</mi></mrow></math>
hypotheses <!--l. 2717--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><msub><mrow 
><mi 
>h</mi></mrow><mrow 
><mi 
>i</mi></mrow></msub 
> <mo 
class="MathClass-rel">&#x223C;</mo> <mi 
>q</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
independently. <!--l. 2717--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><msub><mrow 
><mi 
>Q</mi></mrow><mrow 
><mi 
>N</mi></mrow></msub 
></mrow></math>
might therefore be viewed as the product distribution </p><table class="equation"><tr><td> <a 
  name="x46-68001r1"></a>
<!--l. 2719--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">     
                                                 <msub><mrow 
><mi 
>q</mi></mrow><mrow 
><mi 
>c</mi></mrow></msub 
><msup><mrow 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mrow 
><mi 
>N</mi></mrow></msup 
><mo 
class="MathClass-punc">.</mo>
</math>
<!--l. 2721--><p class="nopar"></p></td><td class="eq-no">(7.3.1)</td></tr></table>
Given the <!--l. 2722--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><msub><mrow 
><mi 
>h</mi></mrow><mrow 
><mi 
>i</mi></mrow></msub 
></mrow></math>
we define <table class="equation"><tr><td> <a 
  name="x46-68002r2"></a>
                                                                     

                                                                     
<!--l. 2723--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">     
                                      <mi 
>g</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo>  <mfrac><mrow 
><mn>1</mn></mrow> 
<mrow 
><mi 
>N</mi></mrow></mfrac><msubsup><mrow 
> <mo 
class="MathClass-op">&#x2211;</mo>
    </mrow><mrow 
><mi 
>i</mi><mo 
class="MathClass-rel">=</mo><mn>1</mn></mrow><mrow 
><mi 
>N</mi></mrow></msubsup 
><msub><mrow 
><mi 
>h</mi></mrow><mrow 
>
<mi 
>i</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow>
</math>
<!--l. 2726--><p class="nopar"></p></td><td class="eq-no">(7.3.2)</td></tr></table>
<!--l. 2729--><p class="indent">   For fixed <!--l. 2729--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>x</mi><mo 
class="MathClass-punc">,</mo> <mi 
>y</mi></mrow></math>,
the value of <!--l. 2729--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>y</mi><mi 
>h</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
are i.i.d. Bernoulli variables with the mean equal to the expected margin: </p><table class="equation"><tr><td> <a 
  name="x46-68003r3"></a>
<!--l. 2731--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">     
                                         <msub><mrow 
><mi 
>E</mi></mrow><mrow 
><mi 
>q</mi><mo 
class="MathClass-rel">&#x223C;</mo><mi 
>h</mi></mrow></msub 
><mi 
>y</mi><mi 
>g</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo> <mi 
>y</mi><mi 
>f</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow>
</math>
<!--l. 2733--><p class="nopar"></p></td><td class="eq-no">(7.3.3)</td></tr></table>
Therefore, <!--l. 2734--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>&#x2200;</mi><mi 
>x</mi><mo 
class="MathClass-punc">,</mo><mi 
>y</mi> <msub><mrow 
><mi 
>E</mi></mrow><mrow 
><mi 
>g</mi><mo 
class="MathClass-rel">&#x223C;</mo><msub><mrow 
><mi 
>Q</mi></mrow><mrow 
><mi 
>N</mi></mrow></msub 
></mrow></msub 
><mrow><mo 
class="MathClass-open">[</mo><mrow><mi 
>y</mi><mi 
>g</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">]</mo></mrow> <mo 
class="MathClass-rel">=</mo> <mi 
>y</mi><mi 
>f</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
and <!--l. 2734--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>y</mi><mi 
>g</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
is distributed according to the familiar Binomial distribution. Later, we will apply a
Binomial tail bound on this quantity.
<!--l. 2738--><p class="indent">   Before writing out the proof mathematically, it is helpful to consider a
graphical view of what we will prove. We will force convergence between three
quantities.
</p><!--l. 2742--><p class="noindent"><img 
src="thesis12x.gif" alt="PIC" class="graphics" width="211.79124pt" height="28.105pt"  /><!--tex4ht:graphics  
name="thesis12x.gif" src="averaging.eps"  
-->
</p><!--l. 2745--><p class="indent">   The convergences are then tied together with triangle inequalities resulting in the
overall proof. The critical improvement of this paper is a refined version of the second
convergence.
                                                                     

                                                                     
</p>
   <h4 class="subsectionHead"><span class="titlemark">7.3.2. </span> <a 
  name="x46-690007.3.2"></a>The Proof</h4>
<!--l. 2752--><p class="noindent">For every <!--l. 2752--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mi 
>&#x03B8;</mi> <mo 
class="MathClass-rel">&#x2208;</mo> <mrow><mo 
class="MathClass-open">(</mo><mrow><mn>0</mn><mo 
class="MathClass-punc">,</mo><mn>1</mn></mrow><mo 
class="MathClass-close">]</mo></mrow></mrow></math> and for
every (fixed) <!--l. 2752--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>g</mi></mrow></math>,
the following simple inequality holds: </p><table class="equation"><tr><td> <a 
  name="x46-69001r4"></a>
<!--l. 2754--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">     
                                       <mi 
>e</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>f</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo><msub><mrow 
><mo 
> Pr</mo></mrow><mrow 
><mi 
>D</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">[</mo><mrow><mi 
>y</mi><mi 
>f</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mn>0</mn></mrow><mo 
class="MathClass-close">]</mo></mrow>
</math>
<!--l. 2757--><p class="nopar"></p></td><td class="eq-no">(7.3.4)</td></tr></table>
<!--l. 2758--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">     <mrow 
>
       <mo 
class="MathClass-rel">=</mo><msub><mrow 
><mo 
> Pr</mo></mrow><mrow 
><mi 
>D</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">[</mo><mrow><mi 
>y</mi><mi 
>g</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mfrac><mrow 
><mi 
>&#x03B8;</mi></mrow> 
<mrow 
><mn>2</mn></mrow></mfrac><mo 
class="MathClass-punc">,</mo> <mi 
>y</mi><mi 
>f</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mn>0</mn></mrow><mo 
class="MathClass-close">]</mo></mrow> <mo 
class="MathClass-bin">+</mo><msub><mrow 
><mo 
> Pr</mo></mrow><mrow 
><mi 
>D</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">[</mo><mrow><mi 
>y</mi><mi 
>g</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x003E;</mo> <mfrac><mrow 
><mi 
>&#x03B8;</mi></mrow> 
<mrow 
><mn>2</mn></mrow></mfrac><mo 
class="MathClass-punc">,</mo> <mi 
>y</mi><mi 
>f</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mn>0</mn></mrow><mo 
class="MathClass-close">]</mo></mrow><mo 
class="MathClass-rel">&#x2223;</mo><mi 
>y</mi><mi 
>f</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mn>0</mn></mrow><mo 
class="MathClass-close">]</mo></mrow>
</mrow></math>
                                                                     

                                                                     
<!--l. 2760--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">     <mrow 
>
                      <mo 
class="MathClass-rel">&#x2264;</mo><msub><mrow 
><mo 
> Pr</mo></mrow><mrow 
><mi 
>D</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">[</mo><mrow><mi 
>y</mi><mi 
>g</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mfrac><mrow 
><mi 
>&#x03B8;</mi></mrow> 
<mrow 
><mn>2</mn></mrow></mfrac></mrow><mo 
class="MathClass-close">]</mo></mrow> <mo 
class="MathClass-bin">+</mo><msub><mrow 
><mo 
> Pr</mo></mrow><mrow 
><mi 
>D</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">[</mo><mrow><mi 
>y</mi><mi 
>g</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x003E;</mo> <mfrac><mrow 
><mi 
>&#x03B8;</mi></mrow> 
<mrow 
><mn>2</mn></mrow></mfrac> <mo 
class="MathClass-rel">&#x2223;</mo><mi 
>y</mi><mi 
>f</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mn>0</mn></mrow><mo 
class="MathClass-close">]</mo></mrow>
</mrow></math>
Note that the left-hand side does not depend on <!--l. 2762--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>g</mi></mrow></math>. By taking the
expectation over <!--l. 2763--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mi 
>g</mi> <mo 
class="MathClass-rel">&#x223C;</mo> <msub><mrow 
><mi 
>Q</mi></mrow><mrow 
><mi 
>N</mi></mrow></msub 
></mrow></math>
(and exchanging the order of expectations in the second term on the right-hand side), we
arrive at <table class="equation"><tr><td> <a 
  name="x46-69002r5"></a>
<!--l. 2765--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">     
        <mi 
>e</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>f</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <msub><mrow 
><mi 
>E</mi></mrow><mrow 
><mi 
>g</mi><mo 
class="MathClass-rel">&#x223C;</mo><msub><mrow 
><mi 
>Q</mi></mrow><mrow 
><mi 
>N</mi></mrow></msub 
></mrow></msub 
> <mfenced separators="" 
open="["  close="]" ><mrow><msub><mrow 
><mo 
>Pr</mo></mrow><mrow 
><mi 
>D</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">[</mo><mrow><mi 
>y</mi><mi 
>g</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mfrac><mrow 
><mi 
>&#x03B8;</mi></mrow> 
<mrow 
><mn>2</mn></mrow></mfrac></mrow><mo 
class="MathClass-close">]</mo></mrow></mrow></mfenced> <mo 
class="MathClass-bin">+</mo> <msub><mrow 
><mi 
>E</mi></mrow><mrow 
><mi 
>D</mi></mrow></msub 
> <mfenced separators="" 
open="["  close="]" ><mrow><msub><mrow 
><mi 
>P</mi></mrow><mrow 
><mi 
>g</mi><mo 
class="MathClass-rel">&#x223C;</mo><msub><mrow 
><mi 
>Q</mi></mrow><mrow 
><mi 
>N</mi></mrow></msub 
></mrow></msub 
><mrow><mo 
class="MathClass-open">[</mo><mrow><mi 
>y</mi><mi 
>g</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x003E;</mo> <mfrac><mrow 
><mi 
>&#x03B8;</mi></mrow> 
<mrow 
><mn>2</mn></mrow></mfrac> <mo 
class="MathClass-rel">&#x2223;</mo><mi 
>y</mi><mi 
>f</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mn>0</mn></mrow><mo 
class="MathClass-close">]</mo></mrow></mrow></mfenced><mo 
class="MathClass-punc">.</mo>
</math>
<!--l. 2767--><p class="nopar"></p></td><td class="eq-no">(7.3.5)</td></tr></table>
For the rightmost term, we can apply a Binomial tail bound to get:
   <table class="equation"><tr><td> <a 
  name="x46-69003r6"></a>
<!--l. 2770--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">     
             <mi 
>e</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>f</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <msub><mrow 
><mi 
>E</mi></mrow><mrow 
><mi 
>g</mi><mo 
class="MathClass-rel">&#x223C;</mo><msub><mrow 
><mi 
>Q</mi></mrow><mrow 
><mi 
>N</mi></mrow></msub 
></mrow></msub 
> <mfenced separators="" 
open="["  close="]" ><mrow><msub><mrow 
><mo 
>Pr</mo></mrow><mrow 
><mi 
>D</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">[</mo><mrow><mi 
>y</mi><mi 
>g</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mfrac><mrow 
><mi 
>&#x03B8;</mi></mrow> 
<mrow 
><mn>2</mn></mrow></mfrac></mrow><mo 
class="MathClass-close">]</mo></mrow></mrow></mfenced> <mo 
class="MathClass-bin">+</mo> <msub><mrow 
><mi 
>E</mi></mrow><mrow 
><mi 
>D</mi></mrow></msub 
> <mfenced separators="" 
open="["  close="]" ><mrow><mn>1</mn> <mo 
class="MathClass-bin">&#x2212;</mo><!--mstyle 
class="text"--><mtext class="textrm">Bin</mtext><!--/mstyle--> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>N</mi><mo 
class="MathClass-punc">,</mo> <mfenced separators="" 
open="\delimiter "4264306 "  close="\delimiter "5265307 " ><mrow><mi 
>N</mi> <mfrac><mrow 
><mn>1</mn> <mo 
class="MathClass-bin">+</mo> <mi 
>&#x03B8;</mi><mo 
class="MathClass-bin">/</mo><mn>2</mn></mrow> 
    <mrow 
><mn>2</mn></mrow></mfrac>    </mrow></mfenced><mo 
class="MathClass-punc">,</mo><mn>0</mn></mrow></mfenced></mrow></mfenced>
</math>
<!--l. 2772--><p class="nopar"></p></td><td class="eq-no">(7.3.6)</td></tr></table>
<table class="equation"><tr><td> <a 
  name="x46-69004r7"></a>
                                                                     

                                                                     
<!--l. 2773--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">     
                   <mo 
class="MathClass-rel">=</mo> <msub><mrow 
><mi 
>E</mi></mrow><mrow 
><mi 
>g</mi><mo 
class="MathClass-rel">&#x223C;</mo><msub><mrow 
><mi 
>Q</mi></mrow><mrow 
><mi 
>N</mi></mrow></msub 
></mrow></msub 
> <mfenced separators="" 
open="["  close="]" ><mrow><msub><mrow 
><mo 
>Pr</mo></mrow><mrow 
><mi 
>D</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">[</mo><mrow><mi 
>y</mi><mi 
>g</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mfrac><mrow 
><mi 
>&#x03B8;</mi></mrow> 
<mrow 
><mn>2</mn></mrow></mfrac></mrow><mo 
class="MathClass-close">]</mo></mrow></mrow></mfenced> <mo 
class="MathClass-bin">+</mo> <mn>1</mn> <mo 
class="MathClass-bin">&#x2212;</mo><!--mstyle 
class="text"--><mtext class="textrm">Bin</mtext><!--/mstyle--> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>N</mi><mo 
class="MathClass-punc">,</mo> <mfenced separators="" 
open="\delimiter "4264306 "  close="\delimiter "5265307 " ><mrow><mi 
>N</mi> <mfrac><mrow 
><mn>1</mn> <mo 
class="MathClass-bin">+</mo> <mi 
>&#x03B8;</mi><mo 
class="MathClass-bin">/</mo><mn>2</mn></mrow> 
    <mrow 
><mn>2</mn></mrow></mfrac>    </mrow></mfenced><mo 
class="MathClass-punc">,</mo><mn>0</mn></mrow></mfenced>
</math>
<!--l. 2775--><p class="nopar"></p></td><td class="eq-no">(7.3.7)</td></tr></table>
We would like to apply the PAC-Bayes theorem <a 
href="thesisse26.xml#x39-59001r1">6.2.1<!--tex4ht:ref: th-repbb --></a> to the right-hand term. Here we use the loss
function <!--l. 2777--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><msub><mrow 
><mi 
>I</mi></mrow><mrow 
><mrow><mo 
class="MathClass-open">{</mo><mrow><mi 
>y</mi><mi 
>g</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-rel">&#x2264;</mo><mi 
>&#x03B8;</mi><mo 
class="MathClass-bin">/</mo><mn>2</mn></mrow><mo 
class="MathClass-close">}</mo></mrow></mrow></msub 
></mrow></math>.
The PAC-Bayes theorem applies for any &#x201C;prior&#x201D; distribution. We use as the &#x201C;prior&#x201D; the distribution
<!--l. 2779--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">     <mrow 
><msub><mrow 
><mi 
>P</mi></mrow><mrow 
><mi 
>N</mi></mrow></msub 
> <mo 
class="MathClass-rel">=</mo> <mi 
>p</mi><msup><mrow 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mrow 
><mi 
>N</mi></mrow></msup 
></mrow></math> where <!--l. 2779--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>p</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math> is the
&#x201C;prior&#x201D; over the base hypothesis space. The choice of this prior allows us to use the identity: <!--l. 2782--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">
<mrow 
>
                    <!--mstyle 
class="text"--><mtext class="textrm">KL</mtext><!--/mstyle--><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mi 
>Q</mi></mrow><mrow 
><mi 
>N</mi></mrow></msub 
><mo 
class="MathClass-rel">&#x2223;</mo><mo 
class="MathClass-rel">&#x2223;</mo><msub><mrow 
><mi 
>P</mi></mrow><mrow 
><mi 
>N</mi></mrow></msub 
></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo> <mi 
>N</mi><!--mstyle 
class="text"--><mtext class="textrm">KL</mtext><!--/mstyle--><mrow><mo 
class="MathClass-open">{</mo><mrow><mi 
>q</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-rel">&#x2223;</mo><mo 
class="MathClass-rel">&#x2223;</mo><mi 
>p</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow>
</mrow></math>
It follows from Theorem  <a 
href="thesisse26.xml#x39-59001r1">6.2.1<!--tex4ht:ref: th-repbb --></a> that: with probability at least <!--l. 2784--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mn>1</mn> <mo 
class="MathClass-bin">&#x2212;</mo> <mi 
>&#x03B4;</mi></mrow></math> over random
choices of <!--l. 2785--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>S</mi></mrow></math>,
for every <!--l. 2785--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>Q</mi></mrow></math>,
   <table class="equation"><tr><td> <a 
  name="x46-69005r8"></a>
                                                                     

                                                                     
<!--l. 2788--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">     
                  <msub><mrow 
><mo 
>Pr</mo></mrow><mrow 
><mi 
>D</mi></mrow></msub 
> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>&#x2203;</mi><mi 
>q</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-punc">:</mo>  <!--mstyle 
class="text"--><mtext class="textrm">KL</mtext><!--/mstyle--> <mfenced separators="" 
open="("  close=")" ><mrow><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo>&#x0302;</mo></mover></mrow><mrow 
><mfrac><mrow 
><mi 
>&#x03B8;</mi></mrow>
<mrow 
><mn>2</mn></mrow></mfrac> </mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>g</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-rel">&#x2223;</mo><mo 
class="MathClass-rel">&#x2223;</mo><msub><mrow 
><mi 
>e</mi></mrow><mrow 
><mfrac><mrow 
><mi 
>&#x03B8;</mi></mrow>
<mrow 
><mn>2</mn></mrow></mfrac> </mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>g</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></mfenced> <mo 
class="MathClass-rel">&#x2264;</mo> <mfrac><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">KL</mtext><!--/mstyle--><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mi 
>Q</mi></mrow><mrow 
><mi 
>N</mi></mrow></msub 
><mo 
class="MathClass-rel">&#x2223;</mo><mo 
class="MathClass-rel">&#x2223;</mo><msub><mrow 
><mi 
>P</mi></mrow><mrow 
><mi 
>N</mi></mrow></msub 
></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-bin">+</mo><mo 
> ln</mo><!--nolimits--> <mfrac><mrow 
><mi 
>m</mi></mrow> 
<mrow 
><mi 
>&#x03B4;</mi></mrow></mfrac> </mrow> 
          <mrow 
><mi 
>m</mi> <mo 
class="MathClass-bin">&#x2212;</mo> <mn>1</mn></mrow></mfrac>          </mrow></mfenced>
</math>
<!--l. 2791--><p class="nopar"></p></td><td class="eq-no">(7.3.8)</td></tr></table>
where <!--l. 2792--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><msub><mrow 
><mi 
>e</mi></mrow><mrow 
><mfrac><mrow 
><mi 
>&#x03B8;</mi></mrow>
<mrow 
><mn>2</mn></mrow></mfrac> </mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>g</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo> <mn>1</mn></mrow></math> with
probability <!--l. 2792--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><msub><mrow 
><mi 
>E</mi></mrow><mrow 
><mi 
>g</mi><mo 
class="MathClass-rel">&#x223C;</mo><msub><mrow 
><mi 
>Q</mi></mrow><mrow 
><mi 
>N</mi></mrow></msub 
></mrow></msub 
> <mfenced separators="" 
open="["  close="]" ><mrow><msub><mrow 
><mo 
>Pr</mo></mrow><mrow 
><mi 
>D</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">[</mo><mrow><mi 
>y</mi><mi 
>g</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mfrac><mrow 
><mi 
>&#x03B8;</mi></mrow> 
<mrow 
><mn>2</mn></mrow></mfrac></mrow><mo 
class="MathClass-close">]</mo></mrow></mrow></mfenced></mrow></math> and <!--l. 2793--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mn>0</mn></mrow></math> otherwise, and <!--l. 2793--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mfrac><mrow 
><mi 
>&#x03B8;</mi></mrow>
<mrow 
><mn>2</mn></mrow></mfrac></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>g</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo> <mn>1</mn></mrow></math> with probability <!--l. 2794--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msub><mrow 
><mi 
>E</mi></mrow><mrow 
><mi 
>g</mi><mo 
class="MathClass-rel">&#x223C;</mo><msub><mrow 
><mi 
>Q</mi></mrow><mrow 
><mi 
>N</mi></mrow></msub 
></mrow></msub 
> <mfenced separators="" 
open="["  close="]" ><mrow><msub><mrow 
><mo 
>Pr</mo></mrow><mrow 
><mi 
>S</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">[</mo><mrow><mi 
>y</mi><mi 
>g</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mfrac><mrow 
><mi 
>&#x03B8;</mi></mrow> 
<mrow 
><mn>2</mn></mrow></mfrac></mrow><mo 
class="MathClass-close">]</mo></mrow></mrow></mfenced></mrow></math> and <!--l. 2795--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mn>0</mn></mrow></math>
otherwise.
<!--l. 2797--><p class="indent">   By the same argument as in ( <a 
href="#x46-69001r4">7.3.4<!--tex4ht:ref: eq-simple --></a>), we get: </p><table class="equation"><tr><td> <a 
  name="x46-69006r9"></a>
<!--l. 2798--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">     <msub><mrow 
>
       <mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mfrac><mrow 
><mi 
>&#x03B8;</mi></mrow>
<mrow 
><mn>2</mn></mrow></mfrac> </mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>g</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo><msub><mrow 
><mo 
> Pr</mo></mrow><mrow 
><mi 
>S</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">[</mo><mrow><mi 
>y</mi><mi 
>g</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mfrac><mrow 
><mi 
>&#x03B8;</mi></mrow> 
<mrow 
><mn>2</mn></mrow></mfrac></mrow><mo 
class="MathClass-close">]</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo><msub><mrow 
><mo 
> Pr</mo></mrow><mrow 
><mi 
>S</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">[</mo><mrow><mi 
>y</mi><mi 
>g</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mfrac><mrow 
><mi 
>&#x03B8;</mi></mrow> 
<mrow 
><mn>2</mn></mrow></mfrac><mo 
class="MathClass-punc">,</mo><mspace width="2.77626pt" class="tmspace"/><mi 
>y</mi><mi 
>f</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x003E;</mo> <mi 
>&#x03B8;</mi></mrow><mo 
class="MathClass-close">]</mo></mrow> <mo 
class="MathClass-bin">+</mo><msub><mrow 
><mo 
> Pr</mo></mrow><mrow 
><mi 
>S</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">[</mo><mrow><mi 
>y</mi><mi 
>f</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B8;</mi></mrow><mo 
class="MathClass-close">]</mo></mrow>
</math>
<!--l. 2800--><p class="nopar"></p></td><td class="eq-no">(7.3.9)</td></tr></table>
   <table class="equation"><tr><td> <a 
  name="x46-69007r10"></a>
<!--l. 2802--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">     
                       <mo 
class="MathClass-rel">&#x2264;</mo><msub><mrow 
><mo 
> Pr</mo></mrow><mrow 
><mi 
>S</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">[</mo><mrow><mi 
>y</mi><mi 
>g</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mfrac><mrow 
><mi 
>&#x03B8;</mi></mrow> 
<mrow 
><mn>2</mn></mrow></mfrac> <mo 
class="MathClass-rel">&#x2223;</mo><mi 
>y</mi><mi 
>f</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x003E;</mo> <mi 
>&#x03B8;</mi></mrow><mo 
class="MathClass-close">]</mo></mrow> <mo 
class="MathClass-bin">+</mo><msub><mrow 
><mo 
> Pr</mo></mrow><mrow 
><mi 
>S</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">[</mo><mrow><mi 
>y</mi><mi 
>f</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B8;</mi></mrow><mo 
class="MathClass-close">]</mo></mrow>
</math>
<!--l. 2804--><p class="nopar"></p></td><td class="eq-no">(7.3.10)</td></tr></table>
<table class="equation"><tr><td> <a 
  name="x46-69008r11"></a>
                                                                     

                                                                     
<!--l. 2805--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">     
                             <mo 
class="MathClass-rel">=</mo><msub><mrow 
><mo 
> Pr</mo></mrow><mrow 
><mi 
>S</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">[</mo><mrow><mi 
>y</mi><mi 
>g</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mfrac><mrow 
><mi 
>&#x03B8;</mi></mrow> 
<mrow 
><mn>2</mn></mrow></mfrac> <mo 
class="MathClass-rel">&#x2223;</mo><mi 
>y</mi><mi 
>f</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x003E;</mo> <mi 
>&#x03B8;</mi></mrow><mo 
class="MathClass-close">]</mo></mrow> <mo 
class="MathClass-bin">+</mo><msub><mrow 
> <mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo>&#x0302;</mo></mover></mrow><mrow 
><mi 
>&#x03B8;</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>f</mi></mrow><mo 
class="MathClass-close">)</mo></mrow>
</math>
<!--l. 2807--><p class="nopar"></p></td><td class="eq-no">(7.3.11)</td></tr></table>
Again, we take expectations over <!--l. 2808--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>g</mi> <mo 
class="MathClass-rel">&#x223C;</mo> <msub><mrow 
><mi 
>Q</mi></mrow><mrow 
><mi 
>N</mi></mrow></msub 
></mrow></math>
on both sides, interchange the order of the expectations and apply the Binomial tail
bound to get: <table class="equation"><tr><td> <a 
  name="x46-69009r12"></a>
<!--l. 2810--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">     
        <msub><mrow 
><mi 
>E</mi></mrow><mrow 
><mi 
>S</mi></mrow></msub 
> <mfenced separators="" 
open="["  close="]" ><mrow><msub><mrow 
><mi 
>P</mi></mrow><mrow 
><mi 
>g</mi><mo 
class="MathClass-rel">&#x223C;</mo><msub><mrow 
><mi 
>Q</mi></mrow><mrow 
><mi 
>N</mi></mrow></msub 
></mrow></msub 
><mrow><mo 
class="MathClass-open">[</mo><mrow><mi 
>y</mi><mi 
>g</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mfrac><mrow 
><mi 
>&#x03B8;</mi></mrow> 
<mrow 
><mn>2</mn></mrow></mfrac></mrow><mo 
class="MathClass-close">]</mo></mrow></mrow></mfenced> <mo 
class="MathClass-rel">&#x2264;</mo><!--mstyle 
class="text"--><mtext class="textrm">Bin</mtext><!--/mstyle--> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>N</mi><mo 
class="MathClass-punc">,</mo> <mfenced separators="" 
open="\delimiter "4264306 "  close="\delimiter "5265307 " ><mrow><mi 
>N</mi> <mfrac><mrow 
><mn>1</mn> <mo 
class="MathClass-bin">+</mo> <mi 
>&#x03B8;</mi><mo 
class="MathClass-bin">/</mo><mn>2</mn></mrow> 
    <mrow 
><mn>2</mn></mrow></mfrac>    </mrow></mfenced><mo 
class="MathClass-punc">,</mo> <mfrac><mrow 
><mn>1</mn> <mo 
class="MathClass-bin">+</mo> <mi 
>&#x03B8;</mi></mrow> 
  <mrow 
><mn>2</mn></mrow></mfrac>   </mrow></mfenced> <mo 
class="MathClass-bin">+</mo><msub><mrow 
><mo 
> Pr</mo></mrow><mrow 
><mi 
>S</mi></mrow></msub 
> <mfenced separators="" 
open="["  close="]" ><mrow><mi 
>y</mi><mi 
>f</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B8;</mi></mrow></mfenced><mo 
class="MathClass-punc">.</mo>
</math>
<!--l. 2813--><p class="nopar"></p></td><td class="eq-no">(7.3.12)</td></tr></table>
Combining our inequalities, we conclude that with probability at least <!--l. 2814--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mn>1</mn> <mo 
class="MathClass-bin">&#x2212;</mo> <mi 
>&#x03B4;</mi></mrow></math>, for every
<!--l. 2815--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">     <mrow 
><mi 
>q</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
<table class="equation"><tr><td> <a 
  name="x46-69010r13"></a>
<!--l. 2816--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">     
                                <!--mstyle 
class="text"--><mtext class="textrm">KL</mtext><!--/mstyle--><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mi 
>q</mi></mrow><mrow 
><mi 
>S</mi></mrow></msub 
><mo 
class="MathClass-rel">&#x2223;</mo><mo 
class="MathClass-rel">&#x2223;</mo><msub><mrow 
><mi 
>p</mi></mrow><mrow 
><mi 
>D</mi></mrow></msub 
></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mfrac><mrow 
><mi 
>N</mi><!--mstyle 
class="text"--><mtext class="textrm">KL</mtext><!--/mstyle--><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>q</mi><mo 
class="MathClass-rel">&#x2223;</mo><mo 
class="MathClass-rel">&#x2223;</mo><mi 
>p</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-bin">+</mo><mo 
> ln</mo><!--nolimits--> <mfrac><mrow 
><mi 
>m</mi></mrow> 
<mrow 
><mi 
>&#x03B4;</mi></mrow></mfrac> </mrow> 
        <mrow 
><mi 
>m</mi> <mo 
class="MathClass-bin">&#x2212;</mo> <mn>1</mn></mrow></mfrac>
</math>
                                                                     

                                                                     
<!--l. 2818--><p class="nopar"></p></td><td class="eq-no">(7.3.13)</td></tr></table>
where <!--l. 2819--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><msub><mrow 
><mi 
>q</mi></mrow><mrow 
><mi 
>S</mi></mrow></msub 
> <mo 
class="MathClass-rel">=</mo> <mn>1</mn></mrow></math> with
probability <!--l. 2819--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><!--mstyle 
class="text"--><mtext class="textrm">Bin</mtext><!--/mstyle--> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>N</mi><mo 
class="MathClass-punc">,</mo> <mfenced separators="" 
open="\delimiter "4264306 "  close="\delimiter "5265307 " ><mrow><mi 
>N</mi> <mfrac><mrow 
><mn>1</mn><mo 
class="MathClass-bin">+</mo><mi 
>&#x03B8;</mi><mo 
class="MathClass-bin">/</mo><mn>2</mn></mrow> 
   <mrow 
><mn>2</mn></mrow></mfrac>    </mrow></mfenced> <mo 
class="MathClass-punc">,</mo> <mfrac><mrow 
><mn>1</mn><mo 
class="MathClass-bin">+</mo><mi 
>&#x03B8;</mi></mrow> 
  <mrow 
><mn>2</mn></mrow></mfrac>  </mrow></mfenced> <mo 
class="MathClass-bin">+</mo><msub><mrow 
><mo 
> Pr</mo></mrow><mrow 
><mi 
>S</mi></mrow></msub 
> <mfenced separators="" 
open="["  close="]" ><mrow><mi 
>y</mi><mi 
>f</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B8;</mi></mrow></mfenced></mrow></math> and <!--l. 2820--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mn>0</mn></mrow></math> otherwise, and <!--l. 2820--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msub><mrow 
><mi 
>p</mi></mrow><mrow 
><mi 
>D</mi></mrow></msub 
> <mo 
class="MathClass-rel">=</mo> <mn>1</mn></mrow></math> with probability <!--l. 2820--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msub><mrow 
><mo 
>Pr</mo></mrow><mrow 
><mi 
>D</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">[</mo><mrow><mi 
>y</mi><mi 
>f</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mn>0</mn></mrow><mo 
class="MathClass-close">]</mo></mrow> <mo 
class="MathClass-bin">&#x2212;</mo> <mn>1</mn> <mo 
class="MathClass-bin">+</mo> <!--mstyle 
class="text"--><mtext class="textrm">Bin</mtext><!--/mstyle--> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>N</mi><mo 
class="MathClass-punc">,</mo> <mfenced separators="" 
open="\delimiter "4264306 "  close="\delimiter "5265307 " ><mrow><mi 
>N</mi> <mfrac><mrow 
><mn>1</mn><mo 
class="MathClass-bin">+</mo><mi 
>&#x03B8;</mi><mo 
class="MathClass-bin">/</mo><mn>2</mn></mrow> 
   <mrow 
><mn>2</mn></mrow></mfrac>    </mrow></mfenced> <mo 
class="MathClass-punc">,</mo><mn>0</mn></mrow></mfenced></mrow></math> and <!--l. 2821--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mn>0</mn></mrow></math>
otherwise.
<!--l. 2823--><p class="indent">   This bound holds for any fixed <!--l. 2823--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>N</mi></mrow></math>
and <!--l. 2823--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>&#x03B8;</mi></mrow></math>,
which is not yet what we need here, since we want to allow these to depend on the data <!--l. 2824--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>S</mi></mrow></math>. In
essence, the bound we proved so far is a statement about certain events, parameterized by <!--l. 2826--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>N</mi></mrow></math> and <!--l. 2826--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>&#x03B8;</mi></mrow></math>,
namely the probability of each event is smaller than <!--l. 2827--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>&#x03B4;</mi></mrow></math>. However,
we need to prove that the probability of the <span 
class="ecti-1000">union </span>of all these events is smaller than <!--l. 2828--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>&#x03B4;</mi></mrow></math>. To this end, we
first observe that this union is contained in the union over only a <span 
class="ecti-1000">countable </span>number of events. Note
that <!--l. 2830--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>g</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2208;</mo><mrow><mo 
class="MathClass-open">{</mo><mrow><mrow><mo 
class="MathClass-open">(</mo><mrow><mn>2</mn><mi 
>k</mi> <mo 
class="MathClass-bin">&#x2212;</mo> <mi 
>N</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-bin">/</mo><mi 
>N</mi><mo 
class="MathClass-rel">&#x2223;</mo><mi 
>k</mi> <mo 
class="MathClass-rel">=</mo> <mn>0</mn><mo 
class="MathClass-punc">,</mo><mn>1</mn><mo 
class="MathClass-punc">,</mo><mo 
class="MathClass-op">&#x2026;</mo><mo 
class="MathClass-punc">,</mo><mi 
>N</mi></mrow><mo 
class="MathClass-close">}</mo></mrow></mrow></math>.
Thus, even with all the possible (positive) values of <!--l. 2831--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>&#x03B8;</mi></mrow></math>, there are no more than <!--l. 2832--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>N</mi> <mo 
class="MathClass-bin">+</mo> <mn>1</mn></mrow></math> events of the form <!--l. 2832--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mrow><mo 
class="MathClass-open">{</mo><mrow><mi 
>y</mi><mi 
>g</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B8;</mi><mo 
class="MathClass-bin">/</mo><mn>2</mn></mrow><mo 
class="MathClass-close">}</mo></mrow></mrow></math>. Denote by <!--l. 2833--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>k</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B8;</mi><mo 
class="MathClass-punc">,</mo><mi 
>N</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math> the largest integer <!--l. 2833--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>k</mi></mrow></math> such that <!--l. 2833--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>k</mi><mo 
class="MathClass-bin">/</mo><mi 
>N</mi> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B8;</mi><mo 
class="MathClass-bin">/</mo><mn>2</mn></mrow></math>. We observe that for
every <!--l. 2834--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>&#x03B8;</mi> <mo 
class="MathClass-rel">&#x003E;</mo> <mn>0</mn></mrow></math>, every <!--l. 2834--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>g</mi></mrow></math> and every distribution
over <!--l. 2835--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi><mo 
class="MathClass-punc">,</mo><mi 
>y</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>:
</p><table class="equation"><tr><td> <a 
  name="x46-69011r14"></a>
                                                                     

                                                                     
<!--l. 2836--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">     
                            <mo 
>Pr</mo> <mfenced separators="" 
open="["  close="]" ><mrow><mi 
>y</mi><mi 
>g</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B8;</mi><mo 
class="MathClass-bin">/</mo><mn>2</mn></mrow></mfenced> <mo 
class="MathClass-rel">=</mo><mo 
> Pr</mo> <mfenced separators="" 
open="["  close="]" ><mrow><mi 
>y</mi><mi 
>g</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mfrac><mrow 
><mi 
>k</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B8;</mi><mo 
class="MathClass-punc">,</mo><mi 
>N</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow> 
   <mrow 
><mi 
>N</mi></mrow></mfrac>   </mrow></mfenced><mo 
class="MathClass-punc">.</mo>
</math>
<!--l. 2838--><p class="nopar"></p></td><td class="eq-no">(7.3.14)</td></tr></table>
This means that the middle step in the proof above, i.e. the application of theorem  <a 
href="thesisse26.xml#x39-59001r1">6.2.1<!--tex4ht:ref: th-repbb --></a>, depends
on <!--l. 2840--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>N</mi><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B8;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math> only
through <!--l. 2840--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>N</mi><mo 
class="MathClass-punc">,</mo><mi 
>k</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>.
Since the other steps are true with probability one, we see that we can
restrict ourselves to the union of countably many events, indexed by <!--l. 2842--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>N</mi><mo 
class="MathClass-punc">,</mo><mi 
>k</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>.
Now, we &#x201C;allocate&#x201D;  parts of the confidence quantity <!--l. 2843--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>&#x03B4;</mi></mrow></math> to each of these
events, namely <!--l. 2844--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>N</mi><mo 
class="MathClass-punc">,</mo><mi 
>k</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
receives <!--l. 2844--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><msub><mrow 
><mi 
>&#x03B4;</mi></mrow><mrow 
><mi 
>N</mi><mo 
class="MathClass-punc">,</mo><mi 
>k</mi></mrow></msub 
> <mo 
class="MathClass-rel">=</mo> <mi 
>&#x03B4;</mi><mo 
class="MathClass-bin">/</mo><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>N</mi><msup><mrow 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>N</mi> <mo 
class="MathClass-bin">+</mo> <mn>1</mn></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mrow 
><mn>2</mn></mrow></msup 
></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-punc">,</mo><mspace width="2.77626pt" class="tmspace"/><mi 
>N</mi> <mo 
class="MathClass-rel">=</mo> <mn>1</mn><mo 
class="MathClass-punc">,</mo><mn>2</mn><mo 
class="MathClass-punc">,</mo><mo 
class="MathClass-op">&#x2026;</mo><mo 
class="MathClass-punc">;</mo> <mi 
>k</mi> <mo 
class="MathClass-rel">=</mo> <mn>0</mn><mo 
class="MathClass-punc">,</mo><mo 
class="MathClass-op">&#x2026;</mo><mo 
class="MathClass-punc">,</mo><mi 
>N</mi></mrow></math>.
It follows easily that the union of all these events has probability at most <!--l. 2846--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msub><mrow 
><mo 
class="MathClass-op">&#x2211;</mo>
  </mrow><mrow 
><mi 
>N</mi><mo 
class="MathClass-punc">,</mo><mi 
>k</mi></mrow></msub 
><msub><mrow 
><mi 
>&#x03B4;</mi></mrow><mrow 
><mi 
>N</mi><mo 
class="MathClass-punc">,</mo><mi 
>k</mi></mrow></msub 
> <mo 
class="MathClass-rel">=</mo> <mi 
>&#x03B4;</mi></mrow></math>.
Therefore we have proved that with probability at least <!--l. 2847--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mn>1</mn> <mo 
class="MathClass-bin">&#x2212;</mo> <mi 
>&#x03B4;</mi></mrow></math> over random choices
of <!--l. 2847--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mi 
>S</mi></mrow></math> it holds true
that for <span 
class="ecti-1000">all </span><!--l. 2848--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mi 
>N</mi></mrow></math>
and <span 
class="ecti-1000">all </span><!--l. 2848--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mi 
>&#x03B8;</mi> <mo 
class="MathClass-rel">&#x2208;</mo> <mrow><mo 
class="MathClass-open">(</mo><mrow><mn>0</mn><mo 
class="MathClass-punc">,</mo><mn>1</mn></mrow><mo 
class="MathClass-close">]</mo></mrow></mrow></math>,
<table class="equation"><tr><td> <a 
  name="x46-69012r15"></a>
<!--l. 2849--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">     
      <!--mstyle 
class="text"--><mtext class="textrm">KL</mtext><!--/mstyle--><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mi 
>q</mi></mrow><mrow 
><mi 
>S</mi></mrow></msub 
><mo 
class="MathClass-rel">&#x2223;</mo><mo 
class="MathClass-rel">&#x2223;</mo><msub><mrow 
><mi 
>p</mi></mrow><mrow 
><mi 
>D</mi></mrow></msub 
></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mfrac><mrow 
><mi 
>N</mi><!--mstyle 
class="text"--><mtext class="textrm">KL</mtext><!--/mstyle--><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>Q</mi><mo 
class="MathClass-rel">&#x2223;</mo><mo 
class="MathClass-rel">&#x2223;</mo><mi 
>P</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-bin">+</mo><mo 
> ln</mo><!--nolimits-->  <mfrac><mrow 
><mn>2</mn><mi 
>m</mi></mrow> 
<mrow 
><msub><mrow 
><mi 
>&#x03B4;</mi></mrow><mrow 
><mi 
>N</mi><mo 
class="MathClass-punc">,</mo><mi 
>k</mi></mrow></msub 
></mrow></mfrac></mrow> 
           <mrow 
><mi 
>m</mi> <mo 
class="MathClass-bin">&#x2212;</mo> <mn>1</mn></mrow></mfrac>           <mo 
class="MathClass-rel">&#x2264;</mo> <mfrac><mrow 
><mi 
>N</mi><!--mstyle 
class="text"--><mtext class="textrm">KL</mtext><!--/mstyle--><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>Q</mi><mo 
class="MathClass-rel">&#x2223;</mo><mo 
class="MathClass-rel">&#x2223;</mo><mi 
>P</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-bin">+</mo><mo 
> ln</mo><!--nolimits--> <mfrac><mrow 
><mn>2</mn><mi 
>m</mi></mrow> 
 <mrow 
><mi 
>&#x03B4;</mi></mrow></mfrac>  <mo 
class="MathClass-bin">+</mo> <mn>3</mn><mo 
>ln</mo><!--nolimits--><mi 
>N</mi> <mo 
class="MathClass-bin">+</mo> <mn>1</mn></mrow> 
                  <mrow 
><mi 
>m</mi> <mo 
class="MathClass-bin">&#x2212;</mo> <mn>1</mn></mrow></mfrac>
</math>
<!--l. 2852--><p class="nopar"></p></td><td class="eq-no">(7.3.15)</td></tr></table>
We can now choose <!--l. 2853--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mi 
>N</mi></mrow></math> to
minimize this bound. <!--l. 2853--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mi 
>N</mi></mrow></math>
may depend on <!--l. 2853--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mi 
>&#x03B8;</mi><mo 
class="MathClass-punc">,</mo> <mi 
>Q</mi></mrow></math>
and the sample <!--l. 2854--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mi 
>S</mi></mrow></math>.
<!--l. 2856--><p class="indent">   The asymptotic bound stated in the theorem can be derived by choosing <!--l. 2856--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>N</mi></mrow></math> (with respect
to <!--l. 2857--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>&#x03B8;</mi></mrow></math>
and <!--l. 2857--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>Q</mi></mrow></math>)
so as to <span 
class="ecti-1000">approximately </span>minimize the bound we have derived above. The
                                                                     

                                                                     
first step in this minimization is the replacement of the Binomial tail
with a looser approximation such as the Hoeffding bound which gives us: <!--l. 2861--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">
<mrow 
>
                <mn>1</mn> <mo 
class="MathClass-bin">&#x2212;</mo><!--mstyle 
class="text"--><mtext class="textrm">Bin</mtext><!--/mstyle--> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>N</mi><mo 
class="MathClass-punc">,</mo> <mfenced separators="" 
open="\delimiter "4264306 "  close="\delimiter "5265307 " ><mrow><mi 
>N</mi> <mfrac><mrow 
><mn>1</mn> <mo 
class="MathClass-bin">+</mo> <mi 
>&#x03B8;</mi><mo 
class="MathClass-bin">/</mo><mn>2</mn></mrow> 
    <mrow 
><mn>2</mn></mrow></mfrac>    </mrow></mfenced><mo 
class="MathClass-punc">,</mo><mn>0</mn></mrow></mfenced> <mo 
class="MathClass-rel">&#x2264;</mo> <msup><mrow 
><mi 
>e</mi></mrow><mrow 
><mo 
class="MathClass-bin">&#x2212;</mo><mn>2</mn><mi 
>N</mi> <mfrac><mrow 
><msup><mrow 
><mi 
>&#x03B8;</mi></mrow><mrow 
><mn>2</mn></mrow></msup 
></mrow> 
<mrow 
><msup><mrow 
><mn>4</mn></mrow><mrow 
><mn>2</mn></mrow></msup 
></mrow></mfrac> </mrow></msup 
> <mo 
class="MathClass-rel">=</mo> <msup><mrow 
><mi 
>e</mi></mrow><mrow 
><mo 
class="MathClass-bin">&#x2212;</mo><mi 
>N</mi> <mfrac><mrow 
><msup><mrow 
><mi 
>&#x03B8;</mi></mrow><mrow 
><mn>2</mn></mrow></msup 
></mrow> 
 <mrow 
><mn>8</mn></mrow></mfrac>  </mrow></msup 
>
</mrow></math> <!--l. 2863--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">
<mrow 
>
                <!--mstyle 
class="text"--><mtext class="textrm">Bin</mtext><!--/mstyle--> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>N</mi><mo 
class="MathClass-punc">,</mo> <mfenced separators="" 
open="\delimiter "4264306 "  close="\delimiter "5265307 " ><mrow><mi 
>N</mi> <mfrac><mrow 
><mn>1</mn> <mo 
class="MathClass-bin">+</mo> <mi 
>&#x03B8;</mi><mo 
class="MathClass-bin">/</mo><mn>2</mn></mrow> 
    <mrow 
><mn>2</mn></mrow></mfrac>    </mrow></mfenced><mo 
class="MathClass-punc">,</mo> <mfrac><mrow 
><mn>1</mn> <mo 
class="MathClass-bin">+</mo> <mi 
>&#x03B8;</mi></mrow> 
  <mrow 
><mn>2</mn></mrow></mfrac>   </mrow></mfenced> <mo 
class="MathClass-rel">&#x2264;</mo> <msup><mrow 
><mi 
>e</mi></mrow><mrow 
><mo 
class="MathClass-bin">&#x2212;</mo><mn>2</mn><mi 
>N</mi> <mfrac><mrow 
><msup><mrow 
><mi 
>&#x03B8;</mi></mrow><mrow 
><mn>2</mn></mrow></msup 
></mrow> 
<mrow 
><msup><mrow 
><mn>4</mn></mrow><mrow 
><mn>2</mn></mrow></msup 
></mrow></mfrac> </mrow></msup 
> <mo 
class="MathClass-rel">=</mo> <msup><mrow 
><mi 
>e</mi></mrow><mrow 
><mo 
class="MathClass-bin">&#x2212;</mo><mi 
>N</mi> <mfrac><mrow 
><msup><mrow 
><mi 
>&#x03B8;</mi></mrow><mrow 
><mn>2</mn></mrow></msup 
></mrow> 
 <mrow 
><mn>8</mn></mrow></mfrac>  </mrow></msup 
>
</mrow></math>
                                                                     

                                                                     
</p><!--l. 2867--><p class="indent">   We can then choose <!--l. 2868--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">       <mrow 
>
                                         <mi 
>N</mi> <mo 
class="MathClass-rel">=</mo> <mfenced separators="" 
open="\delimiter "4264306 "  close="\delimiter "5265307 " ><mrow><mn>8</mn><mfrac><mrow 
><mo 
>ln</mo><!--nolimits-->     <mfrac><mrow 
><mi 
>m</mi></mrow> 
<mrow 
><!--mstyle 
class="text"--><mtext class="textrm">KL</mtext><!--/mstyle--><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>q</mi><mo 
class="MathClass-rel">&#x2223;</mo><mo 
class="MathClass-rel">&#x2223;</mo><mi 
>p</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></mfrac></mrow>
       <mrow 
><msup><mrow 
><mi 
>&#x03B8;</mi></mrow><mrow 
><mn>2</mn></mrow></msup 
></mrow></mfrac>       </mrow></mfenced> <mo 
class="MathClass-punc">.</mo>
</mrow></math>
</p><!--l. 2872--><p class="indent">   This choice gives us:
</p><!--l. 2875--><p class="indent">   <!--l. 2875--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">        <mrow 
>
                                          <msup><mrow 
><mi 
>e</mi></mrow><mrow 
><mo 
class="MathClass-bin">&#x2212;</mo><mfrac><mrow 
><mi 
>N</mi><msup><mrow 
><mi 
>&#x03B8;</mi></mrow><mrow 
><mn>2</mn></mrow></msup 
></mrow> 
  <mrow 
><mn>8</mn></mrow></mfrac>   </mrow></msup 
> <mo 
class="MathClass-rel">=</mo> <mfrac><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">KL</mtext><!--/mstyle--><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>q</mi><mo 
class="MathClass-rel">&#x2223;</mo><mo 
class="MathClass-rel">&#x2223;</mo><mi 
>p</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow> 
    <mrow 
><mi 
>m</mi></mrow></mfrac>
</mrow></math>
</p><!--l. 2879--><p class="indent">   Which implies we have an equation of the form:
</p>
   <table class="equation"><tr><td> <a 
  name="x46-69013r16"></a>
<!--l. 2882--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">     
           <!--mstyle 
class="text"--><mtext class="textrm">KL</mtext><!--/mstyle--> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>q</mi> <mo 
class="MathClass-bin">+</mo> <mfrac><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">KL</mtext><!--/mstyle--><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>q</mi><mo 
class="MathClass-rel">&#x2223;</mo><mo 
class="MathClass-rel">&#x2223;</mo><mi 
>p</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow> 
    <mrow 
><mi 
>m</mi></mrow></mfrac>    <mo 
class="MathClass-rel">&#x2223;</mo><mo 
class="MathClass-rel">&#x2223;</mo><mi 
>p</mi> <mo 
class="MathClass-bin">&#x2212;</mo><mfrac><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">KL</mtext><!--/mstyle--><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>q</mi><mo 
class="MathClass-rel">&#x2223;</mo><mo 
class="MathClass-rel">&#x2223;</mo><mi 
>p</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow> 
    <mrow 
><mi 
>m</mi></mrow></mfrac>    </mrow></mfenced> <mo 
class="MathClass-rel">&#x2264;</mo> <mfrac><mrow 
><msup><mrow 
><mi 
>&#x03B8;</mi></mrow><mrow 
><mo 
class="MathClass-bin">&#x2212;</mo><mn>2</mn></mrow></msup 
><!--mstyle 
class="text"--><mtext class="textrm">KL</mtext><!--/mstyle--><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>q</mi><mo 
class="MathClass-rel">&#x2223;</mo><mo 
class="MathClass-rel">&#x2223;</mo><mi 
>p</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
>ln</mo><!--nolimits--><mi 
>m</mi> <mo 
class="MathClass-bin">+</mo><mo 
> ln</mo><!--nolimits--><mi 
>m</mi> <mo 
class="MathClass-bin">+</mo><mo 
> ln</mo><!--nolimits--> <mfrac><mrow 
><mn>1</mn></mrow> 
<mrow 
><mi 
>&#x03B4;</mi></mrow></mfrac></mrow> 
                   <mrow 
><mi 
>m</mi></mrow></mfrac>
</math>
<!--l. 2884--><p class="nopar"></p></td><td class="eq-no">(7.3.16)</td></tr></table>
In order to prove the result, we must show that: <table class="equation"><tr><td> <a 
  name="x46-69014r17"></a>
                                                                     

                                                                     
<!--l. 2886--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">     
                         <!--mstyle 
class="text"--><mtext class="textrm">KL</mtext><!--/mstyle--> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>q</mi> <mo 
class="MathClass-bin">+</mo> <mfrac><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">KL</mtext><!--/mstyle--><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>q</mi><mo 
class="MathClass-rel">&#x2223;</mo><mo 
class="MathClass-rel">&#x2223;</mo><mi 
>p</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow> 
    <mrow 
><mi 
>m</mi></mrow></mfrac>    <mo 
class="MathClass-rel">&#x2223;</mo><mo 
class="MathClass-rel">&#x2223;</mo><mi 
>p</mi> <mo 
class="MathClass-bin">&#x2212;</mo><mfrac><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">KL</mtext><!--/mstyle--><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>q</mi><mo 
class="MathClass-rel">&#x2223;</mo><mo 
class="MathClass-rel">&#x2223;</mo><mi 
>p</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow> 
    <mrow 
><mi 
>m</mi></mrow></mfrac>    </mrow></mfenced> <mo 
class="MathClass-rel">=</mo> <mi 
>O</mi> <mfenced separators="" 
open="("  close=")" ><mrow><!--mstyle 
class="text"--><mtext class="textrm">KL</mtext><!--/mstyle--> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>q</mi><mo 
class="MathClass-rel">&#x2223;</mo><mo 
class="MathClass-rel">&#x2223;</mo><mi 
>p</mi></mrow></mfenced></mrow></mfenced>
</math>
<!--l. 2888--><p class="nopar"></p></td><td class="eq-no">(7.3.17)</td></tr></table>
<!--l. 2891--><p class="indent">   We can do this by taking the difference to get an equation of the form:
</p><!--l. 2893--><p class="indent">   <!--l. 2893--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>q</mi> <mo 
class="MathClass-bin">+</mo> <mi 
>k</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
>ln</mo><!--nolimits--> <mfrac><mrow 
><mi 
>q</mi><mo 
class="MathClass-bin">+</mo><mi 
>k</mi></mrow> 
<mrow 
><mi 
>p</mi><mo 
class="MathClass-bin">&#x2212;</mo><mi 
>k</mi></mrow></mfrac> <mo 
class="MathClass-bin">+</mo> <mrow><mo 
class="MathClass-open">(</mo><mrow><mn>1</mn> <mo 
class="MathClass-bin">&#x2212;</mo> <mi 
>q</mi> <mo 
class="MathClass-bin">&#x2212;</mo> <mi 
>k</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
>ln</mo><!--nolimits--> <mfrac><mrow 
><mn>1</mn><mo 
class="MathClass-bin">&#x2212;</mo><mi 
>q</mi><mo 
class="MathClass-bin">&#x2212;</mo><mi 
>k</mi></mrow> 
<mrow 
><mn>1</mn><mo 
class="MathClass-bin">&#x2212;</mo><mi 
>p</mi><mo 
class="MathClass-bin">+</mo><mi 
>k</mi></mrow></mfrac> <mo 
class="MathClass-bin">&#x2212;</mo> <mi 
>q</mi><mo 
>ln</mo><!--nolimits--> <mfrac><mrow 
><mi 
>q</mi></mrow> 
<mrow 
><mi 
>p</mi></mrow></mfrac> <mo 
class="MathClass-bin">&#x2212;</mo> <mrow><mo 
class="MathClass-open">(</mo><mrow><mn>1</mn> <mo 
class="MathClass-bin">&#x2212;</mo> <mi 
>q</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
>ln</mo><!--nolimits--> <mfrac><mrow 
><mn>1</mn><mo 
class="MathClass-bin">&#x2212;</mo><mi 
>q</mi></mrow> 
<mrow 
><mn>1</mn><mo 
class="MathClass-bin">&#x2212;</mo><mi 
>p</mi></mrow></mfrac></mrow></math>
</p><!--l. 2895--><p class="indent">   If <!--l. 2895--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>p</mi> <mo 
class="MathClass-bin">&#x2212;</mo> <mi 
>q</mi> <mo 
class="MathClass-rel">&#x003E;</mo> <mn>2</mn><mi 
>k</mi></mrow></math>
and <!--l. 2895--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>k</mi> <mo 
class="MathClass-rel">&#x003C;</mo> <mfrac><mrow 
><mn>1</mn></mrow> 
<mrow 
><mn>2</mn></mrow></mfrac></mrow></math>
then we have:
</p><!--l. 2897--><p class="indent">   <!--l. 2897--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mo 
class="MathClass-rel">&#x2243;</mo> <mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>q</mi> <mo 
class="MathClass-bin">+</mo> <mi 
>k</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
>ln</mo><!--nolimits--> <mfrac><mrow 
><mi 
>q</mi><mo 
class="MathClass-bin">+</mo><mi 
>k</mi><mo 
class="MathClass-bin">+</mo><mfrac><mrow 
><mi 
>k</mi></mrow>
<mrow 
><mi 
>p</mi></mrow></mfrac></mrow> 
    <mrow 
><mi 
>p</mi></mrow></mfrac>     <mo 
class="MathClass-bin">+</mo> <mrow><mo 
class="MathClass-open">(</mo><mrow><mn>1</mn> <mo 
class="MathClass-bin">&#x2212;</mo> <mi 
>q</mi> <mo 
class="MathClass-bin">&#x2212;</mo> <mi 
>k</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
>ln</mo><!--nolimits--> <mfrac><mrow 
><mn>1</mn><mo 
class="MathClass-bin">&#x2212;</mo><mi 
>q</mi><mo 
class="MathClass-bin">&#x2212;</mo><mi 
>k</mi><mo 
class="MathClass-bin">&#x2212;</mo> <mfrac><mrow 
><mi 
>k</mi></mrow>
<mrow 
><mn>1</mn><mo 
class="MathClass-bin">&#x2212;</mo><mi 
>p</mi></mrow></mfrac></mrow> 
     <mrow 
><mn>1</mn><mo 
class="MathClass-bin">&#x2212;</mo><mi 
>p</mi></mrow></mfrac>      <mo 
class="MathClass-bin">&#x2212;</mo> <mi 
>q</mi><mo 
>ln</mo><!--nolimits--> <mfrac><mrow 
><mi 
>q</mi></mrow> 
<mrow 
><mi 
>p</mi></mrow></mfrac> <mo 
class="MathClass-bin">&#x2212;</mo> <mrow><mo 
class="MathClass-open">(</mo><mrow><mn>1</mn> <mo 
class="MathClass-bin">&#x2212;</mo> <mi 
>q</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
>ln</mo><!--nolimits--> <mfrac><mrow 
><mn>1</mn><mo 
class="MathClass-bin">&#x2212;</mo><mi 
>q</mi></mrow> 
<mrow 
><mn>1</mn><mo 
class="MathClass-bin">&#x2212;</mo><mi 
>p</mi></mrow></mfrac></mrow></math>
</p><!--l. 2899--><p class="indent">   <!--l. 2899--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mo 
class="MathClass-rel">&#x2243;</mo> <mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>q</mi> <mo 
class="MathClass-bin">+</mo> <mi 
>k</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mrow><mo 
class="MathClass-open">[</mo><mrow><mo 
>ln</mo><!--nolimits--> <mfrac><mrow 
><mi 
>q</mi></mrow> 
<mrow 
><mi 
>p</mi></mrow></mfrac> <mo 
class="MathClass-bin">+</mo> <mfrac><mrow 
><mi 
>k</mi><mo 
class="MathClass-bin">+</mo><mfrac><mrow 
><mi 
>k</mi></mrow>
<mrow 
><mi 
>p</mi></mrow></mfrac></mrow> 
  <mrow 
><mfrac><mrow 
><mi 
>q</mi></mrow> 
<mrow 
><mi 
>p</mi></mrow></mfrac></mrow></mfrac>   <mo 
class="MathClass-bin">+</mo> <mrow><mo 
class="MathClass-open">(</mo><mrow><mn>1</mn> <mo 
class="MathClass-bin">&#x2212;</mo> <mi 
>q</mi> <mo 
class="MathClass-bin">&#x2212;</mo> <mi 
>k</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
>ln</mo><!--nolimits--> <mfrac><mrow 
><mn>1</mn><mo 
class="MathClass-bin">&#x2212;</mo><mi 
>q</mi></mrow> 
<mrow 
><mn>1</mn><mo 
class="MathClass-bin">&#x2212;</mo><mi 
>p</mi></mrow></mfrac> <mo 
class="MathClass-bin">&#x2212;</mo><mfenced separators="" 
open="["  close="]" ><mrow><mfrac><mrow 
><mi 
>k</mi><mo 
class="MathClass-bin">+</mo>  <mfrac><mrow 
><mi 
>k</mi></mrow>
<mrow 
><mn>1</mn><mo 
class="MathClass-bin">&#x2212;</mo><mi 
>p</mi></mrow></mfrac></mrow>
  <mrow 
><mfrac><mrow 
><mn>1</mn><mo 
class="MathClass-bin">&#x2212;</mo><mi 
>q</mi></mrow> 
<mrow 
><mn>1</mn><mo 
class="MathClass-bin">&#x2212;</mo><mi 
>p</mi></mrow></mfrac></mrow></mfrac>  </mrow></mfenced> <mo 
class="MathClass-bin">&#x2212;</mo> <mi 
>q</mi><mo 
>ln</mo><!--nolimits--> <mfrac><mrow 
><mi 
>q</mi></mrow> 
<mrow 
><mi 
>p</mi></mrow></mfrac> <mo 
class="MathClass-bin">&#x2212;</mo> <mrow><mo 
class="MathClass-open">(</mo><mrow><mn>1</mn> <mo 
class="MathClass-bin">&#x2212;</mo> <mi 
>q</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
>ln</mo><!--nolimits--> <mfrac><mrow 
><mn>1</mn><mo 
class="MathClass-bin">&#x2212;</mo><mi 
>q</mi></mrow> 
<mrow 
><mn>1</mn><mo 
class="MathClass-bin">&#x2212;</mo><mi 
>p</mi></mrow></mfrac></mrow></math>
</p><!--l. 2901--><p class="indent">   <!--l. 2901--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mo 
class="MathClass-rel">&#x2243;</mo> <mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>q</mi> <mo 
class="MathClass-bin">+</mo> <mi 
>k</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mrow><mo 
class="MathClass-open">[</mo><mrow><mo 
>ln</mo><!--nolimits--> <mfrac><mrow 
><mi 
>q</mi></mrow> 
<mrow 
><mi 
>p</mi></mrow></mfrac> <mo 
class="MathClass-bin">+</mo> <mi 
>k</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mfrac><mrow 
><mi 
>p</mi><mo 
class="MathClass-bin">+</mo><mn>1</mn></mrow>
  <mrow 
><mi 
>q</mi></mrow></mfrac>  </mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">]</mo></mrow> <mo 
class="MathClass-bin">+</mo> <mrow><mo 
class="MathClass-open">(</mo><mrow><mn>1</mn> <mo 
class="MathClass-bin">&#x2212;</mo> <mi 
>q</mi> <mo 
class="MathClass-bin">&#x2212;</mo> <mi 
>k</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mrow><mo 
class="MathClass-open">[</mo><mrow><mo 
>ln</mo><!--nolimits--> <mfrac><mrow 
><mn>1</mn><mo 
class="MathClass-bin">&#x2212;</mo><mi 
>q</mi></mrow> 
<mrow 
><mn>1</mn><mo 
class="MathClass-bin">&#x2212;</mo><mi 
>p</mi></mrow></mfrac> <mo 
class="MathClass-bin">&#x2212;</mo> <mi 
>k</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mfrac><mrow 
><mn>2</mn><mo 
class="MathClass-bin">&#x2212;</mo><mi 
>p</mi></mrow>
<mrow 
><mn>1</mn><mo 
class="MathClass-bin">&#x2212;</mo><mi 
>q</mi></mrow></mfrac></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">]</mo></mrow> <mo 
class="MathClass-bin">&#x2212;</mo> <mi 
>q</mi><mo 
>ln</mo><!--nolimits--> <mfrac><mrow 
><mi 
>q</mi></mrow> 
<mrow 
><mi 
>p</mi></mrow></mfrac> <mo 
class="MathClass-bin">&#x2212;</mo> <mrow><mo 
class="MathClass-open">(</mo><mrow><mn>1</mn> <mo 
class="MathClass-bin">&#x2212;</mo> <mi 
>q</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
>ln</mo><!--nolimits--> <mfrac><mrow 
><mn>1</mn><mo 
class="MathClass-bin">&#x2212;</mo><mi 
>q</mi></mrow> 
<mrow 
><mn>1</mn><mo 
class="MathClass-bin">&#x2212;</mo><mi 
>p</mi></mrow></mfrac></mrow> </math>
</p><!--l. 2903--><p class="indent">   <!--l. 2903--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mo 
class="MathClass-rel">&#x2243;</mo> <mi 
>k</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mn>1</mn> <mo 
class="MathClass-bin">+</mo> <mi 
>p</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-bin">&#x2212;</mo> <mi 
>k</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mn>2</mn> <mo 
class="MathClass-bin">&#x2212;</mo> <mi 
>p</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
</p><!--l. 2905--><p class="indent">   <!--l. 2905--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mo 
class="MathClass-rel">&#x2243;</mo> <mi 
>k</mi></mrow></math>
</p><!--l. 2907--><p class="indent">   Since we have that the difference </p><table class="equation"><tr><td> <a 
  name="x46-69015r18"></a>
<!--l. 2908--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">     
                   <!--mstyle 
class="text"--><mtext class="textrm">KL</mtext><!--/mstyle--> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>q</mi> <mo 
class="MathClass-bin">+</mo> <mfrac><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">KL</mtext><!--/mstyle--><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>q</mi><mo 
class="MathClass-rel">&#x2223;</mo><mo 
class="MathClass-rel">&#x2223;</mo><mi 
>p</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow> 
    <mrow 
><mi 
>m</mi></mrow></mfrac>    <mo 
class="MathClass-rel">&#x2223;</mo><mo 
class="MathClass-rel">&#x2223;</mo><mi 
>p</mi> <mo 
class="MathClass-bin">&#x2212;</mo><mfrac><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">KL</mtext><!--/mstyle--><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>q</mi><mo 
class="MathClass-rel">&#x2223;</mo><mo 
class="MathClass-rel">&#x2223;</mo><mi 
>p</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow> 
    <mrow 
><mi 
>m</mi></mrow></mfrac>    </mrow></mfenced> <mo 
class="MathClass-bin">&#x2212;</mo><!--mstyle 
class="text"--><mtext class="textrm">KL</mtext><!--/mstyle--> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>q</mi><mo 
class="MathClass-rel">&#x2223;</mo><mo 
class="MathClass-rel">&#x2223;</mo><mi 
>p</mi></mrow></mfenced> <mo 
class="MathClass-rel">&#x2243;</mo> <mrow><mo 
class="MathClass-open">[</mo><mrow><!--mstyle 
class="text"--><mtext class="textrm">KL</mtext><!--/mstyle--> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>q</mi><mo 
class="MathClass-rel">&#x2223;</mo><mo 
class="MathClass-rel">&#x2223;</mo><mi 
>p</mi></mrow></mfenced></mrow><mo 
class="MathClass-close">]</mo></mrow>
</math>
<!--l. 2910--><p class="nopar"></p></td><td class="eq-no">(7.3.18)</td></tr></table>
the main theorem follows immediately.
<!--l. 2914--><p class="indent">
                                                                     

                                                                     
</p>
   <div class="crosslinks"><p class="noindent">[<a 
href="thesisse32.xml" >next</a>] [<a 
href="thesisse30.xml" >prev</a>] [<a 
href="thesisse30.xml#tailthesisse30.xml" >prev-tail</a>] [<a 
href="thesisse31.xml" >front</a>] [<a 
href="thesisch7.xml#thesisse31.xml" >up</a>] </p></div><a 
  name="tailthesisse31.xml"></a>   
</body> 
</html> 
