<?xml version="1.0"?> 
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "mathml.dtd"> 
<?xml-stylesheet type="text/css" href="thesis.css"?> 
<html  
xmlns="http://www.w3.org/1999/xhtml"  
><head><title>3.4 Converting to a P-value approach</title> 
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1" /> 
<meta name="generator" content="TeX4ht (http://www.cis.ohio-state.edu/~gurari/TeX4ht/mn.html)" /> 
<meta name="originator" content="TeX4ht (http://www.cis.ohio-state.edu/~gurari/TeX4ht/mn.html)" /> 
<!-- 3,early_,early^,xhtml,mozilla --> 
<meta name="src" content="thesis.tex" /> 
<meta name="date" content="2002-08-28 13:56:00" /> 
<link rel="stylesheet" type="text/css" href="thesis.css" /> 
</head><body 
>
   <div class="crosslinks"><p class="noindent">[<a 
href="thesisse13.xml" >next</a>] [<a 
href="thesisse11.xml" >prev</a>] [<a 
href="thesisse11.xml#tailthesisse11.xml" >prev-tail</a>] [<a 
href="#tailthesisse12.xml">tail</a>] [<a 
href="thesisch3.xml#thesisse12.xml" >up</a>] </p></div>
   <h3 class="sectionHead"><span class="titlemark">3.4. </span> <a 
  name="x18-260003.4"></a>Converting to a P-value approach</h3>
<!--l. 752--><p class="noindent">When making judgments about which hypothesis to choose, the relevant quantity is <span 
class="ecti-1000">not </span>the
probability of error as we calculate above. Instead, it is a bound on the true error rate
which holds with high probability over draws of the sample set. We might decide that <!--l. 755--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>&#x03B4;</mi> <mo 
class="MathClass-rel">=</mo> <mn>0</mn><mo 
class="MathClass-punc">.</mo><mn>0</mn><mn>5</mn></mrow></math>
was an acceptable rate of bound failure and then ask ourselves,
&#x201C;What is a bound on the true error rate that holds with probability <!--l. 757--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mn>0</mn><mo 
class="MathClass-punc">.</mo><mn>9</mn></mrow></math>?&#x201D;
</p><!--l. 759--><p class="indent">   Functionally, instead of calculating: <!--l. 760--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">
<mrow 
>
                            <!--mstyle 
class="text"--><mtext class="textrm">Bin</mtext><!--/mstyle--><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>m</mi><mo 
class="MathClass-punc">,</mo><mi 
>k</mi><mo 
class="MathClass-punc">,</mo><mi 
>p</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo> <mi 
>&#x03B4;</mi>
</mrow></math> we want to invert the
output, <!--l. 762--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>&#x03B4;</mi></mrow></math>, with
respect to the input, <!--l. 762--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mi 
>p</mi></mrow></math>.
Since <!--l. 763--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>p</mi></mrow></math>
and <!--l. 763--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>&#x03B4;</mi></mrow></math>
are monotonically related to each other, this inversion can be defined as: <!--l. 765--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">
                                                                     

                                                                     
<mrow 
>
                  <mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0304;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>m</mi><mo 
class="MathClass-punc">,</mo><mi 
>k</mi><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2261;</mo><msub><mrow 
><mo 
> max</mo></mrow><mrow 
><mi 
>p</mi></mrow></msub 
> <mfenced separators="" 
open="{"  close="}" ><mrow><mi 
>p</mi> <mo 
class="MathClass-punc">:</mo>  <!--mstyle 
class="text"--><mtext class="textrm">Bin</mtext><!--/mstyle--><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>m</mi><mo 
class="MathClass-punc">,</mo><mi 
>k</mi><mo 
class="MathClass-punc">,</mo><mi 
>p</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo> <mi 
>&#x03B4;</mi></mrow></mfenced>
</mrow></math>
</p><!--l. 769--><p class="indent">   What is the interpretation of <!--l. 769--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0304;</mo></mover></mrow></math>?
The inversion <!--l. 769--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0304;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>m</mi><mo 
class="MathClass-punc">,</mo><mi 
>k</mi><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
is a high confidence bound on the true error rate. With probability at least <!--l. 771--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mn>1</mn> <mo 
class="MathClass-bin">&#x2212;</mo> <mi 
>&#x03B4;</mi></mrow></math>
(over the draws of the examples), the true error rate will be less than <!--l. 772--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0304;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>m</mi><mo 
class="MathClass-punc">,</mo><mi 
>k</mi><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>. This
is exactly the kind of quantity that we desire in making decisions about which
hypothesis is more desirable. This calculation has been done for several values of <!--l. 774--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>m</mi></mrow></math> and
Binomial tail bounds in figure ( <a 
href="#x18-260021">3.4.1<!--tex4ht:ref: fig-bounds --></a>).
</p><!--l. 777--><p class="indent">   We will use the process of inversion in many places. The fundamental soundness of
inversion rests upon the following lemma.
</p>
   <div class="newtheorem">
<!--l. 780--><p class="noindent"><span class="head">
<a 
  name="x18-26001r1"></a>
  <span 
class="eccc-1000">L<small 
class="small-caps">E</small><small 
class="small-caps">M</small><small 
class="small-caps">M</small><small 
class="small-caps">A</small> </span>3.4.1<span 
class="eccc-1000">.</span></span>
</p><!--l. 781--><p class="indent">   <span 
class="ecti-1000">(Inversion           Lemma)           For           all           predicates,           </span><!--l. 781--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>&#x03C6;</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>X</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
<!--l. 782--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">
<mrow 
>
            <msub><mrow 
><mo 
>sup</mo></mrow><mrow 
><mi 
>P</mi></mrow></msub 
><msub><mrow 
><mo 
> Pr</mo></mrow><mrow 
><mi 
>X</mi><mo 
class="MathClass-rel">&#x223C;</mo><mi 
>P</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mi 
>&#x03C6;</mi></mrow><mrow 
><mi 
>P</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>X</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B4;</mi> <mo 
class="MathClass-rel">&#x21D2;</mo><mi 
>&#x2200;</mi><mi 
>P</mi> <msub><mrow 
><mo 
>Pr</mo></mrow><mrow 
><mi 
>X</mi><mo 
class="MathClass-rel">&#x223C;</mo><mi 
>P</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mi 
>&#x03C6;</mi></mrow><mrow 
><mi 
>P</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>X</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B4;</mi>
</mrow></math>
                                                                     

                                                                     
</p>
   </div>
<!--l. 786--><p class="indent">   This lemma is the trivial statement that a set of objects is less than the <!--l. 786--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mo 
>sup</mo></mrow></math> over
the set of objects. Nonetheless, it is a very important step which will appeal to implicitly
and explicitly later on.
</p>
   <div class="proof">
<!--l. 791--><p class="indent">   <span class="head">
   <span 
class="eccc-1000">P<small 
class="small-caps">R</small><small 
class="small-caps">O</small><small 
class="small-caps">O</small><small 
class="small-caps">F</small>.</span> </span>By         contradiction.         Assume         there         exists         <!--l. 791--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>P</mi></mrow></math>
such that the right hand side is not satisfied. Then, the left hand side can not be
correct. <span class="qed"><span 
class="msam-10">&#x25AB;</span></span>
</p>
   </div>
<!--l. 794--><p class="indent">   The Inversion lemma allows us to implicitly parameterize <span 
class="ecti-1000">all </span>precision parameters <!--l. 795--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>&#x03B5;</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math> in terms of a fixed
probability of failure, <!--l. 796--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mi 
>&#x03B4;</mi></mrow></math>.
This is important for practical application because it means we can choose our probability of
failure <!--l. 797--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>&#x03B4;</mi></mrow></math>
before looking at any examples. </p><hr class="figure" /><div align="center" class="figure" 
><table class="figure"><tr class="figure"><td class="figure" 
>
                                                                     

                                                                     
<a 
  name="x18-260021"></a>
                                                                     

                                                                     
<!--l. 800--><p class="noindent"><img 
src="thesis1x.gif" alt="PIC" class="graphics" width="279.04251pt" height="397.48499pt"  /><!--tex4ht:graphics  
name="thesis1x.gif" src="bound_plot.ps"  
-->
<br /> </p><div align="center" class="caption"><table class="caption" 
><tr valign="baseline" class="caption"><td class="id">Figure&#x00A0;3.4.1:  </td><td  
class="content"><a 
  name="x18-260021"></a>A  plot  of  the  difference  between  the  true  error  upper  bound
and the empirical error for various Binomial tail bound approximations. Here <!--l. 812--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>m</mi> <mo 
class="MathClass-rel">=</mo> <mn>1</mn><mn>0</mn><mn>0</mn><mn>0</mn></mrow></math>
coins    of    an    unknown    bias    are    used    and    a    confidence    of    <!--l. 812--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>&#x03B4;</mi> <mo 
class="MathClass-rel">=</mo> <mn>0</mn><mo 
class="MathClass-punc">.</mo><mn>1</mn></mrow></math>
and    the    horizontal    axis    is    the    number    of    errors    between    <!--l. 812--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>k</mi> <mo 
class="MathClass-rel">=</mo> <mn>0</mn></mrow></math>
and <!--l. 812--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>k</mi> <mo 
class="MathClass-rel">=</mo> <mn>1</mn><mn>0</mn><mn>0</mn><mn>0</mn></mrow></math>.
The  relative  entropy  approximation  to  the  Binomial  tail  is  relatively  well
behaved   while   the   Hoeffding   approximation   is   not.   In   particular,   the
Hoeffding  approximation  does  not  take  into  account  the  decreased  variance
of  low  bias  Binomials.  Note  that  the  dip  at  the  end  of  the  Hoeffding
bound  is  due  to  the  fact  that  the  true  error  rate  is  <span 
class="ecti-1000">always  </span>less  than  <!--l. 812--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mn>1</mn></mrow></math>.</td></tr></table></div><!--tex4ht:label?: x18-260021 -->
                                                                     

                                                                     
   </td></tr></table></div><hr class="endfigure" />
<!--l. 817--><p class="indent">
                                                                     

                                                                     
</p>
   <div class="crosslinks"><p class="noindent">[<a 
href="thesisse13.xml" >next</a>] [<a 
href="thesisse11.xml" >prev</a>] [<a 
href="thesisse11.xml#tailthesisse11.xml" >prev-tail</a>] [<a 
href="thesisse12.xml" >front</a>] [<a 
href="thesisch3.xml#thesisse12.xml" >up</a>] </p></div><a 
  name="tailthesisse12.xml"></a>  
</body> 
</html> 
