<?xml version="1.0"?> 
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "mathml.dtd"> 
<?xml-stylesheet type="text/css" href="thesis.css"?> 
<html  
xmlns="http://www.w3.org/1999/xhtml"  
><head><title>11.2 General Approaches for Combined Bounds</title> 
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1" /> 
<meta name="generator" content="TeX4ht (http://www.cis.ohio-state.edu/~gurari/TeX4ht/mn.html)" /> 
<meta name="originator" content="TeX4ht (http://www.cis.ohio-state.edu/~gurari/TeX4ht/mn.html)" /> 
<!-- 3,early_,early^,xhtml,mozilla --> 
<meta name="src" content="thesis.tex" /> 
<meta name="date" content="2002-08-28 13:56:00" /> 
<link rel="stylesheet" type="text/css" href="thesis.css" /> 
</head><body 
>
   <div class="crosslinks"><p class="noindent">[<a 
href="thesisse51.xml" >next</a>] [<a 
href="thesisse49.xml" >prev</a>] [<a 
href="thesisse49.xml#tailthesisse49.xml" >prev-tail</a>] [<a 
href="#tailthesisse50.xml">tail</a>] [<a 
href="thesisch11.xml#thesisse50.xml" >up</a>] </p></div>
   <h3 class="sectionHead"><span class="titlemark">11.2. </span> <a 
  name="x69-9500011.2"></a>General Approaches for Combined Bounds</h3>
<!--l. 4226--><p class="noindent">Showing that a more general technique works must start with a discussion of confidence
intervals. Fundamentally, a bound can be viewed as a set of outcomes. Let <!--l. 4228--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>X</mi></mrow></math> be space of outcomes,
then a bound <!--l. 4228--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>&#x03C6;</mi> <mo 
class="MathClass-rel">&#x2286;</mo> <mi 
>X</mi></mrow></math>
is a subset. The probability that this generalized bound is violated is given by: <!--l. 4230--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">
<mrow 
>
                            <msub><mrow 
><mo 
>Pr</mo></mrow><mrow 
><mi 
>x</mi><mo 
class="MathClass-rel">&#x223C;</mo><mi 
>P</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi> <mo 
class="MathClass-rel">&#x2208;</mo> <mi 
>&#x03C6;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow>
</mrow></math> Typically, we
parameterize <!--l. 4232--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>&#x03C6;</mi></mrow></math>
with both <!--l. 4232--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>P</mi></mrow></math>
and <!--l. 4232--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>&#x03B4;</mi></mrow></math> to get: <!--l. 4234--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">
<mrow 
>
                         <msub><mrow 
><mo 
>Pr</mo></mrow><mrow 
><mi 
>x</mi><mo 
class="MathClass-rel">&#x223C;</mo><mi 
>P</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi> <mo 
class="MathClass-rel">&#x2208;</mo> <msub><mrow 
><mi 
>&#x03C6;</mi></mrow><mrow 
><mi 
>P</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B4;</mi>
</mrow></math> We can expand
the definition of a high probability set to include a <span 
class="ecti-1000">randomized </span>high probability set. In particular, let <!--l. 4237--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
                                                                     

                                                                     
<mrow 
><msub><mrow 
><mi 
>&#x03C6;</mi></mrow><mrow 
><mi 
>P</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>w</mi><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math> satisfy: <!--l. 4238--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">
<mrow 
>
                     <mi 
>&#x2200;</mi><mi 
>w</mi> <mo 
class="MathClass-punc">:</mo>  <msub><mrow 
><mo 
>Pr</mo></mrow><mrow 
><mi 
>x</mi><mo 
class="MathClass-rel">&#x223C;</mo><mi 
>P</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi> <mo 
class="MathClass-rel">&#x2208;</mo> <msub><mrow 
><mi 
>&#x03C6;</mi></mrow><mrow 
><mi 
>P</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>w</mi><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B4;</mi>
</mrow></math> Then, statement
such as: <!--l. 4241--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">        <mrow 
>
                                  <msub><mrow 
><mo 
>Pr</mo></mrow><mrow 
><mi 
>w</mi><mo 
class="MathClass-rel">&#x223C;</mo><mi 
>Q</mi><mo 
class="MathClass-punc">,</mo><mi 
>x</mi><mo 
class="MathClass-rel">&#x223C;</mo><mi 
>P</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi> <mo 
class="MathClass-rel">&#x2208;</mo> <msub><mrow 
><mi 
>&#x03C6;</mi></mrow><mrow 
><mi 
>P</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>w</mi><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B4;</mi>
</mrow></math>
In fact, we can make a stronger statement. If </p><table class="equation"><tr><td> <a 
  name="x69-95001r1"></a>
<!--l. 4244--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">     
                                 <msub><mrow 
><mi 
>E</mi></mrow><mrow 
><mi 
>w</mi><mo 
class="MathClass-rel">&#x223C;</mo><mi 
>Q</mi></mrow></msub 
><msub><mrow 
><mo 
> Pr</mo></mrow><mrow 
><mi 
>x</mi><mo 
class="MathClass-rel">&#x223C;</mo><mi 
>P</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi> <mo 
class="MathClass-rel">&#x2208;</mo> <msub><mrow 
><mi 
>&#x03C6;</mi></mrow><mrow 
><mi 
>P</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>w</mi><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B4;</mi>
</math>
<!--l. 4247--><p class="nopar"></p></td><td class="eq-no">(11.2.1)</td></tr></table>
then <table class="equation"><tr><td> <a 
  name="x69-95002r2"></a>
                                                                     

                                                                     
<!--l. 4249--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">     
                                  <msub><mrow 
><mo 
>Pr</mo></mrow><mrow 
><mi 
>w</mi><mo 
class="MathClass-rel">&#x223C;</mo><mi 
>Q</mi><mo 
class="MathClass-punc">,</mo><mi 
>x</mi><mo 
class="MathClass-rel">&#x223C;</mo><mi 
>P</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi> <mo 
class="MathClass-rel">&#x2208;</mo> <msub><mrow 
><mi 
>&#x03C6;</mi></mrow><mrow 
><mi 
>P</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>w</mi><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B4;</mi>
</math>
<!--l. 4252--><p class="nopar"></p></td><td class="eq-no">(11.2.2)</td></tr></table>
.
<!--l. 4255--><p class="indent">   Randomized confidence intervals are useful here because we can regard the the draw
of the &#x201C;test&#x201D; set as constructing a randomized interval for the &#x201C;training&#x201D; set. As long
as constraint  <a 
href="#x69-95001r1">11.2.1<!--tex4ht:ref: eq-randomized --></a> is obeyed, the bound will hold with probability at least <!--l. 4258--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>&#x03B4;</mi></mrow></math>. A
P-value Inversion of the bound will then yield the following theorem:
</p>
   <div class="newtheorem">
<!--l. 4261--><p class="noindent"><span class="head">
<a 
  name="x69-95003r1"></a>
  <span 
class="eccc-1000">L<small 
class="small-caps">E</small><small 
class="small-caps">M</small><small 
class="small-caps">M</small><small 
class="small-caps">A</small> </span>11.2.1<span 
class="eccc-1000">.</span></span>
</p><!--l. 4262--><p class="indent">   <span 
class="ecti-1000">(Exact test and train bound) Let </span><!--l. 4262--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><msub><mrow 
><mi 
>S</mi></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
></mrow></math>
<span 
class="ecti-1000">be the set of test examples and </span><!--l. 4263--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>S</mi></mrow></math>
<span 
class="ecti-1000">be the set of training examples. Let </span><!--l. 4263--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><msub><mrow 
><mi 
>&#x03C6;</mi></mrow><mrow 
><mi 
>D</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
<span 
class="ecti-1000">be any bound satisfying </span><!--l. 4264--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><msub><mrow 
><mo 
>Pr</mo></mrow><mrow 
><mi 
>S</mi><mo 
class="MathClass-rel">&#x2208;</mo><msup><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>m</mi></mrow></msup 
></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>S</mi> <mo 
class="MathClass-rel">&#x2208;</mo> <msub><mrow 
><mi 
>&#x03C6;</mi></mrow><mrow 
><mi 
>D</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B4;</mi></mrow></math><span 
class="ecti-1000">.</span>
<span 
class="ecti-1000">Let </span><!--l. 4265--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>f</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mi 
>S</mi></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
<span 
class="ecti-1000">be any function satisfying: </span><!--l. 4265--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><msub><mrow 
><mi 
>E</mi></mrow><mrow 
><msub><mrow 
><mi 
>S</mi></mrow><mrow 
>
<!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
><mo 
class="MathClass-rel">&#x223C;</mo><msup><mrow 
><mi 
>D</mi></mrow><mrow 
><msub><mrow 
><mi 
>m</mi></mrow><mrow 
>
<!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
></mrow></msup 
></mrow></msub 
><msub><mrow 
><mo 
>Pr</mo></mrow><mrow 
><mi 
>S</mi><mo 
class="MathClass-rel">&#x223C;</mo><msup><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>m</mi></mrow></msup 
></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>S</mi> <mo 
class="MathClass-rel">&#x2208;</mo> <msub><mrow 
><mi 
>&#x03C6;</mi></mrow><mrow 
><mi 
>D</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>f</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mi 
>S</mi></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-bin">&#x2217;</mo> <mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B4;</mi></mrow></math>
<span 
class="ecti-1000">, then: </span><!--l. 4267--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">      <mrow 
>
                      <msub><mrow 
><mo 
>Pr</mo></mrow><mrow 
><mi 
>S</mi><mo 
class="MathClass-punc">,</mo><msub><mrow 
><mi 
>S</mi></mrow><mrow 
>
<!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
><mo 
class="MathClass-rel">&#x223C;</mo><msup><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>m</mi><mo 
class="MathClass-bin">+</mo><msub><mrow 
><mi 
>m</mi></mrow><mrow 
>
<!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
></mrow></msup 
></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>S</mi> <mo 
class="MathClass-rel">&#x2208;</mo> <msub><mrow 
><mi 
>&#x03C6;</mi></mrow><mrow 
><mi 
>D</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>f</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mi 
>S</mi></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-bin">&#x2217;</mo> <mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B4;</mi>
</mrow></math>
                                                                     

                                                                     
</p>
   </div>
   <div class="proof">
<!--l. 4272--><p class="indent">   <span class="head">
   <span 
class="eccc-1000">P<small 
class="small-caps">R</small><small 
class="small-caps">O</small><small 
class="small-caps">O</small><small 
class="small-caps">F</small>.</span> </span>Linearity of expectation. <span class="qed"><span 
class="msam-10">&#x25AB;</span></span>
</p>
   </div>
<!--l. 4274--><p class="indent">   There are many possible choices of the function <!--l. 4274--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>f</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mi 
>S</mi></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>, each
of which leads a different combined training and testing bound. Typically, it will be
important to invert this bound into a P-value form where we take a worst case over all <!--l. 4277--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>D</mi></mrow></math> just
as in section  <a 
href="thesisse12.xml#x18-260003.4">3.4<!--tex4ht:ref: sec-pvalue --></a>.
</p><!--l. 4279--><p class="indent">   The functional form of <!--l. 4279--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mi 
>f</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mi 
>S</mi></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
becomes more constrained when we only work with a bound on the test error cumulative
distribution rather than the exact distribution.
</p><!--l. 4284--><p class="indent">
                                                                     

                                                                     
</p>
   <div class="crosslinks"><p class="noindent">[<a 
href="thesisse51.xml" >next</a>] [<a 
href="thesisse49.xml" >prev</a>] [<a 
href="thesisse49.xml#tailthesisse49.xml" >prev-tail</a>] [<a 
href="thesisse50.xml" >front</a>] [<a 
href="thesisch11.xml#thesisse50.xml" >up</a>] </p></div><a 
  name="tailthesisse50.xml"></a>  
</body> 
</html> 
