<?xml version="1.0"?> 
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "mathml.dtd"> 
<?xml-stylesheet type="text/css" href="thesis.css"?> 
<html  
xmlns="http://www.w3.org/1999/xhtml"  
><head><title>11.3 Approximations in Combinations</title> 
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1" /> 
<meta name="generator" content="TeX4ht (http://www.cis.ohio-state.edu/~gurari/TeX4ht/mn.html)" /> 
<meta name="originator" content="TeX4ht (http://www.cis.ohio-state.edu/~gurari/TeX4ht/mn.html)" /> 
<!-- 3,early_,early^,xhtml,mozilla --> 
<meta name="src" content="thesis.tex" /> 
<meta name="date" content="2002-08-28 13:56:00" /> 
<link rel="stylesheet" type="text/css" href="thesis.css" /> 
</head><body 
>
   <div class="crosslinks"><p class="noindent">[<a 
href="thesisse52.xml" >next</a>] [<a 
href="thesisse50.xml" >prev</a>] [<a 
href="thesisse50.xml#tailthesisse50.xml" >prev-tail</a>] [<a 
href="#tailthesisse51.xml">tail</a>] [<a 
href="thesisch11.xml#thesisse51.xml" >up</a>] </p></div>
   <h3 class="sectionHead"><span class="titlemark">11.3. </span> <a 
  name="x70-9600011.3"></a>Approximations in Combinations</h3>
<!--l. 4286--><p class="noindent">The inexact nature of bounds forces us to impose a monotonic structure on the function <!--l. 4287--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>f</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mi 
>S</mi></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>.
For simplicity, we will restrict to functions of the form <!--l. 4288--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>f</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math> where <!--l. 4288--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math> is the test error on
hypothesis <!--l. 4289--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>h</mi></mrow></math>.
This simplification is not necessary and this technique can be extended to arbitrary test
set based techniques.
</p><!--l. 4292--><p class="indent">   We can consider any upper bound <!--l. 4292--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msub><mrow 
><mi 
>&#x03B8;</mi></mrow><mrow 
><mi 
>D</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math> on the true
error rate, <!--l. 4293--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><msub><mrow 
><mi 
>e</mi></mrow><mrow 
><mi 
>D</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>,
as inducing a cumulative distribution on the test set events.
</p><!--l. 4296--><p class="indent">   This cumulative distribution is <span 
class="ecti-1000">not </span>the cumulative distribution of the
underlying (Binomial) probability. To construct this distribution, let: <!--l. 4298--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">
<mrow 
>
                 <msub><mrow 
><mi 
>F</mi></mrow><mrow 
><mi 
>&#x03B8;</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo><mo 
> inf</mo><mrow><mo 
class="MathClass-open">{</mo><mrow><mi 
>&#x03B4;</mi> <mo 
class="MathClass-punc">:</mo><msub><mrow 
> <mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo>&#x0302;</mo></mover></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2208;</mo> <msub><mrow 
><mi 
>&#x03B8;</mi></mrow><mrow 
><mi 
>D</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">}</mo></mrow>
</mrow></math> Intuitively, <!--l. 4300--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msub><mrow 
><mi 
>F</mi></mrow><mrow 
><mi 
>&#x03B8;</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math> is the smallest <!--l. 4301--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>&#x03B4;</mi></mrow></math> such that the
test error <!--l. 4301--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
is rejected.
</p>
   <div class="newtheorem">
<!--l. 4304--><p class="noindent"><span class="head">
<a 
  name="x70-96001r1"></a>
                                                                     

                                                                     
  <span 
class="eccc-1000">L<small 
class="small-caps">E</small><small 
class="small-caps">M</small><small 
class="small-caps">M</small><small 
class="small-caps">A</small> </span>11.3.1<span 
class="eccc-1000">.</span></span>
</p><!--l. 4305--><p class="indent">   <span 
class="ecti-1000">The                                        function                                        </span><!--l. 4305--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msub><mrow 
><mi 
>F</mi></mrow><mrow 
><mi 
>&#x03B8;</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
<span 
class="ecti-1000">is a cumulative distribution function.</span>
</p>
   </div>
   <div class="proof">
<!--l. 4309--><p class="indent">   <span class="head">
   <span 
class="eccc-1000">P<small 
class="small-caps">R</small><small 
class="small-caps">O</small><small 
class="small-caps">O</small><small 
class="small-caps">F</small>.</span> </span>In order to show that the function is a cumulative distribution function,
we must show that it varies between <!--l. 4310--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mn>0</mn></mrow></math>
and <!--l. 4310--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mn>1</mn></mrow></math>
for all values of <!--l. 4310--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>.
Since <!--l. 4311--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><msub><mrow 
><mi 
>&#x03B8;</mi></mrow><mrow 
><mi 
>D</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
is an upper bound, the following inequality holds: <!--l. 4313--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">      <mrow 
>
                     <mi 
>&#x2200;</mi><msub><mrow 
><mi 
>S</mi></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
> <msub><mrow 
><mi 
>F</mi></mrow><mrow 
><mi 
>&#x03B8;</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2265;</mo><!--mstyle 
class="text"--><mtext class="textrm">Bin</mtext><!--/mstyle--><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>m</mi><mo 
class="MathClass-punc">,</mo><mi 
>m</mi> <mo 
class="MathClass-bin">&#x2217;</mo><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-punc">,</mo><msub><mrow 
><mi 
>e</mi></mrow><mrow 
><mi 
>D</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow>
</mrow></math>
This inequality implies the value of <!--l. 4315--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><msub><mrow 
><mi 
>F</mi></mrow><mrow 
><mi 
>&#x03B8;</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
is always at least as large as the CDF of the underlying Binomial distribution. Note,
that different <!--l. 4317--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><msub><mrow 
><mi 
>S</mi></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
></mrow></math>
are implicitly aliased under this technique. We also have the inequality <!--l. 4318--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><msub><mrow 
><mi 
>F</mi></mrow><mrow 
><mi 
>&#x03B8;</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mn>1</mn></mrow></math>
because all true error rate upper bounds are vacuous above a true error rate bound of
<!--l. 4320--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">    <mrow 
><mn>1</mn></mrow></math>.
<span class="qed"><span 
class="msam-10">&#x25AB;</span></span>
</p>
   </div>
<!--l. 4322--><p class="indent">   We have shown that <!--l. 4322--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><msub><mrow 
><mi 
>F</mi></mrow><mrow 
><mi 
>&#x03B8;</mi></mrow></msub 
></mrow></math>
is a cumulative distribution function over the value of the empirical error. Given the upper bound cumulative, <!--l. 4323--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
                                                                     

                                                                     
<mrow 
><msub><mrow 
><mi 
>F</mi></mrow><mrow 
><mi 
>&#x03B8;</mi></mrow></msub 
></mrow></math>, we can look at distributions
satisfying: <!--l. 4325--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">       <mrow 
>
                     <msub><mrow 
><mi 
>E</mi></mrow><mrow 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
>
<!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-rel">&#x223C;</mo><msub><mrow 
><mi 
>F</mi></mrow><mrow 
><mi 
>&#x03B8;</mi></mrow></msub 
></mrow></msub 
><msub><mrow 
><mo 
> Pr</mo></mrow><mrow 
><mi 
>S</mi><mo 
class="MathClass-rel">&#x223C;</mo><msup><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>m</mi></mrow></msup 
></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>S</mi> <mo 
class="MathClass-rel">&#x2208;</mo> <mi 
>&#x03C6;</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>f</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo>&#x0302;</mo></mover></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-bin">&#x2217;</mo> <mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B4;</mi>
</mrow></math> If we
are guaranteed that <!--l. 4327--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mi 
>f</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
decreases monotonically then equation  <a 
href="thesisse50.xml#x69-95002r2">11.2.2<!--tex4ht:ref: eq-pr_randomized --></a> will hold. This is the essence of our
theorem.
</p>
   <div class="newtheorem">
<!--l. 4331--><p class="noindent"><span class="head">
<a 
  name="x70-96002r2"></a>
  <span 
class="eccc-1000">T<small 
class="small-caps">H</small><small 
class="small-caps">E</small><small 
class="small-caps">O</small><small 
class="small-caps">R</small><small 
class="small-caps">E</small><small 
class="small-caps">M</small> </span>11.3.2<span 
class="eccc-1000">.</span></span>
</p><!--l. 4332--><p class="indent">   <span 
class="ecti-1000">(Approximate test and train bound) Let </span><!--l. 4332--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><msub><mrow 
><mi 
>&#x03C6;</mi></mrow><mrow 
><mi 
>D</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
<span 
class="ecti-1000">be any bound satisfying </span><!--l. 4333--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><msub><mrow 
><mo 
>Pr</mo></mrow><mrow 
><mi 
>S</mi><mo 
class="MathClass-rel">&#x223C;</mo><msup><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>m</mi></mrow></msup 
></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>S</mi> <mo 
class="MathClass-rel">&#x2208;</mo> <msub><mrow 
><mi 
>&#x03C6;</mi></mrow><mrow 
><mi 
>D</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B4;</mi></mrow></math><span 
class="ecti-1000">.</span>
<span 
class="ecti-1000">Let </span><!--l. 4334--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>f</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
<span 
class="ecti-1000">be any monotonic decreasing function satisfying: </span><!--l. 4336--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">      <mrow 
>
                   <msub><mrow 
><mi 
>E</mi></mrow><mrow 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
>
<!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-rel">&#x223C;</mo><msub><mrow 
><mi 
>F</mi></mrow><mrow 
><mi 
>&#x03B8;</mi></mrow></msub 
></mrow></msub 
><msub><mrow 
><mo 
> Pr</mo></mrow><mrow 
><mi 
>S</mi><mo 
class="MathClass-rel">&#x223C;</mo><msup><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>m</mi></mrow></msup 
></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>S</mi> <mo 
class="MathClass-rel">&#x2208;</mo> <msub><mrow 
><mi 
>&#x03C6;</mi></mrow><mrow 
><mi 
>D</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>f</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo>&#x0302;</mo></mover></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-bin">&#x2217;</mo> <mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B4;</mi>
</mrow></math>
                                                                     

                                                                     
<span 
class="ecti-1000">, then: </span><!--l. 4339--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">      <mrow 
>
                    <msub><mrow 
><mo 
>Pr</mo></mrow><mrow 
><mi 
>S</mi><mo 
class="MathClass-punc">,</mo><msub><mrow 
><mi 
>S</mi></mrow><mrow 
>
<!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
><mo 
class="MathClass-rel">&#x223C;</mo><msup><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>m</mi><mo 
class="MathClass-bin">+</mo><msub><mrow 
><mi 
>m</mi></mrow><mrow 
>
<!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
></mrow></msup 
></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>S</mi> <mo 
class="MathClass-rel">&#x2208;</mo> <msub><mrow 
><mi 
>&#x03C6;</mi></mrow><mrow 
><mi 
>D</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>f</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo>&#x0302;</mo></mover></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-bin">&#x2217;</mo> <mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B4;</mi>
</mrow></math>
</p>
   </div>
   <div class="proof">
<!--l. 4344--><p class="indent">   <span class="head">
   <span 
class="eccc-1000">P<small 
class="small-caps">R</small><small 
class="small-caps">O</small><small 
class="small-caps">O</small><small 
class="small-caps">F</small>.</span> </span>Note that <!--l. 4344--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><msub><mrow 
><mo 
>Pr</mo></mrow><mrow 
><mi 
>S</mi><mo 
class="MathClass-rel">&#x223C;</mo><msup><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>m</mi></mrow></msup 
></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>S</mi> <mo 
class="MathClass-rel">&#x2208;</mo> <msub><mrow 
><mi 
>&#x03C6;</mi></mrow><mrow 
><mi 
>D</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>f</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo>&#x0302;</mo></mover></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-bin">&#x2217;</mo> <mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
is a monotonic decreasing function of <!--l. 4345--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>.
For any monotonic decreasing function <!--l. 4346--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>g</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
and any two cumulative distribution functions <!--l. 4347--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><msub><mrow 
><mi 
>F</mi></mrow><mrow 
><mn>1</mn></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
and <!--l. 4347--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><msub><mrow 
><mi 
>F</mi></mrow><mrow 
><mn>2</mn></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
satisfying <!--l. 4347--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>&#x2200;</mi><mi 
>x</mi> <msub><mrow 
><mi 
>F</mi></mrow><mrow 
><mn>1</mn></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <msub><mrow 
><mi 
>F</mi></mrow><mrow 
><mn>2</mn></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
we have: <!--l. 4349--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">      <mrow 
>
                                  <msub><mrow 
><mi 
>E</mi></mrow><mrow 
><mi 
>x</mi><mo 
class="MathClass-rel">&#x223C;</mo><msub><mrow 
><mi 
>F</mi></mrow><mrow 
><mn>1</mn></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></msub 
><mi 
>g</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <msub><mrow 
><mi 
>E</mi></mrow><mrow 
><mi 
>x</mi><mo 
class="MathClass-rel">&#x223C;</mo><msub><mrow 
><mi 
>F</mi></mrow><mrow 
><mn>2</mn></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></msub 
><mi 
>g</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow>
</mrow></math>
Let <!--l. 4351--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>F</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
be the cumulative distribution of the Binomial and note that the definition of a
bound implies: <!--l. 4352--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>&#x2200;</mi><mi 
>x</mi> <msub><mrow 
><mi 
>F</mi></mrow><mrow 
><mi 
>&#x03B8;</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2265;</mo> <mi 
>F</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>.
                                                                     

                                                                     
Applying these inequalities, we get: <!--l. 4354--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">      <mrow 
>
                      <mi 
>&#x03B4;</mi> <mo 
class="MathClass-rel">&#x2265;</mo> <msub><mrow 
><mi 
>E</mi></mrow><mrow 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
>
<!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
><mo 
class="MathClass-rel">&#x223C;</mo><msub><mrow 
><mi 
>F</mi></mrow><mrow 
><mi 
>&#x03B8;</mi></mrow></msub 
></mrow></msub 
><msub><mrow 
><mo 
> Pr</mo></mrow><mrow 
><mi 
>S</mi><mo 
class="MathClass-rel">&#x223C;</mo><msup><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>m</mi></mrow></msup 
></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>S</mi> <mo 
class="MathClass-rel">&#x2208;</mo> <msub><mrow 
><mi 
>&#x03C6;</mi></mrow><mrow 
><mi 
>D</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>f</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo>&#x0302;</mo></mover></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-bin">&#x2217;</mo> <mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow>
</mrow></math>
<!--l. 4356--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">    <mrow 
>
                       <mo 
class="MathClass-rel">&#x2265;</mo> <msub><mrow 
><mi 
>E</mi></mrow><mrow 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
>
<!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
><mo 
class="MathClass-rel">&#x223C;</mo><mi 
>F</mi></mrow></msub 
><msub><mrow 
><mo 
> Pr</mo></mrow><mrow 
><mi 
>S</mi><mo 
class="MathClass-rel">&#x223C;</mo><msup><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>m</mi></mrow></msup 
></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>S</mi> <mo 
class="MathClass-rel">&#x2208;</mo> <msub><mrow 
><mi 
>&#x03C6;</mi></mrow><mrow 
><mi 
>D</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>f</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo>&#x0302;</mo></mover></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-bin">&#x2217;</mo> <mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow>
</mrow></math>
Given this, an application of theorem  <a 
href="thesisse50.xml#x69-95003r1">11.2.1<!--tex4ht:ref: th-tnt --></a> completes the proof. <span class="qed"><span 
class="msam-10">&#x25AB;</span></span>
</p>
   </div>
<!--l. 4360--><p class="indent">   The only constraint that we must check in applying a combined training
and test set bound is the monotonicity constraint. Heuristically, this is
satisfied for the functions graphed in the pictures of techniques (1)-(3)
because the set of excluded events increases monotonically along the <!--l. 4363--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>x</mi></mrow></math> axis as it decreases
along the <!--l. 4364--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>y</mi></mrow></math>
axis.
</p><!--l. 4366--><p class="indent">   An explicit mathematical form for technique (3) can be
given by considering the bound-based cumulative distributions, <!--l. 4367--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msub><mrow 
><mi 
>F</mi></mrow><mrow 
><mi 
>&#x03B8;</mi></mrow></msub 
></mrow></math>, and a similar distribution
for the training set, <!--l. 4368--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><msub><mrow 
><mi 
>F</mi></mrow><mrow 
><mi 
>&#x03C6;</mi></mrow></msub 
></mrow></math>.
In particular, we can define the rejection region to be <!--l. 4369--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mrow><mo 
class="MathClass-open">{</mo><mrow><msub><mrow 
><mi 
>S</mi></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
><mo 
class="MathClass-punc">,</mo><mi 
>S</mi> <mo 
class="MathClass-punc">:</mo> <msub><mrow 
><mi 
>F</mi></mrow><mrow 
><mi 
>&#x03B8;</mi></mrow></msub 
><msub><mrow 
><mi 
>F</mi></mrow><mrow 
><mi 
>&#x03C6;</mi></mrow></msub 
> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>t</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">}</mo></mrow></mrow></math> where <!--l. 4370--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>t</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math> is a function
satisfying <!--l. 4370--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><msub><mrow 
><mo 
>Pr</mo></mrow><mrow 
><msub><mrow 
><mi 
>S</mi></mrow><mrow 
>
<!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
><mo 
class="MathClass-punc">,</mo><mi 
>S</mi><mo 
class="MathClass-rel">&#x223C;</mo><msub><mrow 
><mi 
>F</mi></mrow><mrow 
><mi 
>&#x03B8;</mi></mrow></msub 
><mo 
class="MathClass-punc">,</mo><msub><mrow 
><mi 
>F</mi></mrow><mrow 
><mi 
>&#x03C6;</mi></mrow></msub 
></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mi 
>F</mi></mrow><mrow 
><mi 
>&#x03B8;</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo>&#x0302;</mo></mover></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow><msub><mrow 
><mi 
>F</mi></mrow><mrow 
><mi 
>&#x03C6;</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>S</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>t</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B4;</mi></mrow></math>.
The monotonic constraint is satisfied by this construction because <!--l. 4371--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msub><mrow 
><mi 
>F</mi></mrow><mrow 
><mi 
>&#x03B8;</mi></mrow></msub 
></mrow></math> is implicitly monotonic
decreasing with <!--l. 4372--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><msub><mrow 
><mi 
>F</mi></mrow><mrow 
><mi 
>&#x03C6;</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>S</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
                                                                     

                                                                     
given a constant <!--l. 4373--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mi 
>t</mi></mrow></math>.
We will use technique (3) for combining training and test sets in the experiments.
Appendix section  <a 
href="thesisse63.xml#x89-13200016.4">16.4<!--tex4ht:ref: sec-combined-bound-calc --></a> details the programming interface to calculate this bound.
</p><!--l. 4378--><p class="indent">
                                                                     

                                                                     
</p>
   <div class="crosslinks"><p class="noindent">[<a 
href="thesisse52.xml" >next</a>] [<a 
href="thesisse50.xml" >prev</a>] [<a 
href="thesisse50.xml#tailthesisse50.xml" >prev-tail</a>] [<a 
href="thesisse51.xml" >front</a>] [<a 
href="thesisch11.xml#thesisse51.xml" >up</a>] </p></div><a 
  name="tailthesisse51.xml"></a>  
</body> 
</html> 
