<?xml version="1.0"?> 
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "mathml.dtd"> 
<?xml-stylesheet type="text/css" href="thesis.css"?> 
<html  
xmlns="http://www.w3.org/1999/xhtml"  
><head><title>10.1 Progressive Validation Technique</title> 
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1" /> 
<meta name="generator" content="TeX4ht (http://www.cis.ohio-state.edu/~gurari/TeX4ht/mn.html)" /> 
<meta name="originator" content="TeX4ht (http://www.cis.ohio-state.edu/~gurari/TeX4ht/mn.html)" /> 
<!-- 3,early_,early^,xhtml,mozilla --> 
<meta name="src" content="thesis.tex" /> 
<meta name="date" content="2002-08-28 13:56:00" /> 
<link rel="stylesheet" type="text/css" href="thesis.css" /> 
</head><body 
>
   <div class="crosslinks"><p class="noindent">[<a 
href="thesisse45.xml" >next</a>] [<a 
href="#tailthesisse44.xml">tail</a>] [<a 
href="thesisch10.xml#thesisse44.xml" >up</a>] </p></div>
   <h3 class="sectionHead"><span class="titlemark">10.1. </span> <a 
  name="x62-8800010.1"></a>Progressive Validation Technique</h3>
<!--l. 3808--><p class="noindent">Progressive validation is a technique which allows you to use almost <!--l. 3808--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mfrac><mrow 
><mn>1</mn></mrow>
<mrow 
><mn>2</mn></mrow></mfrac></mrow></math> of the
data in a holdout set for training purposes while still providing the same guarantee as
the holdout bound. It first appeared in <span class="cite">[<a 
href="thesisli2.xml#XProgressive"><span 
class="ecbx-1000">3</span></a>]</span> and is discussed in a more refined and detailed
form here.
</p><!--l. 3813--><p class="indent">   Suppose that you have a training set of size <!--l. 3813--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msub><mrow 
><mi 
>m</mi></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">train</mtext><!--/mstyle--></mrow></msub 
></mrow></math> and test set
of size <!--l. 3814--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><msub><mrow 
><mi 
>m</mi></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">pv</mtext><!--/mstyle--></mrow></msub 
></mrow></math>.
Progressive validation starts by first learning a hypothesis on the training set and then testing
on the first example of the test set. Then, we train on training set plus the first example
of the test set and test on the second example of the test set. The process continues <!--l. 3817--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msub><mrow 
><mi 
>m</mi></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">test</mtext><!--/mstyle--></mrow></msub 
></mrow></math> iterations. Let <!--l. 3818--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>m</mi></mrow></math> abbreviate <!--l. 3818--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msub><mrow 
><mi 
>m</mi></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">pv</mtext><!--/mstyle--></mrow></msub 
></mrow></math>. Then, we have <!--l. 3818--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>m</mi></mrow></math> hypotheses, <!--l. 3819--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msub><mrow 
><mi 
>h</mi></mrow><mrow 
><mn>1</mn></mrow></msub 
><mo 
class="MathClass-punc">,</mo><mo 
class="MathClass-punc">.</mo><mo 
class="MathClass-punc">.</mo><mo 
class="MathClass-punc">.</mo><mo 
class="MathClass-punc">,</mo><msub><mrow 
><mi 
>h</mi></mrow><mrow 
><mi 
>m</mi></mrow></msub 
></mrow></math> and <!--l. 3819--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>m</mi></mrow></math> error observations,
<!--l. 3819--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">     <mrow 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mn>1</mn></mrow></msub 
><mo 
class="MathClass-punc">,</mo><mo 
class="MathClass-punc">.</mo><mo 
class="MathClass-punc">.</mo><mo 
class="MathClass-punc">.</mo><mo 
class="MathClass-punc">,</mo><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>m</mi></mrow></msub 
></mrow></math>. The
hypothesis output by progressive validation is the randomized hypothesis which chooses uniformly
from <!--l. 3821--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><msub><mrow 
><mi 
>h</mi></mrow><mrow 
><mn>1</mn></mrow></msub 
><mo 
class="MathClass-punc">,</mo><mo 
class="MathClass-punc">.</mo><mo 
class="MathClass-punc">.</mo><mo 
class="MathClass-punc">.</mo><mo 
class="MathClass-punc">,</mo><msub><mrow 
><mi 
>h</mi></mrow><mrow 
><mi 
>m</mi></mrow></msub 
></mrow></math>
and evaluates to get an estimated output. Note that this protocol is similar to those in
<span class="cite">[<a 
href="thesisli2.xml#XLittlestone89"><span 
class="ecbx-1000">36</span></a>]</span> and the new thing here is an analysis of performance.
</p><!--l. 3825--><p class="indent">   Since we are randomizing over hypotheses trained on <!--l. 3825--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msub><mrow 
><mi 
>m</mi></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">train</mtext><!--/mstyle--></mrow></msub 
></mrow></math> to <!--l. 3826--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msub><mrow 
><mi 
>m</mi></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">train</mtext><!--/mstyle--></mrow></msub 
> <mo 
class="MathClass-bin">+</mo> <msub><mrow 
><mi 
>m</mi></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">pv</mtext><!--/mstyle--></mrow></msub 
> <mo 
class="MathClass-bin">&#x2212;</mo> <mn>1</mn></mrow></math>
examples, the expected number of examples used by any hypothesis is <!--l. 3827--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msub><mrow 
><mi 
>m</mi></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">train</mtext><!--/mstyle--></mrow></msub 
> <mo 
class="MathClass-bin">+</mo> <mfrac><mrow 
><msub><mrow 
><mi 
>m</mi></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">pv</mtext><!--/mstyle--></mrow></msub 
><mo 
class="MathClass-bin">&#x2212;</mo><mn>1</mn></mrow> 
     <mrow 
><mn>2</mn></mrow></mfrac>     </mrow></math>.
Given that training can exhibit phase transitions, the extra few examples can greatly
improve the accuracy of the trained example.
</p><!--l. 3831--><p class="indent">   Viewed as an interactive proof of learning, the progressive validation technique
follows the protocol of figure  <a 
href="#x62-880011">10.1.1<!--tex4ht:ref: fig-pv-protocol --></a>.
</p>
   <hr class="figure" /><div align="center" class="figure" 
><table class="figure"><tr class="figure"><td class="figure" 
>
                                                                     

                                                                     
<a 
  name="x62-880011"></a>
<!--l. 3835--><p class="indent">
                                                                     

                                                                     
</p><!--l. 3835--><p class="noindent"><img 
src="thesis15x.gif" alt="PIC" class="graphics" width="677.53125pt" height="444.66124pt"  /><!--tex4ht:graphics  
name="thesis15x.gif" src="thesis-presentation/progressive_validation.eps"  
-->
<br /> </p><div align="center" class="caption"><table class="caption" 
><tr valign="baseline" class="caption"><td class="id">Figure&#x00A0;10.1.1: </td><td  
class="content"><a 
  name="x62-880011"></a> The progressive validation protocol has a learner repeatedly commit
to a hypothesis before it is given a new example. Based upon the test errors, a bound
on the true error rate of the metahypothesis which chooses randomly from each of <!--l. 3842--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mi 
>h</mi></mrow><mrow 
><mn>1</mn></mrow></msub 
><mo 
class="MathClass-punc">,</mo><mo 
class="MathClass-punc">.</mo><mo 
class="MathClass-punc">.</mo><mo 
class="MathClass-punc">.</mo><mo 
class="MathClass-punc">,</mo><msub><mrow 
><mi 
>h</mi></mrow><mrow 
><mi 
>m</mi></mrow></msub 
></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
before each evaluation is provided. </td></tr></table></div><!--tex4ht:label?: x62-880011 -->
                                                                     

                                                                     
   </td></tr></table></div><hr class="endfigure" />
<!--l. 3846--><p class="indent">   The true error rate of this randomized hypothesis will be: <!--l. 3847--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">
<mrow 
>
                       <msub><mrow 
><mi 
>e</mi></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">pv</mtext><!--/mstyle--></mrow></msub 
> <mo 
class="MathClass-rel">=</mo>   <mfrac><mrow 
><mn>1</mn></mrow> 
<mrow 
><msub><mrow 
><mi 
>m</mi></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">pv</mtext><!--/mstyle--></mrow></msub 
></mrow></mfrac><msubsup><mrow 
> <mo 
class="MathClass-op">&#x2211;</mo>
      </mrow><mrow 
><mi 
>i</mi><mo 
class="MathClass-rel">=</mo><mn>1</mn></mrow><mrow 
><msub><mrow 
><mi 
>m</mi></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">pv</mtext><!--/mstyle--></mrow></msub 
></mrow></msubsup 
><mi 
>e</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mi 
>h</mi></mrow><mrow 
>
<mi 
>i</mi></mrow></msub 
></mrow><mo 
class="MathClass-close">)</mo></mrow>
</mrow></math> where
<!--l. 3849--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">     <mrow 
><mi 
>e</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mi 
>h</mi></mrow><mrow 
><mi 
>i</mi></mrow></msub 
></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo><msub><mrow 
><mo 
> Pr</mo></mrow><mrow 
><mi 
>D</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mi 
>h</mi></mrow><mrow 
><mi 
>i</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>x</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-rel">&#x2260;</mo><mi 
>y</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
and the empirical error estimate of this randomized hypothesis will be: <!--l. 3851--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">
<mrow 
><msub><mrow 
>
                        <mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">pv</mtext><!--/mstyle--></mrow></msub 
> <mo 
class="MathClass-rel">=</mo>   <mfrac><mrow 
><mn>1</mn></mrow> 
<mrow 
><msub><mrow 
><mi 
>m</mi></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">pv</mtext><!--/mstyle--></mrow></msub 
></mrow></mfrac><msubsup><mrow 
> <mo 
class="MathClass-op">&#x2211;</mo>
      </mrow><mrow 
><mi 
>i</mi><mo 
class="MathClass-rel">=</mo><mn>1</mn></mrow><mrow 
><msub><mrow 
><mi 
>m</mi></mrow><mrow 
><!--mstyle 
class="text"--><mtext class="textrm">pv</mtext><!--/mstyle--></mrow></msub 
></mrow></msubsup 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
>
<mi 
>i</mi></mrow></msub 
>
</mrow></math>
</p><!--l. 3856--><p class="indent">
                                                                     

                                                                     
</p>
   <div class="crosslinks"><p class="noindent">[<a 
href="thesisse45.xml" >next</a>] [<a 
href="thesisse44.xml" >front</a>] [<a 
href="thesisch10.xml#thesisse44.xml" >up</a>] </p></div><a 
  name="tailthesisse44.xml"></a>  
</body> 
</html> 
