<?xml version="1.0"?> 
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "mathml.dtd"> 
<?xml-stylesheet type="text/css" href="thesis.css"?> 
<html  
xmlns="http://www.w3.org/1999/xhtml"  
><head><title>10.4 A Quick Experiment</title> 
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1" /> 
<meta name="generator" content="TeX4ht (http://www.cis.ohio-state.edu/~gurari/TeX4ht/mn.html)" /> 
<meta name="originator" content="TeX4ht (http://www.cis.ohio-state.edu/~gurari/TeX4ht/mn.html)" /> 
<!-- 3,early_,early^,xhtml,mozilla --> 
<meta name="src" content="thesis.tex" /> 
<meta name="date" content="2002-08-28 13:56:00" /> 
<link rel="stylesheet" type="text/css" href="thesis.css" /> 
</head><body 
>
   <div class="crosslinks"><p class="noindent">[<a 
href="thesisse48.xml" >next</a>] [<a 
href="thesisse46.xml" >prev</a>] [<a 
href="thesisse46.xml#tailthesisse46.xml" >prev-tail</a>] [<a 
href="#tailthesisse47.xml">tail</a>] [<a 
href="thesisch10.xml#thesisse47.xml" >up</a>] </p></div>
   <h3 class="sectionHead"><span class="titlemark">10.4. </span> <a 
  name="x65-9100010.4"></a>A Quick Experiment</h3>
<!--l. 4031--><p class="noindent">The motivation behind progressive validation is that it allows one to train
on more examples than the hold-out estimate. With the extra examples
training algorithms should be able to choose a better hypothesis. Many
learning problems exhibit thresholding where a small increase in the number of
examples dramatically improves the accuracy of the hypothesis. Consider an <!--l. 4035--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>N</mi></mrow></math>
dimensional feature space in the boolean setting where it is known that one
feature is an exact predictor. Consider the learning algorithm: cross off features
inconsistent with the training data and output the hypothesis that takes a majority
vote over all features remaining. If the example distribution is uniform over <!--l. 4039--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msup><mrow 
><mrow><mo 
class="MathClass-open">{</mo><mrow><mn>0</mn><mo 
class="MathClass-punc">,</mo><mn>1</mn></mrow><mo 
class="MathClass-close">}</mo></mrow></mrow><mrow 
><mi 
>N</mi></mrow></msup 
></mrow></math>, then
this example exhibits a thresholding behavior because the accuracy of the current
hypothesis is almost 50% until the number of consistent features is reduced
to a constant, at which point it quickly increases to 100%. In expectation, <!--l. 4043--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mfrac><mrow 
><mn>1</mn></mrow>
<mrow 
><mn>2</mn></mrow></mfrac></mrow></math> of the
features will be eliminated with each example, leading us to expect a threshold near <!--l. 4044--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mo 
>lg</mo><!--nolimits--><mi 
>N</mi></mrow></math>.
</p><!--l. 4046--><p class="indent">   In our experiments, we built a synthetic data generator which picks a feature uniformly
at random then produces some number of correctly-labeled examples consisting of <!--l. 4048--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>N</mi> <mo 
class="MathClass-rel">=</mo> <mn>1</mn><mn>0</mn><mn>0</mn><mn>0</mn></mrow></math> boolean features,
with <!--l. 4048--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>P</mi><mi 
>r</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><!--mstyle 
class="text"--><mtext class="textrm">true</mtext><!--/mstyle--></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo> <mo 
class="MathClass-punc">.</mo><mn>5</mn></mrow></math>.
The output of this generator was given to the learning algorithm.
</p><!--l. 4051--><p class="indent">   In the first test, we trained on <!--l. 4051--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>m</mi> <mo 
class="MathClass-bin">&#x2212;</mo> <mn>1</mn><mn>0</mn></mrow></math>
examples and tested on <!--l. 4051--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mn>1</mn><mn>0</mn></mrow></math>
examples. In the second test, we trained on <!--l. 4052--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>m</mi> <mo 
class="MathClass-bin">&#x2212;</mo> <mn>1</mn><mn>0</mn></mrow></math>
examples and applied progressive validation to the next <!--l. 4053--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mn>1</mn><mn>0</mn></mrow></math>
examples. We repeated this experiment 1000 times for <!--l. 4054--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mn>1</mn><mn>0</mn> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>m</mi> <mo 
class="MathClass-rel">&#x2264;</mo> <mn>3</mn><mn>0</mn></mrow></math> and
averaged the results in order to get an empirical estimate of the true error of all
hypotheses produced, shown in Figure  <a 
href="#x65-910011">10.4.1<!--tex4ht:ref: fig-pv-results --></a>.
</p>
   <hr class="figure" /><div align="center" class="figure" 
><table class="figure"><tr class="figure"><td class="figure" 
>
                                                                     

                                                                     
<a 
  name="x65-910011"></a>
<!--l. 4059--><p class="indent">
                                                                     

                                                                     
</p><!--l. 4059--><p class="noindent"><img 
src="thesis16x.gif" alt="PIC" class="graphics" width="505.89pt" height="722.7pt"  /><!--tex4ht:graphics  
name="thesis16x.gif" src="plot.ps"  
-->
<br /> </p><div align="center" class="caption"><table class="caption" 
><tr valign="baseline" class="caption"><td class="id">Figure&#x00A0;10.4.1: </td><td  
class="content"><a 
  name="x65-910011"></a> True error vs. training size for hold-out and progressive validation.
Error bars in the figure are using computed by fitting a gaussian to the empirical
mean and variance and are at one standard deviation. </td></tr></table></div><!--tex4ht:label?: x65-910011 -->
                                                                     

                                                                     
   </td></tr></table></div><hr class="endfigure" />
<!--l. 4068--><p class="indent">   As expected, the hold-out&#x2019;s performance was much worse than that of progressive
validation. In general, the degree of improvement in empirical error due to the
progressive validation depends on the learning algorithm. The improvement can be large
if the data set is small or the learning problem exhibits thresholding behavior at some
point past the number of training examples.
</p><!--l. 4074--><p class="indent">   In order to compare the quality of error estimation, we did another set of runs calculating the error
discrepancy <!--l. 4075--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mo 
class="MathClass-rel">&#x2223;</mo></mrow></math>true
error<!--l. 4075--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mo 
class="MathClass-bin">&#x2212;</mo></mrow></math>estimated
error<!--l. 4075--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mo 
class="MathClass-rel">&#x2223;</mo></mrow></math>.
Five training examples were used followed by either progressive validation on ten examples
or evaluation on a hold-out set of size ten. The &#x201C;true error&#x201D; was calculated empirically by
evaluating the resulting hypothesis for each case on another hold-out set of <!--l. 4079--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mn>1</mn><mn>0</mn><mn>0</mn><mn>0</mn><mn>0</mn></mrow></math>
examples. The hold-out estimate on five examples has larger variance then the
progressive validation estimate. One might suspect that this is not due to a good
estimation procedure but due to the fact that it is easier to estimate a lower error. To
investigate this further, we performed a hold-out test which was trained on nine
examples, because the true error of the progressive validation hypothesis with five
training examples and ten progressive validation examples was close to the
true error of a hypothesis trained on nine examples, as shown in the following
table:
</p>
   <div class="tabular"><table class="tabular" 
cellspacing="0pt" cellpadding="0" rules="groups" 
frame="border" id="TBL-8-" ><colgroup id="TBL-8-1g"><col 
id="TBL-8-1" /></colgroup><colgroup id="TBL-8-2g"><col 
id="TBL-8-2" /></colgroup><colgroup id="TBL-8-3g"><col 
id="TBL-8-3" /></colgroup><tr 
class="hline"><td><hr /></td><td><hr /></td><td><hr /></td></tr><tr  
 valign="baseline" id="TBL-8-1-"><td  align="center" nowrap="nowrap" id="TBL-8-1-1"  
class="td11">                                                                                                                                   </td><td  align="center" nowrap="nowrap" id="TBL-8-1-2"  
class="td11">                                                    true error                                                    </td><td  align="center" nowrap="nowrap" id="TBL-8-1-3"  
class="td11"><!--l. 4094--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">         <mrow 
><mo 
class="MathClass-rel">&#x2223;</mo><!--mstyle 
class="text"--><mtext class="textrm">true&#x000A0;error&#x000A0;</mtext><!--/mstyle--> <mo 
class="MathClass-bin">&#x2212;</mo><!--mstyle 
class="text"--><mtext class="textrm">&#x000A0;est</mtext><!--/mstyle--><mo 
class="MathClass-punc">.</mo><mo 
class="MathClass-rel">&#x2223;</mo></mrow></math></td>
</tr><tr 
class="hline"><td><hr /></td><td><hr /></td><td><hr /></td></tr><tr  
 valign="baseline" id="TBL-8-2-"><td  align="center" nowrap="nowrap" id="TBL-8-2-1"  
class="td11">Prog. Val. <!--l. 4096--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">         <mrow 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mn>5</mn><mo 
class="MathClass-punc">,</mo><mn>1</mn><mn>0</mn></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math></td><td  align="center" nowrap="nowrap" id="TBL-8-2-2"  
class="td11"><!--l. 4097--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">         <mrow 
><mo 
class="MathClass-punc">.</mo><mn>2</mn><mn>0</mn><mn>5</mn> <mo 
class="MathClass-bin">&#x00B1;</mo> <mo 
class="MathClass-punc">.</mo><mn>0</mn><mn>0</mn><mn>3</mn></mrow></math></td><td  align="center" nowrap="nowrap" id="TBL-8-2-3"  
class="td11"><!--l. 4098--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">         <mrow 
><mo 
class="MathClass-punc">.</mo><mn>0</mn><mn>8</mn><mn>8</mn> <mo 
class="MathClass-bin">&#x00B1;</mo> <mo 
class="MathClass-punc">.</mo><mn>0</mn><mn>1</mn><mn>1</mn></mrow></math></td>
</tr><tr 
class="hline"><td><hr /></td><td><hr /></td><td><hr /></td></tr><tr  
 valign="baseline" id="TBL-8-3-"><td  align="center" nowrap="nowrap" id="TBL-8-3-1"  
class="td11"> Hold-out <!--l. 4100--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">         <mrow 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mn>5</mn><mo 
class="MathClass-punc">,</mo><mn>1</mn><mn>0</mn></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math> </td><td  align="center" nowrap="nowrap" id="TBL-8-3-2"  
class="td11"><!--l. 4101--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">         <mrow 
><mo 
class="MathClass-punc">.</mo><mn>4</mn><mn>3</mn><mn>6</mn> <mo 
class="MathClass-bin">&#x00B1;</mo> <mo 
class="MathClass-punc">.</mo><mn>0</mn><mn>0</mn><mn>5</mn></mrow></math></td><td  align="center" nowrap="nowrap" id="TBL-8-3-3"  
class="td11"><!--l. 4102--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">         <mrow 
><mo 
class="MathClass-punc">.</mo><mn>1</mn><mn>2</mn><mn>0</mn> <mo 
class="MathClass-bin">&#x00B1;</mo> <mo 
class="MathClass-punc">.</mo><mn>0</mn><mn>1</mn><mn>5</mn></mrow></math></td>
</tr><tr 
class="hline"><td><hr /></td><td><hr /></td><td><hr /></td></tr><tr  
 valign="baseline" id="TBL-8-4-"><td  align="center" nowrap="nowrap" id="TBL-8-4-1"  
class="td11"> Hold-out <!--l. 4104--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">         <mrow 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mn>9</mn><mo 
class="MathClass-punc">,</mo><mn>1</mn><mn>0</mn></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math> </td><td  align="center" nowrap="nowrap" id="TBL-8-4-2"  
class="td11"><!--l. 4105--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">         <mrow 
><mo 
class="MathClass-punc">.</mo><mn>2</mn><mn>3</mn><mn>5</mn> <mo 
class="MathClass-bin">&#x00B1;</mo> <mo 
class="MathClass-punc">.</mo><mn>0</mn><mn>0</mn><mn>5</mn></mrow></math></td><td  align="center" nowrap="nowrap" id="TBL-8-4-3"  
class="td11"><!--l. 4106--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">         <mrow 
><mo 
class="MathClass-punc">.</mo><mn>1</mn><mn>0</mn><mn>9</mn> <mo 
class="MathClass-bin">&#x00B1;</mo> <mo 
class="MathClass-punc">.</mo><mn>0</mn><mn>1</mn><mn>5</mn></mrow></math></td>
</tr><tr 
class="hline"><td><hr /></td><td><hr /></td><td><hr /></td></tr><tr  
 valign="baseline" id="TBL-8-5-"><td  align="center" nowrap="nowrap" id="TBL-8-5-1"  
class="td11">                                                                                                                                   </td> </tr><!--"|c|c|c|"--></table>
</div>
<!--l. 4112--><p class="indent">   Averages of the true error and estimate accuracy favor progressive validation in this
experiment with a hold-out set of size 10. In fact, the progressive estimate and
hypothesis on a data set of size 15 were better than the hold-out estimate and hypothesis
on a data set of size 19.
</p><!--l. 4118--><p class="indent">
                                                                     

                                                                     
</p>
   <div class="crosslinks"><p class="noindent">[<a 
href="thesisse48.xml" >next</a>] [<a 
href="thesisse46.xml" >prev</a>] [<a 
href="thesisse46.xml#tailthesisse46.xml" >prev-tail</a>] [<a 
href="thesisse47.xml" >front</a>] [<a 
href="thesisch10.xml#thesisse47.xml" >up</a>] </p></div><a 
  name="tailthesisse47.xml"></a>  
</body> 
</html> 
