<?xml version="1.0"?> 
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "mathml.dtd"> 
<?xml-stylesheet type="text/css" href="thesis.css"?> 
<html  
xmlns="http://www.w3.org/1999/xhtml"  
><head><title>5.3 Combining Microchoice with Freund&#x2019;s Query Tree approach</title> 
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1" /> 
<meta name="generator" content="TeX4ht (http://www.cis.ohio-state.edu/~gurari/TeX4ht/mn.html)" /> 
<meta name="originator" content="TeX4ht (http://www.cis.ohio-state.edu/~gurari/TeX4ht/mn.html)" /> 
<!-- 3,early_,early^,xhtml,mozilla --> 
<meta name="src" content="thesis.tex" /> 
<meta name="date" content="2002-08-28 13:56:00" /> 
<link rel="stylesheet" type="text/css" href="thesis.css" /> 
</head><body 
>
   <div class="crosslinks"><p class="noindent">[<a 
href="thesisse24.xml" >next</a>] [<a 
href="thesisse22.xml" >prev</a>] [<a 
href="thesisse22.xml#tailthesisse22.xml" >prev-tail</a>] [<a 
href="#tailthesisse23.xml">tail</a>] [<a 
href="thesisch5.xml#thesisse23.xml" >up</a>] </p></div>
   <h3 class="sectionHead"><span class="titlemark">5.3. </span> <a 
  name="x33-460005.3"></a>Combining Microchoice with Freund&#x2019;s Query Tree approach</h3>
<!--l. 1712--><p class="noindent">The next section is devoted to an improvement of the microchoice bound called
adaptive microchoice, which arises from synthesizing Freund&#x2019;s query trees <span class="cite">[<a 
href="thesisli2.xml#XSB"><span 
class="ecbx-1000">17</span></a>]</span>
with the microchoice bound. This improvement is not easily expressed as a
simplification of Structural Risk Minimization. In essence, the adaptive
microchoice bound can gain from dependence on the learning problem distribution <!--l. 1716--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>D</mi></mrow></math> and
can take advantage of an &#x201C;easy&#x201D; distribution.
</p><!--l. 1719--><p class="indent">   First we require some background material in order to state and understand Freund&#x2019;s
bound.
</p>
   <h4 class="subsectionHead"><span class="titlemark">5.3.1. </span> <a 
  name="x33-470005.3.1"></a>Preliminaries and Definitions</h4>
<!--l. 1725--><p class="noindent">The statistical query framework introduced by Kearns <span class="cite">[<a 
href="thesisli2.xml#XSQ"><span 
class="ecbx-1000">26</span></a>]</span> restricts learning algorithm to only
access the data using statistical queries. A statistical query takes as input a binary predicate, <!--l. 1727--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>&#x03C7;</mi></mrow></math>, mapping examples to
a binary output: <!--l. 1728--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>X</mi><mo 
class="MathClass-punc">,</mo><mi 
>Y</mi> </mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x2192;</mo><mrow><mo 
class="MathClass-open">{</mo><mrow><mn>0</mn><mo 
class="MathClass-punc">,</mo><mn>1</mn></mrow><mo 
class="MathClass-close">}</mo></mrow></mrow></math>.
The output of the statistical query is the average of <!--l. 1729--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>&#x03C7;</mi></mrow></math> over the examples
seen. Let <!--l. 1729--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><msub><mrow 
><mi 
>z</mi></mrow><mrow 
><mi 
>i</mi></mrow></msub 
></mrow></math> be
the <!--l. 1730--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>i</mi></mrow></math>th labeled
example, then: <!--l. 1731--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">       <mrow 
><msub><mrow 
>
                                       <mover 
accent="true"><mrow 
><mi 
>D</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
> <mo 
class="MathClass-rel">=</mo>  <mfrac><mrow 
><mn>1</mn></mrow> 
<mrow 
><mi 
>m</mi></mrow></mfrac><msubsup><mrow 
> <mo 
class="MathClass-op">&#x2211;</mo>
    </mrow><mrow 
><mi 
>i</mi><mo 
class="MathClass-rel">=</mo><mn>1</mn></mrow><mrow 
><mi 
>m</mi></mrow></msubsup 
><mi 
>&#x03C7;</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mi 
>z</mi></mrow><mrow 
>
<mi 
>i</mi></mrow></msub 
></mrow><mo 
class="MathClass-close">)</mo></mrow>
</mrow></math>
</p><!--l. 1735--><p class="indent">   The output is an empirical estimate of the true value <!--l. 1735--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msub><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
> <mo 
class="MathClass-rel">=</mo><msub><mrow 
> <!--mstyle 
class="text"--><mtext class="textbf">E</mtext><!--/mstyle--></mrow><mrow 
><mi 
>D</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">[</mo><mrow><mi 
>&#x03C7;</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>z</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">]</mo></mrow> <mo 
class="MathClass-rel">=</mo><msub><mrow 
><mo 
> Pr</mo></mrow><mrow 
><mi 
>z</mi><mo 
class="MathClass-rel">&#x223C;</mo><mi 
>D</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03C7;</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>z</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo> <mn>1</mn></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>of the query under the distribution <!--l. 1736--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
                                                                     

                                                                     
<mrow 
><mi 
>D</mi></mrow></math><a 
href="thesis34.xml" name="thesis34.xml" ><sup>2</sup></a>
. One simple example of a predicate <!--l. 1742--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>&#x03C7;</mi></mrow></math>
is &#x201C;the first bit is <!--l. 1742--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mn>1</mn></mrow></math>&#x201D;.
A more complicated predicate might be &#x201C;the third bit xor the 4th bit is <!--l. 1743--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mn>0</mn></mrow></math>&#x201D;. Naturally, the
distribution of <!--l. 1744--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>D</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
></mrow></math>
will be the familiar Binomial distribution.
</p><!--l. 1747--><p class="indent">   It is convenient to define <!--l. 1748--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">       <mrow 
><msub><mrow 
>
                        <mover 
accent="true"><mrow 
><mi 
>I</mi></mrow><mo 
class="MathClass-op">&#x0304;</mo></mover></mrow><mrow 
><msub><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo>  <mfrac><mrow 
><mn>1</mn></mrow> 
<mrow 
><mi 
>m</mi></mrow></mfrac><msub><mrow 
><mo 
>max</mo></mrow><mrow 
><mi 
>k</mi></mrow></msub 
> <mfenced separators="" 
open="{"  close="}" ><mrow><mi 
>k</mi> <mo 
class="MathClass-punc">:</mo>  <mn>1</mn> <mo 
class="MathClass-bin">&#x2212;</mo><!--mstyle 
class="text"--><mtext class="textrm">Bin</mtext><!--/mstyle--> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>m</mi><mo 
class="MathClass-punc">,</mo><mi 
>k</mi><mo 
class="MathClass-punc">,</mo><msub><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
></mrow></mfenced> <mo 
class="MathClass-rel">&#x2265;</mo> <mfrac><mrow 
><mi 
>&#x03B4;</mi></mrow> 
<mrow 
><mn>2</mn></mrow></mfrac></mrow></mfenced>
</mrow></math>
and <!--l. 1751--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">        <mrow 
><msub><mrow 
>
                           <munder class="mml-underline"><mrow><mi 
>I</mi></mrow><mo 
accent="true">&OverBar;</mo></munder></mrow><mrow 
><msub><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo>  <mfrac><mrow 
><mn>1</mn></mrow> 
<mrow 
><mi 
>m</mi></mrow></mfrac><msub><mrow 
><mo 
>min</mo></mrow><mrow 
><mi 
>k</mi></mrow></msub 
> <mfenced separators="" 
open="{"  close="}" ><mrow><mi 
>k</mi> <mo 
class="MathClass-punc">:</mo>  <!--mstyle 
class="text"--><mtext class="textrm">Bin</mtext><!--/mstyle--> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>m</mi><mo 
class="MathClass-punc">,</mo><mi 
>k</mi><mo 
class="MathClass-punc">,</mo><msub><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
></mrow></mfenced> <mo 
class="MathClass-rel">&#x2265;</mo> <mfrac><mrow 
><mi 
>&#x03B4;</mi></mrow> 
<mrow 
><mn>2</mn></mrow></mfrac></mrow></mfenced>
</mrow></math>
                                                                     

                                                                     
and let <!--l. 1754--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">        <mrow 
>
                                      <msub><mrow 
><mi 
>I</mi></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo> <mfenced separators="" 
open="["  close="]" ><mrow><msub><mrow 
><munder class="mml-underline"><mrow><mi 
>I</mi></mrow><mo 
accent="true">&OverBar;</mo></munder></mrow><mrow 
><msub><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-punc">,</mo><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>I</mi></mrow><mo 
class="MathClass-op">&#x0304;</mo></mover></mrow><mrow 
><msub><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></mfenced>
</mrow></math>
Intuitively, <!--l. 1756--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><msub><mrow 
><mi 
>I</mi></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
is a (fixed) interval in which the random variable <!--l. 1757--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>D</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
></mrow></math>
will fall with high probability. In other words, we know that: <!--l. 1759--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">
<mrow 
>
                          <mo 
>Pr</mo> <mfenced separators="" 
open="["  close="]" ><mrow><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>D</mi></mrow><mo>&#x0302;</mo></mover></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
> <mo 
class="MathClass-rel">/</mo><mo 
class="MathClass-rel">&#x2208;</mo> <msub><mrow 
><mi 
>I</mi></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></mfenced> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B4;</mi>
</mrow></math> Now,
we want to construct a confidence interval based upon the high confidence interval <!--l. 1762--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msub><mrow 
><mi 
>I</mi></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>.
We can do this using the inversion lemma ( <a 
href="thesisse12.xml#x18-26001r1">3.4.1<!--tex4ht:ref: lem-inversion --></a>) to get: <!--l. 1764--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">
<mrow 
><msub><mrow 
>
                     <mover 
accent="true"><mrow 
><mi 
>D</mi></mrow><mo 
class="MathClass-op">&#x0304;</mo></mover></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo><msub><mrow 
><mo 
> max</mo></mrow><mrow 
><mi 
>p</mi></mrow></msub 
> <mfenced separators="" 
open="{"  close="}" ><mrow><mi 
>p</mi> <mo 
class="MathClass-punc">:</mo><msub><mrow 
> <munder class="mml-underline"><mrow><mi 
>I</mi></mrow><mo 
accent="true">&OverBar;</mo></munder></mrow><mrow 
><mi 
>p</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo><msub><mrow 
> <mover 
accent="true"><mrow 
><mi 
>D</mi></mrow><mo>&#x0302;</mo></mover></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
></mrow></mfenced>
</mrow></math> and <!--l. 1767--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">
                                                                     

                                                                     
<mrow 
><msub><mrow 
>
                      <munder class="mml-underline"><mrow><mi 
>D</mi></mrow><mo 
accent="true">&OverBar;</mo></munder></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo><msub><mrow 
><mo 
> min</mo></mrow><mrow 
><mi 
>p</mi></mrow></msub 
> <mfenced separators="" 
open="{"  close="}" ><mrow><mi 
>p</mi> <mo 
class="MathClass-punc">:</mo><msub><mrow 
> <mover 
accent="true"><mrow 
><mi 
>I</mi></mrow><mo>&#x0304;</mo></mover></mrow><mrow 
><mi 
>p</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo><msub><mrow 
> <mover 
accent="true"><mrow 
><mi 
>D</mi></mrow><mo>&#x0302;</mo></mover></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
></mrow></mfenced>
</mrow></math>
</p><!--l. 1771--><p class="indent">   The random interval defined here contains the &#x201C;real&#x201D; answer <!--l. 1771--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msub><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
></mrow></math> with high probability. In
other words, we have: <!--l. 1773--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">       <mrow 
>
                                   <mo 
>Pr</mo> <mfenced separators="" 
open="["  close="]" ><mrow><msub><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
> <mo 
class="MathClass-rel">/</mo><mo 
class="MathClass-rel">&#x2208;</mo> <mrow><mo 
class="MathClass-open">[</mo><mrow><msub><mrow 
><munder class="mml-underline"><mrow><mi 
>D</mi></mrow><mo 
accent="true">&OverBar;</mo></munder></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-punc">,</mo><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>D</mi></mrow><mo>&#x0304;</mo></mover></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">]</mo></mrow></mrow></mfenced> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B4;</mi>
</mrow></math>
</p>
   <h4 class="subsectionHead"><span class="titlemark">5.3.2. </span> <a 
  name="x33-480005.3.2"></a>Background and Summary</h4>
<!--l. 1780--><p class="noindent">Freund <span class="cite">[<a 
href="thesisli2.xml#XSB"><span 
class="ecbx-1000">17</span></a>]</span> considers choice algorithms that at each step perform a Statistical Query on
the sample, using the result to determine which choice to take. For an algorithm <!--l. 1782--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>A</mi></mrow></math>, tolerance <!--l. 1782--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>&#x03B1;</mi></mrow></math> (defined next), and
distribution <!--l. 1783--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mi 
>D</mi></mrow></math>, Freund
defines the query tree <!--l. 1783--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><msub><mrow 
><mi 
>T</mi></mrow><mrow 
><mi 
>A</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>D</mi><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B1;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
as the choice tree created by considering only those choices resulting from answers <!--l. 1784--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>D</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
></mrow></math> to queries <!--l. 1785--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>&#x03C7;</mi></mrow></math> such that <!--l. 1785--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mfenced separators="" 
open="|"  close="|" ><mrow><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>D</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
><mo 
class="MathClass-bin">&#x2212;</mo> <msub><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
></mrow></mfenced> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B1;</mi></mrow></math>. The idea is that if a
particular predicate, <!--l. 1786--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mi 
>&#x03C7;</mi></mrow></math>,
is true with probability <!--l. 1787--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mo 
class="MathClass-punc">.</mo><mn>9</mn></mrow></math>
(for example) on a random sample it is very unlikely that the empirical result of the query will
be <!--l. 1788--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mo 
class="MathClass-punc">.</mo><mn>1</mn></mrow></math>.
More generally, the chance the answer to a given query is off by more than <!--l. 1789--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>&#x03B1;</mi></mrow></math> is at most <!--l. 1789--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mn>2</mn><msup><mrow 
><mi 
>e</mi></mrow><mrow 
><mo 
class="MathClass-bin">&#x2212;</mo><mn>2</mn><mi 
>m</mi><msup><mrow 
><mi 
>&#x03B1;</mi></mrow><mrow 
><mn>2</mn></mrow></msup 
>
           </mrow></msup 
></mrow></math>
                                                                     

                                                                     
by Hoeffding&#x2019;s inequality. So, if the entire tree contains a total of <!--l. 1790--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mo 
class="MathClass-rel">&#x2223;</mo><mi 
>Q</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mi 
>T</mi></mrow><mrow 
><mi 
>A</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>D</mi><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B1;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-rel">&#x2223;</mo></mrow></math>
queries in it, the probability <span 
class="ecti-1000">any </span>of these queries is off by more than <!--l. 1792--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>&#x03B1;</mi></mrow></math> is at most <!--l. 1792--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mn>2</mn> <mo 
class="MathClass-punc">&#x22C5;</mo><mo 
class="MathClass-rel">&#x2223;</mo><mi 
>Q</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mi 
>T</mi></mrow><mrow 
><mi 
>A</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>D</mi><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B1;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-rel">&#x2223;</mo><mo 
class="MathClass-punc">&#x22C5;</mo> <msup><mrow 
><mi 
>e</mi></mrow><mrow 
><mo 
class="MathClass-bin">&#x2212;</mo><mn>2</mn><mi 
>m</mi><msup><mrow 
><mi 
>&#x03B1;</mi></mrow><mrow 
><mn>2</mn></mrow></msup 
>
           </mrow></msup 
></mrow></math>.
In other words, this is an upper bound on the probability the algorithm
ever &#x201C;falls off the tree&#x201D; and makes a low probability choice. The point
of this is that we can allocate half (say) of the confidence parameter <!--l. 1795--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>&#x03B4;</mi></mrow></math> to the
event that the algorithm ever falls off the tree, and then spread the remaining half evenly
on the hypotheses in the tree (which hopefully is a much smaller set than the entire
hypothesis set).
</p><!--l. 1800--><p class="indent">   Unfortunately, the query tree suffers from the same problem as the <!--l. 1800--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>q</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
distribution considered in section ( <a 
href="thesisse21.xml#x30-390005.1">5.1<!--tex4ht:ref: sec:motivating --></a>), namely that to compute it, one needs to know <!--l. 1802--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>D</mi></mrow></math>. So,
Freund proposes an algorithmic method to find a super-set approximation of the tree. The
idea is that by analyzing the results of queries, it is possible to determine which outcomes were
unlikely given that the query is close to the desired outcome. In particular, each time a query <!--l. 1806--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>&#x03C7;</mi></mrow></math> is asked and a response <!--l. 1806--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>D</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
></mrow></math> is received, if it is true
that <!--l. 1807--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mo 
class="MathClass-rel">&#x2223;</mo><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>D</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
> <mo 
class="MathClass-bin">&#x2212;</mo> <msub><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
><mo 
class="MathClass-rel">&#x2223;</mo><mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B1;</mi></mrow></math>, then the
range <!--l. 1808--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mfenced separators="" 
open="["  close="]" ><mrow><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>D</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
> <mo 
class="MathClass-bin">&#x2212;</mo> <mn>2</mn><mi 
>&#x03B1;</mi><mo 
class="MathClass-punc">,</mo><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>D</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
> <mo 
class="MathClass-bin">+</mo> <mn>2</mn><mi 
>&#x03B1;</mi></mrow></mfenced></mrow></math> contains
the range <!--l. 1809--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mfenced separators="" 
open="["  close="]" ><mrow><msub><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
> <mo 
class="MathClass-bin">&#x2212;</mo> <mi 
>&#x03B1;</mi><mo 
class="MathClass-punc">,</mo><msub><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
> <mo 
class="MathClass-bin">+</mo> <mi 
>&#x03B1;</mi></mrow></mfenced></mrow></math>.
Thus, under the assumption that no query in the <span 
class="ecti-1000">correct </span>tree is answered badly, a super-set
of the correct tree can be produced by exploring all choices resulting from responses within <!--l. 1812--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mn>2</mn><mi 
>&#x03B1;</mi></mrow></math> of
the response actually received. By applying this method to every node in the
query tree we can generate an empirically observable super-set of the query
tree: that is, the original query tree is a pruning of the empirically constructed
tree.
</p><!--l. 1817--><p class="indent">   A drawback of this method is that it can easily take exponential time to produce the
approximate tree, because even the smaller correct tree can have a size exponential in the
running time of the learning algorithm. Instead, we would much rather simply keep track
of the choices actually made and the sizes of the nodes actually followed, which is
what the microchoice approach allows us to do. As a secondary point, given <!--l. 1822--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>&#x03B4;</mi></mrow></math>, computing a good
value of <!--l. 1823--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>&#x03B1;</mi></mrow></math>
for Freund&#x2019;s approach is not trivial, see <span class="cite">[<a 
href="thesisli2.xml#XSB"><span 
class="ecbx-1000">17</span></a>]</span>; we will be able to finesse that issue and use the tighter
bound <!--l. 1824--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>D</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
> <mo 
class="MathClass-rel">&#x2208;</mo> <msub><mrow 
><mi 
>I</mi></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>.
</p><!--l. 1826--><p class="indent">   In order to apply the microchoice approach, we modify Freund&#x2019;s
query tree so that different nodes in the tree receive different confidence, <!--l. 1827--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
                                                                     

                                                                     
<mrow 
><mi 
>&#x03B4;</mi></mrow></math>,
much in the same way that different hypotheses <!--l. 1828--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>h</mi></mrow></math> in our choice tree receive
different values of <!--l. 1829--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mi 
>&#x03B4;</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>.
</p>
   <h4 class="subsectionHead"><span class="titlemark">5.3.3. </span> <a 
  name="x33-490005.3.3"></a>Microchoice Bounds for Query Trees</h4>
<!--l. 1834--><p class="noindent">The manipulations of the choice tree are now reasonably straightforward. We begin by
describing the <span 
class="ecti-1000">true </span>microchoice query tree and then give the algorithmic approximation.
As with the choice tree in section ( <a 
href="thesisse22.xml#x32-400005.2">5.2<!--tex4ht:ref: sec:mc --></a>), one should think of each node in the tree as
representing the current internal state of the algorithm.
</p><!--l. 1840--><p class="indent">   We incorporate Freund&#x2019;s approach into the choice tree
construction by having each internal node allocate a portion, <!--l. 1841--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msup><mrow 
><mi 
>p</mi></mrow><mrow 
><mi 
>&#x2032;</mi></mrow></msup 
></mrow></math>
of its &#x201C;supply&#x201D;  of failure probability to the event that <!--l. 1842--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>D</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
> <mo 
class="MathClass-rel">/</mo><mo 
class="MathClass-rel">&#x2208;</mo> <msub><mrow 
><mi 
>I</mi></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi> <mo 
class="MathClass-bin">&#x2217;</mo> <msup><mrow 
><mi 
>p</mi></mrow><mrow 
><mi 
>&#x2032;</mi></mrow></msup 
></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>. The node then
splits the remainder of its supply evenly among the children corresponding to choices that result from
answers <!--l. 1844--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>D</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
></mrow></math>
with <!--l. 1844--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>D</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
> <mo 
class="MathClass-rel">&#x2208;</mo> <msub><mrow 
><mi 
>I</mi></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi> <mo 
class="MathClass-bin">&#x2217;</mo> <msup><mrow 
><mi 
>p</mi></mrow><mrow 
><mi 
>&#x2032;</mi></mrow></msup 
></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>.
Choices that would result from &#x201C;bad&#x201D; answers to the query are <span 
class="ecti-1000">pruned away </span>from the tree
and get nothing. This continues down the tree to the leaves. Pictorially, this looks
like:
</p><!--l. 1850--><p class="noindent"><img 
src="thesis8x.gif" alt="PIC" class="graphics" width="748.79749pt" height="326.21875pt"  /><!--tex4ht:graphics  
name="thesis8x.gif" src="thesis-presentation/amicro_tree.eps"  
-->
</p><!--l. 1853--><p class="indent">   How should <!--l. 1853--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><msup><mrow 
><mi 
>p</mi></mrow><mrow 
><mi 
>&#x2032;</mi></mrow></msup 
></mrow></math> be
chosen? Smaller values of <!--l. 1853--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><msup><mrow 
><mi 
>p</mi></mrow><mrow 
><mi 
>&#x2032;</mi></mrow></msup 
></mrow></math>
result in larger intervals <!--l. 1854--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><msub><mrow 
><mi 
>I</mi></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi> <mo 
class="MathClass-bin">&#x2217;</mo> <msup><mrow 
><mi 
>p</mi></mrow><mrow 
><mi 
>&#x2032;</mi></mrow></msup 
></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
leading to more children in the pruned tree and less confidence given to each. Larger values
of <!--l. 1855--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><msup><mrow 
><mi 
>p</mi></mrow><mrow 
><mi 
>&#x2032;</mi></mrow></msup 
></mrow></math>
result in less left over to divide among the children. Unfortunately, our
                                                                     

                                                                     
algorithmic approximation (which only sees the empirical answers <!--l. 1857--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>D</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
></mrow></math> and
needs to be efficient) will not be able to make this optimization. Therefore, we define <!--l. 1859--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msup><mrow 
><mi 
>p</mi></mrow><mrow 
><mi 
>&#x2032;</mi></mrow></msup 
></mrow></math> in the <span 
class="ecti-1000">true </span>microchoice
query tree to be <!--l. 1859--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
>  <mfrac><mrow 
><mi 
>p</mi></mrow>
<mrow 
><mi 
>d</mi><mo 
class="MathClass-bin">+</mo><mn>1</mn></mrow></mfrac></mrow></math>
where <!--l. 1860--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>d</mi></mrow></math>
is the depth of the current node. This choice will imply that the adaptive microchoice
bound is never much worse than the Microchoice bound, and sometimes much
better.
</p><!--l. 1864--><p class="indent">   Since a particular query value <!--l. 1864--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>D</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
></mrow></math>
implies a particular choice <!--l. 1865--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mi 
>c</mi></mrow></math>,
we can think of the interval <!--l. 1865--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><msub><mrow 
><mi 
>I</mi></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
as containing <span 
class="ecti-1000">choices </span>rather than query results. After all, we only care about the choices
the algorithm makes. We can calculate the probability assigned to a hypothesis in
the <span 
class="ecti-1000">true </span>adaptive microchoice query tree according to the following algorithm:
</p>
   <div class="newtheorem">
<!--l. 1872--><p class="noindent"><span class="head">
<a 
  name="x33-49001r1"></a>
  <span 
class="eccc-1000">A<small 
class="small-caps">L</small><small 
class="small-caps">G</small><small 
class="small-caps">O</small><small 
class="small-caps">R</small><small 
class="small-caps">I</small><small 
class="small-caps">T</small><small 
class="small-caps">H</small><small 
class="small-caps">M</small> </span>5.3.1<span 
class="eccc-1000">.</span></span>
</p><!--l. 1873--><p class="indent">   <span 
class="ecti-1000">True_Adaptive_Microchoice(</span><!--l. 1873--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>&#x03B4;</mi></mrow></math><span 
class="ecti-1000">)</span>
</p>
   </div>
<!--l. 1875--><p class="indent">
           </p><ol type="1" class="enumerate1" start="1" 
>
        <li class="enumerate"><a 
  name="x33-49003x1"></a>set <!--l. 1876--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>p</mi> <mo 
class="MathClass-rel">&#x2190;</mo> <mn>1</mn></mrow></math>
           </li>
        <li class="enumerate"><a 
  name="x33-49005x2"></a>set <!--l. 1877--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>d</mi> <mo 
class="MathClass-rel">=</mo> <mn>1</mn></mrow></math>
           </li>
        <li class="enumerate"><a 
  name="x33-49007x3"></a>while learning algorithm has not halted.
        <!--l. 1880--><p class="indent">
            </p><ol type="a" class="enumerate2" start="1" 
>
            <li class="enumerate"><a 
  name="x33-49009x1"></a><!--l. 1881--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>d</mi> <mo 
class="MathClass-rel">&#x2190;</mo> <mi 
>d</mi> <mo 
class="MathClass-bin">+</mo> <mn>1</mn></mrow></math>
            </li>
            <li class="enumerate"><a 
  name="x33-49011x2"></a><!--l. 1882--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><msup><mrow 
><mi 
>p</mi></mrow><mrow 
><mi 
>&#x2032;</mi></mrow></msup 
><mo 
class="MathClass-rel">&#x2190;</mo> <mfrac><mrow 
><mi 
>p</mi></mrow> 
<mrow 
><mi 
>d</mi></mrow></mfrac></mrow></math>
            </li>
            <li class="enumerate"><a 
  name="x33-49013x3"></a>Let <!--l. 1883--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>C</mi></mrow></math>
            = the current set of possible data-dependent choices.
                                                                     

                                                                     
            </li>
            <li class="enumerate"><a 
  name="x33-49015x4"></a><!--l. 1884--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mo 
class="MathClass-rel">&#x2223;</mo><mover 
accent="true"><mrow 
><mi 
>C</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover><mo 
class="MathClass-rel">&#x2223;</mo><mo 
class="MathClass-rel">&#x2190;</mo><mo 
class="MathClass-rel">&#x2223;</mo><mrow><mo 
class="MathClass-open">{</mo><mrow><mi 
>c</mi> <mo 
class="MathClass-rel">&#x2208;</mo> <mi 
>C</mi><mo 
class="MathClass-rel">&#x2223;</mo><mi 
>c</mi> <mo 
class="MathClass-rel">&#x2208;</mo> <msub><mrow 
><mi 
>I</mi></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi> <mo 
class="MathClass-bin">&#x2217;</mo> <msup><mrow 
><mi 
>p</mi></mrow><mrow 
><mi 
>&#x2032;</mi></mrow></msup 
></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">}</mo></mrow><mo 
class="MathClass-rel">&#x2223;</mo></mrow></math>
            </li>
            <li class="enumerate"><a 
  name="x33-49017x5"></a><!--l. 1885--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">   <mrow 
><mi 
>p</mi> <mo 
class="MathClass-rel">&#x2190;</mo> <mfrac><mrow 
><mi 
>p</mi><mo 
class="MathClass-bin">&#x2212;</mo><msup><mrow 
><mi 
>p</mi></mrow><mrow 
><mi 
>&#x2032;</mi></mrow></msup 
></mrow> 
<mrow 
><mo 
class="MathClass-rel">&#x2223;</mo><mover 
accent="true"><mrow 
><mi 
>C</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover><mo 
class="MathClass-rel">&#x2223;</mo></mrow></mfrac> </mrow></math></li></ol>
        <!--l. 1886--><p class="nopar">
           </p></li>
        <li class="enumerate"><a 
  name="x33-49019x4"></a>return <!--l. 1887--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mi 
>p</mi></mrow></math></li></ol>
<!--l. 1888--><p class="nopar"> There are two important things to note about this algorithm. First of all, we could plug the
value <!--l. 1890--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>p</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
it returns into the Occam&#x2019;s Razor Bound  <a 
href="thesisse20.xml#x27-36001r1">4.6.1<!--tex4ht:ref: th-ORB --></a> and receive a bound on the true error
rate of our chosen classifier.
</p><!--l. 1893--><p class="indent">   Second, this algorithm can not be executed. The essential problem is determining whether
or not <!--l. 1894--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>c</mi> <mo 
class="MathClass-rel">&#x2208;</mo> <msub><mrow 
><mi 
>I</mi></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi> <mo 
class="MathClass-bin">&#x2217;</mo> <msup><mrow 
><mi 
>p</mi></mrow><mrow 
><mi 
>&#x2032;</mi></mrow></msup 
></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
which cannot be done without knowledge of the underlying distribution <!--l. 1895--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>D</mi></mrow></math>. However, we
<span 
class="ecti-1000">can </span>calculate an approximate version of this algorithm which, with high probability, returns a
value <!--l. 1897--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>p</mi></mrow></math>
which is smaller. Since a smaller value is pessimistic, we can use it in our bounds.
</p><!--l. 1900--><p class="indent">   The algorithmic approximation uses the idea in <span class="cite">[<a 
href="thesisli2.xml#XSB"><span 
class="ecbx-1000">17</span></a>]</span> of including
all choices within the <span 
class="ecti-1000">double </span>confidence interval of the observed value <!--l. 1901--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>D</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
></mrow></math>.
Unlike <span class="cite">[<a 
href="thesisli2.xml#XSB"><span 
class="ecbx-1000">17</span></a>]</span>, however, we do not actually create the tree; instead we just follow the
path taken by the learning algorithm, and argue that the &#x201C;supply&#x201D; probability
remaining at the leaf is no greater than the amount that would have been there
in the original tree. Finally, the algorithm outputs a bound calculated with <!--l. 1906--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>p</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>.
</p><!--l. 1908--><p class="indent">   Specifically, the algorithm is as follows. Suppose we are at a node of the tree containing statistical
query <!--l. 1909--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>&#x03C7;</mi></mrow></math> at
depth <!--l. 1909--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>d</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03C7;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math> and
we have a <!--l. 1910--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>p</mi></mrow></math>
supply of parameter. (If the current node is the root, then <!--l. 1910--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>p</mi> <mo 
class="MathClass-rel">=</mo> <mn>1</mn></mrow></math> and <!--l. 1911--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>d</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03C7;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo> <mn>1</mn></mrow></math>). We choose <!--l. 1911--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msup><mrow 
><mi 
>p</mi></mrow><mrow 
><mi 
>&#x2032;</mi></mrow></msup 
> <mo 
class="MathClass-rel">=</mo> <mi 
>p</mi><mo 
class="MathClass-bin">/</mo><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>d</mi> <mo 
class="MathClass-bin">+</mo> <mn>1</mn></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>, ask the query <!--l. 1911--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>&#x03C7;</mi></mrow></math>, and receive <!--l. 1912--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>D</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
></mrow></math>. Let <!--l. 1913--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">
                                                                     

                                                                     
<mrow 
><msub><mrow 
>
               <mover 
accent="true"><mrow 
><mover 
accent="true"><mrow 
><mi 
>D</mi></mrow><mo 
class="MathClass-op">&#x0304;</mo></mover></mrow><mo 
class="MathClass-op">&#x0304;</mo></mover></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo><msub><mrow 
><mo 
> min</mo></mrow><mrow 
><mi 
>k</mi></mrow></msub 
> <mfenced separators="" 
open="{"  close="}" ><mrow> <mfrac><mrow 
><mi 
>k</mi></mrow>
<mrow 
><mi 
>m</mi></mrow></mfrac> <mo 
class="MathClass-punc">:</mo>  <mn>1</mn> <mo 
class="MathClass-bin">&#x2212;</mo><!--mstyle 
class="text"--><mtext class="textrm">Bin</mtext><!--/mstyle--> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>m</mi><mo 
class="MathClass-punc">,</mo><mi 
>k</mi><mo 
class="MathClass-punc">,</mo><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>D</mi></mrow><mo>&#x0304;</mo></mover></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></mfenced> <mo 
class="MathClass-rel">&#x2264;</mo> <mfrac><mrow 
><mi 
>&#x03B4;</mi></mrow> 
<mrow 
><mn>2</mn></mrow></mfrac></mrow></mfenced>
</mrow></math> and <!--l. 1916--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">
<mrow 
><msub><mrow 
>
                 <munder class="mml-underline"><mrow><munder class="mml-underline"><mrow><mi 
>D</mi></mrow><mo 
accent="true">&OverBar;</mo></munder></mrow><mo 
accent="true">&OverBar;</mo></munder></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo><msub><mrow 
><mo 
> max</mo></mrow><mrow 
><mi 
>k</mi></mrow></msub 
> <mfenced separators="" 
open="{"  close="}" ><mrow> <mfrac><mrow 
><mi 
>k</mi></mrow>
<mrow 
><mi 
>m</mi></mrow></mfrac> <mo 
class="MathClass-punc">:</mo>  <!--mstyle 
class="text"--><mtext class="textrm">Bin</mtext><!--/mstyle--> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>m</mi><mo 
class="MathClass-punc">,</mo><mi 
>k</mi><mo 
class="MathClass-punc">,</mo><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>D</mi></mrow><mo>&#x0304;</mo></mover></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></mfenced> <mo 
class="MathClass-rel">&#x2264;</mo> <mfrac><mrow 
><mi 
>&#x03B4;</mi></mrow> 
<mrow 
><mn>2</mn></mrow></mfrac></mrow></mfenced>
</mrow></math> with <!--l. 1919--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">
<mrow 
><msub><mrow 
>
                         <mover 
accent="true"><mrow 
><mi 
>I</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo> <mfenced separators="" 
open="["  close="]" ><mrow><msub><mrow 
><munder class="mml-underline"><mrow><munder class="mml-underline"><mrow><mi 
>D</mi></mrow><mo 
accent="true">&OverBar;</mo></munder></mrow><mo 
accent="true">&OverBar;</mo></munder></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-punc">,</mo><msub><mrow 
><mover 
accent="true"><mrow 
><mover 
accent="true"><mrow 
><mi 
>D</mi></mrow><mo 
class="MathClass-op">&#x0304;</mo></mover></mrow><mo 
class="MathClass-op">&#x0304;</mo></mover></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></mfenced>
</mrow></math>
</p><!--l. 1923--><p class="indent">   We now let <!--l. 1923--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mi 
>k</mi></mrow></math>
be the number of children of our node corresponding to answers in the range <!--l. 1924--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>I</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><msup><mrow 
><mi 
>p</mi></mrow><mrow 
><mi 
>&#x2032;</mi></mrow></msup 
></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>.
We then go to the child corresponding to the answer <!--l. 1925--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>D</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
></mrow></math>
that we received, giving this child a confidence parameter supply of <!--l. 1926--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>p</mi> <mo 
class="MathClass-bin">&#x2212;</mo> <msup><mrow 
><mi 
>p</mi></mrow><mrow 
><mi 
>&#x2032;</mi></mrow></msup 
></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-bin">/</mo><mi 
>k</mi></mrow></math>.
This is the same as we would have given it had we allocated <!--l. 1927--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>p</mi> <mo 
class="MathClass-bin">&#x2212;</mo> <msup><mrow 
><mi 
>p</mi></mrow><mrow 
><mi 
>&#x2032;</mi></mrow></msup 
></mrow></math> to the
                                                                     

                                                                     
children equally. We then continue from that child. Finally, when we reach a
leaf, we output the probability left for the hypothesis. Pictorially, this looks
like:
</p><!--l. 1932--><p class="noindent"><img 
src="thesis9x.gif" alt="PIC" class="graphics" width="748.79749pt" height="386.44374pt"  /><!--tex4ht:graphics  
name="thesis9x.gif" src="thesis-presentation/amcicro_alg.eps"  
-->
</p><!--l. 1935--><p class="indent">   Notice that the second choice set is larger than in the true adaptive microchoice set
tree. This can easily happen and it makes our results somewhat more pessimistic. The
approximate adaptive microchoice algorithm is specified as follows:
</p>
   <div class="newtheorem">
<!--l. 1939--><p class="noindent"><span class="head">
<a 
  name="x33-49020r2"></a>
  <span 
class="eccc-1000">A<small 
class="small-caps">L</small><small 
class="small-caps">G</small><small 
class="small-caps">O</small><small 
class="small-caps">R</small><small 
class="small-caps">I</small><small 
class="small-caps">T</small><small 
class="small-caps">H</small><small 
class="small-caps">M</small> </span>5.3.2<span 
class="eccc-1000">.</span></span>
</p><!--l. 1940--><p class="indent">   <span 
class="ecti-1000">Approximate_Adaptive_Microchoice(</span><!--l. 1940--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>&#x03B4;</mi></mrow></math><span 
class="ecti-1000">)</span>
</p>
   </div>
<!--l. 1942--><p class="indent">
           </p><ol type="1" class="enumerate1" start="1" 
>
        <li class="enumerate"><a 
  name="x33-49022x1"></a>set <!--l. 1943--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>p</mi> <mo 
class="MathClass-rel">&#x2190;</mo> <mn>1</mn></mrow></math>
           </li>
        <li class="enumerate"><a 
  name="x33-49024x2"></a>set <!--l. 1944--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>d</mi> <mo 
class="MathClass-rel">=</mo> <mn>1</mn></mrow></math>
           </li>
        <li class="enumerate"><a 
  name="x33-49026x3"></a>while learning algorithm has not halted.
        <!--l. 1947--><p class="indent">
            </p><ol type="a" class="enumerate2" start="1" 
>
            <li class="enumerate"><a 
  name="x33-49028x1"></a><!--l. 1948--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>d</mi> <mo 
class="MathClass-rel">&#x2190;</mo> <mi 
>d</mi> <mo 
class="MathClass-bin">+</mo> <mn>1</mn></mrow></math>
            </li>
                                                                     

                                                                     
            <li class="enumerate"><a 
  name="x33-49030x2"></a><!--l. 1949--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><msup><mrow 
><mi 
>p</mi></mrow><mrow 
><mi 
>&#x2032;</mi></mrow></msup 
><mo 
class="MathClass-rel">&#x2190;</mo> <mfrac><mrow 
><mi 
>p</mi></mrow> 
<mrow 
><mi 
>d</mi></mrow></mfrac></mrow></math>
            </li>
            <li class="enumerate"><a 
  name="x33-49032x3"></a>Let <!--l. 1950--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>C</mi></mrow></math>
            = the current set of possible data-dependent choices.
            </li>
            <li class="enumerate"><a 
  name="x33-49034x4"></a><!--l. 1951--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mo 
class="MathClass-rel">&#x2223;</mo><mover 
accent="true"><mrow 
><mi 
>C</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover><mo 
class="MathClass-rel">&#x2223;</mo><mo 
class="MathClass-rel">&#x2190;</mo><mo 
class="MathClass-rel">&#x2223;</mo><mrow><mo 
class="MathClass-open">{</mo><mrow><mi 
>c</mi> <mo 
class="MathClass-rel">&#x2208;</mo> <mi 
>C</mi><mo 
class="MathClass-rel">&#x2223;</mo><mi 
>c</mi> <mo 
class="MathClass-rel">&#x2208;</mo><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>I</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi> <mo 
class="MathClass-bin">&#x2217;</mo> <msup><mrow 
><mi 
>p</mi></mrow><mrow 
><mi 
>&#x2032;</mi></mrow></msup 
></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">}</mo></mrow><mo 
class="MathClass-rel">&#x2223;</mo></mrow></math>
            </li>
            <li class="enumerate"><a 
  name="x33-49036x5"></a><!--l. 1952--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">   <mrow 
><mi 
>p</mi> <mo 
class="MathClass-rel">&#x2190;</mo> <mfrac><mrow 
><mi 
>p</mi><mo 
class="MathClass-bin">&#x2212;</mo><msup><mrow 
><mi 
>p</mi></mrow><mrow 
><mi 
>&#x2032;</mi></mrow></msup 
></mrow> 
<mrow 
><mo 
class="MathClass-rel">&#x2223;</mo><mover 
accent="true"><mrow 
><mi 
>C</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover><mo 
class="MathClass-rel">&#x2223;</mo></mrow></mfrac> </mrow></math></li></ol>
        <!--l. 1953--><p class="nopar">
           </p></li>
        <li class="enumerate"><a 
  name="x33-49038x4"></a>return <!--l. 1954--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mi 
>p</mi></mrow></math></li></ol>
<!--l. 1955--><p class="nopar"> Let <!--l. 1956--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mi 
>d</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math> be the depth of
some hypothesis <!--l. 1956--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mi 
>h</mi></mrow></math> in the
empirical path and <!--l. 1957--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>C</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mn>1</mn></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>,
<!--l. 1957--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">     <mrow 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>C</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mn>2</mn></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>, <!--l. 1957--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mo 
class="MathClass-op">&#x2026;</mo></mrow></math>, <!--l. 1957--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>C</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>d</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math> be the sequence of choice
sets resulting in <!--l. 1958--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mi 
>h</mi></mrow></math> in the
algorithmic construction; i.e., <!--l. 1959--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>C</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>i</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
is the number of unpruned children of the <!--l. 1959--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>i</mi></mrow></math>-th node. Then, the
confidence placed on <!--l. 1960--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mi 
>h</mi></mrow></math>
will be:
</p>
   <table class="equation"><tr><td> <a 
  name="x33-49039r1"></a>
<!--l. 1962--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">     
              <mi 
>p</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo><msubsup><mrow 
> <mo 
class="MathClass-op">&#x220F;</mo>
  </mrow><mrow 
><mi 
>i</mi><mo 
class="MathClass-rel">=</mo><mn>1</mn></mrow><mrow 
><mi 
>d</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></msubsup 
> <mfenced separators="" 
open="("  close=")" ><mrow>    <mfrac><mrow 
><mi 
>i</mi></mrow>
<mrow 
><mi 
>i</mi> <mo 
class="MathClass-bin">+</mo> <mn>1</mn></mrow></mfrac>     <mfrac><mrow 
><mn>1</mn></mrow> 
<mrow 
><mo 
class="MathClass-rel">&#x2223;</mo><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>C</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>i</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-rel">&#x2223;</mo></mrow></mfrac></mrow></mfenced> <mo 
class="MathClass-rel">=</mo>      <mfrac><mrow 
><mn>1</mn></mrow> 
<mrow 
><mi 
>d</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-bin">+</mo> <mn>1</mn></mrow></mfrac><msubsup><mrow 
> <mo 
class="MathClass-op">&#x220F;</mo>
        </mrow><mrow 
><mi 
>i</mi><mo 
class="MathClass-rel">=</mo><mn>1</mn></mrow><mrow 
><mi 
>d</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></msubsup 
>     <mfrac><mrow 
><mn>1</mn></mrow>
<mrow 
><mo 
class="MathClass-rel">&#x2223;</mo><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>C</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>i</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-rel">&#x2223;</mo></mrow></mfrac>
</math>
<!--l. 1965--><p class="nopar"></p></td><td class="eq-no">(5.3.1)</td></tr></table>
   <div class="newtheorem">
<!--l. 1968--><p class="noindent"><span class="head">
<a 
  name="x33-49040r3"></a>
                                                                     

                                                                     
  <span 
class="eccc-1000">T<small 
class="small-caps">H</small><small 
class="small-caps">E</small><small 
class="small-caps">O</small><small 
class="small-caps">R</small><small 
class="small-caps">E</small><small 
class="small-caps">M</small> </span>5.3.3<span 
class="eccc-1000">.</span></span>
</p><!--l. 1969--><p class="indent">   <span 
class="ecti-1000">(Adaptive Microchoice Bound) For all hypothesis spaces, </span><!--l. 1969--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>H</mi></mrow></math><span 
class="ecti-1000">,</span>
<span 
class="ecti-1000">for all </span><!--l. 1970--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>&#x03B4;</mi> <mo 
class="MathClass-rel">&#x2208;</mo> <mrow><mo 
class="MathClass-open">(</mo><mrow><mn>0</mn><mo 
class="MathClass-punc">,</mo><mn>1</mn></mrow><mo 
class="MathClass-close">]</mo></mrow></mrow></math><span 
class="ecti-1000">:</span>
<!--l. 1971--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">    <mrow 
>
                     <msub><mrow 
><mo 
>Pr</mo></mrow><mrow 
><msup><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>m</mi></mrow></msup 
></mrow></msub 
> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>&#x2203;</mi><mi 
>h</mi> <mo 
class="MathClass-rel">&#x2208;</mo> <mi 
>H</mi> <mo 
class="MathClass-punc">:</mo>  <mi 
>e</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x003E;</mo> <mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo>&#x0304;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>m</mi><mo 
class="MathClass-punc">,</mo><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo>&#x0302;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-punc">,</mo><mi 
>p</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-bin">&#x2217;</mo> <mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></mfenced> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B4;</mi>
</mrow></math>
<span 
class="ecti-1000">where </span><!--l. 1973--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0304;</mo></mover> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>m</mi><mo 
class="MathClass-punc">,</mo> <mfrac><mrow 
><mi 
>k</mi></mrow> 
<mrow 
><mi 
>m</mi></mrow></mfrac><mo 
class="MathClass-punc">,</mo><mi 
>&#x03B4;</mi></mrow></mfenced> <mo 
class="MathClass-rel">&#x2261;</mo><msub><mrow 
><mo 
> max</mo></mrow><mrow 
><mi 
>p</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">{</mo><mrow><mi 
>p</mi> <mo 
class="MathClass-punc">:</mo>  <!--mstyle 
class="text"--><mtext class="textrm">Bin</mtext><!--/mstyle--><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>m</mi><mo 
class="MathClass-punc">,</mo><mi 
>k</mi><mo 
class="MathClass-punc">,</mo><mi 
>p</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">=</mo> <mi 
>&#x03B4;</mi></mrow><mo 
class="MathClass-close">}</mo></mrow></mrow></math><span 
class="ecti-1000">,</span>
<span 
class="ecti-1000">and </span><!--l. 1974--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>p</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
<span 
class="ecti-1000">is as defined in equation  </span><a 
href="#x33-49039r1"><span 
class="ecti-1000">5.3.1</span><!--tex4ht:ref: eqn:adaptiveMC --></a><span 
class="ecti-1000">.</span>
</p>
   </div>
   <div class="proof">
<!--l. 1977--><p class="indent">   <span class="head">
   <span 
class="eccc-1000">P<small 
class="small-caps">R</small><small 
class="small-caps">O</small><small 
class="small-caps">O</small><small 
class="small-caps">F</small>.</span> </span>By              design,              with              probability              <!--l. 1977--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mn>1</mn> <mo 
class="MathClass-bin">&#x2212;</mo> <mi 
>&#x03B4;</mi></mrow></math>
all  queries  in  the  true  microchoice  query  tree  receive  good  answers,  <span 
class="ecti-1000">and  </span>all
hypotheses in that tree have their true errors within their estimates.
</p><!--l. 1981--><p class="noindent">We         will         prove         that         in         the         high         probability
case,  the  output  of  the  Approximate_Adaptive_Microchoice  algorithm  is  less
than the output of the True_Adaptive_Microchoice algorithm. Since a smaller <!--l. 1983--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>p</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
makes the bound more pessimistic, we will prove the bound.
</p><!--l. 1986--><p class="noindent">Assume inductively that at the current node of our empirical path the supply <!--l. 1987--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msub><mrow 
><mi 
>p</mi></mrow><mrow 
><mi 
>e</mi><mi 
>m</mi><mi 
>p</mi></mrow></msub 
></mrow></math>
is no greater than the supply <!--l. 1987--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><msub><mrow 
><mi 
>p</mi></mrow><mrow 
><mi 
>t</mi><mi 
>r</mi><mi 
>u</mi><mi 
>e</mi></mrow></msub 
></mrow></math>
given to that node in the true tree. This is clearly satisfied in the base case when <!--l. 1989--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msub><mrow 
><mi 
>p</mi></mrow><mrow 
><mi 
>e</mi><mi 
>m</mi><mi 
>p</mi></mrow></msub 
> <mo 
class="MathClass-rel">=</mo> <msub><mrow 
><mi 
>p</mi></mrow><mrow 
><mi 
>t</mi><mi 
>r</mi><mi 
>u</mi><mi 
>e</mi></mrow></msub 
> <mo 
class="MathClass-rel">=</mo> <mn>1</mn></mrow></math>.
</p><!--l. 1991--><p class="noindent">Under the assumption that the response <!--l. 1991--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>D</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
></mrow></math>
                                                                     

                                                                     
falls in the interval <!--l. 1992--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><msub><mrow 
><mi 
>I</mi></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mi 
>p</mi></mrow><mrow 
><mi 
>t</mi><mi 
>r</mi><mi 
>u</mi><mi 
>e</mi></mrow></msub 
><mo 
class="MathClass-bin">/</mo><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>d</mi> <mo 
class="MathClass-bin">+</mo> <mn>1</mn></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>,
it must be the case that the interval <!--l. 1993--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>I</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mi 
>p</mi></mrow><mrow 
><mi 
>e</mi><mi 
>m</mi><mi 
>p</mi></mrow></msub 
><mo 
class="MathClass-bin">/</mo><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>d</mi> <mo 
class="MathClass-bin">+</mo> <mn>1</mn></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
contains the interval <!--l. 1994--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><msub><mrow 
><mi 
>I</mi></mrow><mrow 
><mi 
>&#x03C7;</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mi 
>p</mi></mrow><mrow 
><mi 
>t</mi><mi 
>r</mi><mi 
>u</mi><mi 
>e</mi></mrow></msub 
><mo 
class="MathClass-bin">/</mo><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>d</mi> <mo 
class="MathClass-bin">+</mo> <mn>1</mn></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>.
Therefore, the supply given to any child in the empirical path is no greater than
the supply given in the true tree. <span class="qed"><span 
class="msam-10">&#x25AB;</span></span>
</p>
   </div>
<!--l. 1998--><p class="indent">   The corresponding relative entropy corollary is:
</p>
   <div class="newtheorem">
<!--l. 2000--><p class="noindent"><span class="head">
<a 
  name="x33-49041r4"></a>
  <span 
class="eccc-1000">C<small 
class="small-caps">O</small><small 
class="small-caps">R</small><small 
class="small-caps">O</small><small 
class="small-caps">L</small><small 
class="small-caps">L</small><small 
class="small-caps">A</small><small 
class="small-caps">R</small><small 
class="small-caps">Y</small> </span>5.3.4<span 
class="eccc-1000">.</span></span>
</p><!--l. 2001--><p class="indent">   <span 
class="ecti-1000">(Relative Entropy Microchoice Bound) For all hypothesis spaces, </span><!--l. 2002--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>H</mi></mrow></math><span 
class="ecti-1000">,</span>
<span 
class="ecti-1000">for all </span><!--l. 2002--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>&#x03B4;</mi> <mo 
class="MathClass-rel">&#x2208;</mo> <mrow><mo 
class="MathClass-open">(</mo><mrow><mn>0</mn><mo 
class="MathClass-punc">,</mo><mn>1</mn></mrow><mo 
class="MathClass-close">]</mo></mrow></mrow></math><span 
class="ecti-1000">,</span>
<!--l. 2003--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="display">    <mrow 
>
 <msub><mrow 
><mo 
>Pr</mo></mrow><mrow 
><msup><mrow 
><mi 
>D</mi></mrow><mrow 
><mi 
>m</mi></mrow></msup 
></mrow></msub 
> <mfenced separators="" 
open="("  close=")" ><mrow><mi 
>&#x2203;</mi><mi 
>h</mi> <mo 
class="MathClass-rel">&#x2208;</mo> <mi 
>H</mi> <mo 
class="MathClass-punc">:</mo>  <!--mstyle 
class="text"--><mtext class="textrm">KL</mtext><!--/mstyle--><mrow><mo 
class="MathClass-open">(</mo><mrow><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo>&#x0302;</mo></mover><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-rel">&#x2223;</mo><mo 
class="MathClass-rel">&#x2223;</mo><mi 
>e</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-rel">&#x003E;</mo> <mfrac><mrow 
><msubsup><mrow 
><mo 
class="MathClass-op">&#x2211;</mo>
  </mrow><mrow 
><mi 
>i</mi><mo 
class="MathClass-rel">=</mo><mn>1</mn></mrow><mrow 
><mi 
>d</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></msubsup 
><mo 
> ln</mo><!--nolimits--><mo 
class="MathClass-rel">&#x2223;</mo><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>C</mi></mrow><mo>&#x0302;</mo></mover></mrow><mrow 
><mi 
>i</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-rel">&#x2223;</mo> <mo 
class="MathClass-bin">+</mo><mo 
> ln</mo><!--nolimits--><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>d</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-bin">+</mo> <mn>1</mn></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-bin">+</mo><mo 
> ln</mo><!--nolimits--> <mfrac><mrow 
><mn>1</mn></mrow> 
<mrow 
><mi 
>&#x03B4;</mi></mrow></mfrac></mrow> 
                          <mrow 
><mi 
>m</mi></mrow></mfrac>                      </mrow></mfenced> <mo 
class="MathClass-rel">&#x2264;</mo> <mi 
>&#x03B4;</mi>
</mrow></math>
</p>
   </div>
<!--l. 2007--><p class="indent">   The bound in theorem ( <a 
href="#x33-49040r3">5.3.3<!--tex4ht:ref: th-amb --></a>) is very similar to ( <a 
href="thesisse22.xml#x32-40012r2">5.2.2<!--tex4ht:ref: th-smb --></a>)
except that the choice complexity is slightly worsened with the <!--l. 2008--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mo 
>ln</mo><!--nolimits--><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>d</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-bin">+</mo> <mn>1</mn></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math> term but improved by
replacing <!--l. 2009--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><msub><mrow 
><mi 
>C</mi></mrow><mrow 
><mi 
>i</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math> with
the smaller <!--l. 2009--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>C</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>i</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>.
</p>
   <h4 class="subsectionHead"><span class="titlemark">5.3.4. </span> <a 
  name="x33-500005.3.4"></a>Allowing batch queries</h4>
                                                                     

                                                                     
<!--l. 2014--><p class="noindent">Most natural Statistical Query algorithms make each choice based on responses to a <span 
class="ecti-1000">set </span>of
queries, not just one. For instance, to decide what variable to put at the top of a decision tree,
we ask <!--l. 2016--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>F</mi></mrow></math>
queries, one for each feature; we then choose the feature whose answer was most
&#x201C;interesting&#x201D;. This suggests generalizing the query tree model to allow each tree node to
contain a set of queries, executed in batch. Requiring each node in the query tree to
contain just a single query as in the above construction would result in an unfortunately
high branching factor just for the purpose of &#x201C;remembering&#x201D; the answers received so far.
<a 
href="thesis35.xml" name="thesis35.xml" ><sup>3</sup></a>
</p><!--l. 2033--><p class="indent">   Extending the algorithmic construction to allow for batch queries is easily done. If a node has <!--l. 2034--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>q</mi></mrow></math> queries <!--l. 2034--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msub><mrow 
><mi 
>&#x03C7;</mi></mrow><mrow 
><mn>1</mn></mrow></msub 
><mo 
class="MathClass-punc">,</mo><mo 
class="MathClass-op">&#x2026;</mo><mo 
class="MathClass-punc">,</mo><msub><mrow 
><mi 
>&#x03C7;</mi></mrow><mrow 
><mi 
>q</mi></mrow></msub 
></mrow></math>, we choose the query
confidence <!--l. 2035--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>&#x03B4;</mi> <mo 
class="MathClass-bin">&#x2217;</mo> <msup><mrow 
><mi 
>p</mi></mrow><mrow 
><mi 
>&#x2032;</mi></mrow></msup 
></mrow></math>
as before, but we now split the mass evenly among all <!--l. 2036--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>q</mi></mrow></math> queries. We
then let <!--l. 2036--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>k</mi></mrow></math>
be the number of children corresponding to answers to the queries <!--l. 2037--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msub><mrow 
><mi 
>&#x03C7;</mi></mrow><mrow 
><mn>1</mn></mrow></msub 
><mo 
class="MathClass-punc">,</mo><mo 
class="MathClass-op">&#x2026;</mo><mo 
class="MathClass-punc">,</mo><msub><mrow 
><mi 
>&#x03C7;</mi></mrow><mrow 
><mi 
>q</mi></mrow></msub 
></mrow></math> in the ranges
<!--l. 2038--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">     <mrow 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>I</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><msub><mrow 
><mi 
>&#x03C7;</mi></mrow><mrow 
><mn>1</mn></mrow></msub 
></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi><msup><mrow 
><mi 
>p</mi></mrow><mrow 
><mi 
>&#x2032;</mi></mrow></msup 
><mo 
class="MathClass-bin">/</mo><mi 
>q</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-punc">,</mo><mo 
class="MathClass-op">&#x2026;</mo><mo 
class="MathClass-punc">,</mo><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>I</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><msub><mrow 
><mi 
>&#x03C7;</mi></mrow><mrow 
><mi 
>q</mi></mrow></msub 
></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>&#x03B4;</mi><msup><mrow 
><mi 
>p</mi></mrow><mrow 
><mi 
>&#x2032;</mi></mrow></msup 
><mo 
class="MathClass-bin">/</mo><mi 
>q</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
respectively. We then go to the child corresponding to the answers we
actually received, and as before give the child a probability supply of <!--l. 2040--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>p</mi> <mo 
class="MathClass-bin">&#x2212;</mo> <msup><mrow 
><mi 
>p</mi></mrow><mrow 
><mi 
>&#x2032;</mi></mrow></msup 
></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-bin">/</mo><mi 
>k</mi></mrow></math>.
Theorem ( <a 
href="#x33-49040r3">5.3.3<!--tex4ht:ref: th-amb --></a>) holds exactly as before; the only change is that <!--l. 2041--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mo 
class="MathClass-rel">&#x2223;</mo><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>C</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>i</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-rel">&#x2223;</mo></mrow></math> means the size
of the <!--l. 2042--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>i</mi></mrow></math>-th
choice set in the batch tree rather than the size in the single-query-per-node
tree.
                                                                     

                                                                     
</p>
   <h4 class="subsectionHead"><span class="titlemark">5.3.5. </span> <a 
  name="x33-510005.3.5"></a>Example: Batch Queries for Decision trees</h4>
<!--l. 2048--><p class="noindent">When growing a decision tree, it is natural to make a batch of queries and then make a decision
about which feature to place in a node. The process is then repeated to grow the full tree
structure. As in the decision tree example described in the simple microchoice section, if we
have <!--l. 2051--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>F</mi></mrow></math>
features and are considering adding a node at depth <!--l. 2052--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>d</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>v</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>, there are
<!--l. 2052--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">     <mrow 
><mi 
>F</mi> <mo 
class="MathClass-bin">&#x2212;</mo> <mi 
>d</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>v</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-bin">+</mo> <mn>1</mn></mrow></math>
possible features that could be chosen for placement in a particular node.
The decision of which feature to use is made by comparing the results of <!--l. 2054--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>F</mi> <mo 
class="MathClass-bin">&#x2212;</mo> <mi 
>d</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>v</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-bin">+</mo> <mn>1</mn></mrow></math> queries
to pick the best feature according to some criteria, such as information gain. We can choose <!--l. 2056--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msup><mrow 
><mi 
>p</mi></mrow><mrow 
><mi 
>&#x2032;</mi></mrow></msup 
> <mo 
class="MathClass-rel">=</mo> <mi 
>p</mi><mo 
class="MathClass-bin">/</mo><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>d</mi> <mo 
class="MathClass-bin">+</mo> <mn>1</mn></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>, then further divide <!--l. 2056--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msup><mrow 
><mi 
>p</mi></mrow><mrow 
><mi 
>&#x2032;</mi></mrow></msup 
></mrow></math> into confidences of size <!--l. 2057--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msup><mrow 
><mi 
>p</mi></mrow><mrow 
><mi 
>&#x2032;</mi></mrow></msup 
><mo 
class="MathClass-bin">/</mo><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>F</mi> <mo 
class="MathClass-bin">&#x2212;</mo> <mi 
>d</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>v</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-bin">+</mo> <mn>1</mn></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>, placing each divided
confidence on one of the <!--l. 2058--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mi 
>F</mi> <mo 
class="MathClass-bin">&#x2212;</mo> <mi 
>d</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>v</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-bin">+</mo> <mn>1</mn></mrow></math>
statistical queries. We now may be able to eliminate some of the <!--l. 2059--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>F</mi> <mo 
class="MathClass-bin">&#x2212;</mo> <mi 
>d</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>v</mi></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-bin">+</mo> <mn>1</mn></mrow></math>
choices from consideration, allowing the remaining confidence, <!--l. 2060--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>p</mi> <mo 
class="MathClass-bin">&#x2212;</mo> <msup><mrow 
><mi 
>p</mi></mrow><mrow 
><mi 
>&#x2032;</mi></mrow></msup 
></mrow></math> to be
apportioned evenly amongst the remaining choices. Depending on the underlying
distribution this could substantially reduce the size of the choice set. The best case
occurs when one feature partitions all examples reaching the node perfectly and all other
features are independent of the target. In this case the choice set will have size <!--l. 2064--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mn>1</mn></mrow></math> if
there are enough examples.
</p>
   <h4 class="subsectionHead"><span class="titlemark">5.3.6. </span> <a 
  name="x33-520005.3.6"></a>Adaptive Microchoice vs.&#x00A0;Basic Microchoice</h4>
<!--l. 2069--><p class="noindent">The adaptive microchoice bound is a significant improvement over the simple microchoice
bound when the distribution is such that each choice is clear. For example, consider <!--l. 2071--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>F</mi></mrow></math> Boolean features
and <!--l. 2071--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mi 
>N</mi> <mo 
class="MathClass-rel">=</mo> <mi 
>O</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>F</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
examples. Suppose that one feature is identical to the label and all the rest of the
features are determined with a coin flip independent of the label.
</p><!--l. 2075--><p class="indent">   When we apply a decision tree to a data set generated with this distribution, what
will be the resulting bound? Given enough examples, with high probability
there will only be one significant choice for the first batch query: the feature
identical to the label. The second and third batch queries, corresponding
to the children of the root feature, will also have a choice space of size <!--l. 2079--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
                                                                     

                                                                     
<mrow 
><mn>1</mn></mrow></math> with
very high probability. The &#x201C;right&#x201D; choice will be the label value. Each choice set has size <!--l. 2081--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mn>1</mn></mrow></math> resulting in a
complexity of <!--l. 2081--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mo 
>ln</mo><!--nolimits--><mn>4</mn></mrow></math>
due to allocation of confidence to the statistical queries necessary for learning the decision tree. <!--l. 2083--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mo 
>ln</mo><!--nolimits--><mn>4</mn></mrow></math> is considerably
better than <!--l. 2083--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mo 
>ln</mo><!--nolimits--><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>F</mi> <mo 
class="MathClass-bin">+</mo> <mn>2</mn></mrow><mo 
class="MathClass-close">)</mo></mrow> <mo 
class="MathClass-bin">+</mo> <mn>2</mn><mo 
>ln</mo><!--nolimits--><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>F</mi> <mo 
class="MathClass-bin">+</mo> <mn>1</mn></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
which the simple version of the microchoice bound provides. Note that the
complexity reduction only occurs with a large enough number of examples <!--l. 2085--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mi 
>m</mi></mrow></math> implying that the
value of <!--l. 2086--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mover 
accent="true"><mrow 
><mi 
>e</mi></mrow><mo 
class="MathClass-op">&#x0304;</mo></mover></mrow></math>
calculated can improve faster than (inverse) linearly in the number of examples.
</p><!--l. 2089--><p class="indent">   The adaptive microchoice bound is never much looser than the simple microchoice
bound because under the assumption that choice sets are of size at least <!--l. 2090--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mn>2</mn></mrow></math>, the penalty for using the
adaptive version, <!--l. 2091--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mo 
>ln</mo><!--nolimits--><mi 
>d</mi></mrow></math>,
is always small compared to the complexity term for the simple microchoice bound, <!--l. 2092--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msubsup><mrow 
><mo 
class="MathClass-op">&#x2211;</mo>
  </mrow><mrow 
><mi 
>i</mi><mo 
class="MathClass-rel">=</mo><mn>1</mn></mrow><mrow 
><mi 
>d</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></msubsup 
><mo 
> ln</mo><!--nolimits--><mo 
class="MathClass-rel">&#x2223;</mo><msub><mrow 
><mi 
>C</mi></mrow><mrow 
><mi 
>i</mi></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-rel">&#x2223;</mo></mrow></math>.
</p>
   <h4 class="subsectionHead"><span class="titlemark">5.3.7. </span> <a 
  name="x33-530005.3.7"></a>Other Adaptive Microchoice</h4>
<!--l. 2097--><p class="noindent">The adaptive microchoice bound provides a simple scheme for dividing confidence
between choices and queries. There are other choices which may be useful in some
settings. Any scheme which <span 
class="ecti-1000">a priori </span>divides the confidence between queries and
choices at every node will generally work. Here are two schemes which may be
useful:
</p><!--l. 2103--><p class="indent">
           </p><ul class="itemize1">
        <li class="itemize">Assign a constant proportion of confidence to the query. This scheme is
        more aggressive than the one used in the adaptive microchoice bounds and
        may result in a lower complexity when many choices are eliminatable. The
        drawback is we no longer get the telescoping in equation ( <a 
href="#x33-49039r1">5.3.1<!--tex4ht:ref: eqn:adaptiveMC --></a>) and so the
        term logarithmic in <!--l. 2108--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>d</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>
        in theorem ( <a 
href="#x33-49041r4">5.3.4<!--tex4ht:ref: th-reamb --></a>) becomes linear in <!--l. 2109--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><mi 
>d</mi><mrow><mo 
class="MathClass-open">(</mo><mrow><mi 
>h</mi></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>.
           </li>
        <li class="itemize">For a decision tree, assign a portion dependent on the depth of the node in
        the decision tree that the choice set is over. It is unlikely that choices are
        eliminatable from nodes not near the root because the number of examples
        available at a node typically decays exponentially with the (decision tree)
        depth. A progressive scheme which allocates less confidence to queries for
        deep nodes will probably behave better in practice.</li></ul>
                                                                     

                                                                     
<!--l. 2116--><p class="nopar">
</p>
   <h4 class="subsectionHead"><span class="titlemark">5.3.8. </span> <a 
  name="x33-540005.3.8"></a>Comparison with Freund&#x2019;s Self-Bounding algorithms</h4>
<!--l. 2120--><p class="noindent">Freund&#x2019;s approach for self-bounding learning algorithms can require exponentially more
computation then the microchoice approach. In its basic form, it requires explicit
construction of every path in the state space of the algorithm not pruned in the tree.
There exist some learning algorithms where this process can be done implicitly
making the computation feasible. However, in general this does not appear to be
possible.
</p><!--l. 2127--><p class="indent">   The adaptive microchoice bound only requires explicit construction of the size of
each subset from which a choice is made. Because many common learning algorithms
work by a process of making choices from small subsets, this is often computationally
easy. The adaptive microchoice bound does poorly, however, when Freund&#x2019;s query
tree has a high degree of sharing; for example, when many nodes of the tree
correspond to the same query, or many leaves of the tree have the same final
hypothesis. Allowing batch queries alleviates the most egregious examples of this. It
is also possible to interpolate between the adaptive microchoice bound and
Freund&#x2019;s bound by a process of conglomerating the subsets of the microchoice
bound.
</p>
   <h4 class="subsectionHead"><span class="titlemark">5.3.9. </span> <a 
  name="x33-550005.3.9"></a>Choice Set Conglomeration </h4>
<!--l. 2141--><p class="noindent">The mechanism of choice set conglomeration is a similar to the batch query technique. It
allows you to trade increased computation for a tighter bound. When starting with the
simple microchoice bound, this technique can smoothly interpolate with the discrete
hypothesis bound ( <a 
href="thesisse16.xml#x23-32001r1">4.2.1<!--tex4ht:ref: th-DHSCP --></a>). When starting with the adaptive microchoice bound, we can
interpolate with Freund&#x2019;s bound.
</p><!--l. 2147--><p class="indent">   Consider a particular choice set, <!--l. 2147--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">      <mrow 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>C</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>i</mi></mrow></msub 
></mrow></math>,
with elements <!--l. 2147--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><msub><mrow 
><mi 
>c</mi></mrow><mrow 
><mi 
>i</mi></mrow></msub 
></mrow></math>.
Each <!--l. 2148--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><msub><mrow 
><mi 
>c</mi></mrow><mrow 
><mi 
>i</mi></mrow></msub 
></mrow></math> indexes
another choice set, <!--l. 2148--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>C</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>i</mi><mo 
class="MathClass-bin">+</mo><mn>1</mn></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mi 
>c</mi></mrow><mrow 
><mi 
>i</mi></mrow></msub 
></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>.
If the computational resources exist to calculate the union <!--l. 2149--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>C</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>i</mi><mo 
class="MathClass-punc">,</mo><mi 
>i</mi><mo 
class="MathClass-bin">+</mo><mn>1</mn></mrow></msub 
> <mo 
class="MathClass-rel">=</mo><msub><mrow 
> <mo 
class="MathClass-op">&#x22C3;</mo>
  </mrow><mrow 
><msub><mrow 
><mi 
>c</mi></mrow><mrow 
><mi 
>i</mi></mrow></msub 
><mo 
class="MathClass-rel">&#x2208;</mo><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>C</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>i</mi></mrow></msub 
></mrow></msub 
><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>C</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>i</mi><mo 
class="MathClass-bin">+</mo><mn>1</mn></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mi 
>c</mi></mrow><mrow 
><mi 
>i</mi></mrow></msub 
></mrow><mo 
class="MathClass-close">)</mo></mrow></mrow></math>, then <!--l. 2150--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mo 
class="MathClass-rel">&#x2223;</mo><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>C</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>i</mi><mo 
class="MathClass-punc">,</mo><mi 
>i</mi><mo 
class="MathClass-bin">+</mo><mn>1</mn></mrow></msub 
><mo 
class="MathClass-rel">&#x2223;</mo></mrow></math> can be used in
place of <!--l. 2150--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mo 
class="MathClass-rel">&#x2223;</mo><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>C</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>i</mi></mrow></msub 
><mo 
class="MathClass-rel">&#x2223;</mo><mo 
class="MathClass-punc">&#x22C5;</mo><mo 
class="MathClass-rel">&#x2223;</mo><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>C</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>i</mi><mo 
class="MathClass-bin">+</mo><mn>1</mn></mrow></msub 
><mo 
class="MathClass-rel">&#x2223;</mo></mrow></math>
in the adaptive microchoice bound. The conglomeration can be done repeatedly to build
large choice sets and also applies to the simple microchoice bound ( <a 
href="thesisse22.xml#x32-40012r2">5.2.2<!--tex4ht:ref: th-smb --></a>).
Conglomeration can be useful for tightening the bound when there are multiple choice
sequences leading to the same hypothesis. However, choice set conglomeration is not
always helpful because it trades away the fine granularity of the microchoice bound. The
                                                                     

                                                                     
extreme case where all choice sets are conglomerated into one choice set and
every hypothesis and query have the same weight is equivalent to Freund&#x2019;s
bound.
</p><!--l. 2160--><p class="indent">   When the choices of the attached choice sets are all different, conglomeration will have little
use because the size of the union of the choice sets is the sum of the sizes of each choice set <!--l. 2162--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mo 
class="MathClass-rel">&#x2223;</mo><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>C</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>i</mi><mo 
class="MathClass-punc">,</mo><mi 
>i</mi><mo 
class="MathClass-bin">+</mo><mn>1</mn></mrow></msub 
><mo 
class="MathClass-rel">&#x2223;</mo> <mo 
class="MathClass-rel">=</mo><msub><mrow 
> <mo 
class="MathClass-op">&#x2211;</mo>
  </mrow><mrow 
><msub><mrow 
><mi 
>c</mi></mrow><mrow 
><mi 
>i</mi></mrow></msub 
><mo 
class="MathClass-rel">&#x2208;</mo><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>C</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>i</mi></mrow></msub 
></mrow></msub 
><mo 
class="MathClass-rel">&#x2223;</mo><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>C</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>i</mi><mo 
class="MathClass-bin">+</mo><mn>1</mn></mrow></msub 
><mrow><mo 
class="MathClass-open">(</mo><mrow><msub><mrow 
><mi 
>c</mi></mrow><mrow 
><mi 
>i</mi></mrow></msub 
></mrow><mo 
class="MathClass-close">)</mo></mrow><mo 
class="MathClass-rel">&#x2223;</mo></mrow></math>. If the child sets each have
the same size <!--l. 2163--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mo 
class="MathClass-rel">&#x2223;</mo><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>C</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>i</mi><mo 
class="MathClass-bin">+</mo><mn>1</mn></mrow></msub 
><mo 
class="MathClass-rel">&#x2223;</mo></mrow></math> then
this simplifies to <!--l. 2164--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">       <mrow 
><mo 
class="MathClass-rel">&#x2223;</mo><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>C</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>i</mi><mo 
class="MathClass-punc">,</mo><mi 
>i</mi><mo 
class="MathClass-bin">+</mo><mn>1</mn></mrow></msub 
><mo 
class="MathClass-rel">&#x2223;</mo> <mo 
class="MathClass-rel">=</mo> <mo 
class="MathClass-rel">&#x2223;</mo><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>C</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>i</mi></mrow></msub 
><mo 
class="MathClass-rel">&#x2223;</mo><mo 
class="MathClass-punc">&#x22C5;</mo><mo 
class="MathClass-rel">&#x2223;</mo><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>C</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>i</mi><mo 
class="MathClass-bin">+</mo><mn>1</mn></mrow></msub 
><mo 
class="MathClass-rel">&#x2223;</mo></mrow></math>
which results in the same confidence applied to each choice whether conglomerating
or not. The best case for conglomeration is equivalent to the batch query
case: every sub-choice set contains the same elements. Then we have <!--l. 2167--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">
<mrow 
><mo 
class="MathClass-rel">&#x2223;</mo><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>C</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>i</mi><mo 
class="MathClass-punc">,</mo><mi 
>i</mi><mo 
class="MathClass-bin">+</mo><mn>1</mn></mrow></msub 
><mo 
class="MathClass-rel">&#x2223;</mo> <mo 
class="MathClass-rel">=</mo> <mo 
class="MathClass-rel">&#x2223;</mo><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>C</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>i</mi><mo 
class="MathClass-bin">+</mo><mn>1</mn></mrow></msub 
><mo 
class="MathClass-rel">&#x2223;</mo></mrow></math> and can pay no cost for
the choice set <!--l. 2168--><math 
xmlns="http://www.w3.org/1998/Math/MathML" 
mode="inline">        <mrow 
><mo 
class="MathClass-rel">&#x2223;</mo><msub><mrow 
><mover 
accent="true"><mrow 
><mi 
>C</mi></mrow><mo 
class="MathClass-op">&#x0302;</mo></mover></mrow><mrow 
><mi 
>i</mi></mrow></msub 
><mo 
class="MathClass-rel">&#x2223;</mo></mrow></math>.
</p><!--l. 2171--><p class="indent">
                                                                     

                                                                     
</p>
   <div class="crosslinks"><p class="noindent">[<a 
href="thesisse24.xml" >next</a>] [<a 
href="thesisse22.xml" >prev</a>] [<a 
href="thesisse22.xml#tailthesisse22.xml" >prev-tail</a>] [<a 
href="thesisse23.xml" >front</a>] [<a 
href="thesisch5.xml#thesisse23.xml" >up</a>] </p></div><a 
  name="tailthesisse23.xml"></a>   
</body> 
</html> 
