<?xml version="1.0" encoding="utf-8"?>
<document xmlns="http://cnx.rice.edu/cnxml" xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:md="http://cnx.rice.edu/mdml/0.4" xmlns:bib="http://bibtexml.sf.net/" xmlns:q="http://cnx.rice.edu/qml/1.0" id="new" module-id="" cnxml-version="0.6">
  <title>F Distribution and ANOVA: The F Distribution And The F Ratio</title>
  <metadata xmlns:md="http://cnx.rice.edu/mdml/0.4">
  <!-- WARNING! The 'metadata' section is read only. Do not edit below.
       Changes to the metadata section in the source will not be saved. -->
  <md:content-id>m17076</md:content-id>
  <md:title>F Distribution and ANOVA: The F Distribution And The F Ratio</md:title>
  <md:version>1.7</md:version>
  <md:created>2008/06/23 14:40:52 GMT-5</md:created>
  <md:revised>2009/02/05 18:35:58.974 US/Central</md:revised>
  <md:authorlist>
    <md:author id="sdean">
        <md:firstname>Susan</md:firstname>
        <md:surname>Dean</md:surname>
        <md:fullname>Susan Dean</md:fullname>
        <md:email>deansusan@deanza.edu</md:email>
    </md:author>
    <md:author id="billowsky">
        <md:firstname>Barbara</md:firstname>
        <md:surname>Illowsky</md:surname>
        <md:fullname>Dr. Barbara Illowsky</md:fullname>
        <md:email>illowskybarbara@deanza.edu</md:email>
    </md:author>
  </md:authorlist>
  <md:maintainerlist>
    <md:maintainer id="sdean">
        <md:firstname>Susan</md:firstname>
        <md:surname>Dean</md:surname>
        <md:fullname>Susan Dean</md:fullname>
        <md:email>deansusan@deanza.edu</md:email>
    </md:maintainer>
    <md:maintainer id="billowsky">
        <md:firstname>Barbara</md:firstname>
        <md:surname>Illowsky</md:surname>
        <md:fullname>Dr. Barbara Illowsky</md:fullname>
        <md:email>illowskybarbara@deanza.edu</md:email>
    </md:maintainer>
    <md:maintainer id="cnxorg">
        <md:firstname/>
        <md:surname>Connexions</md:surname>
        <md:fullname>Connexions</md:fullname>
        <md:email>cnx@cnx.org</md:email>
    </md:maintainer>
  </md:maintainerlist>
  <md:license href="http://creativecommons.org/licenses/by/2.0/"/>
  <md:licensorlist>
    <md:licensor id="MaxfieldFoundation">
        <md:firstname/>
        <md:surname>Maxfield Foundation</md:surname>
        <md:fullname>Maxfield Foundation</md:fullname>
        <md:email>cnx@cnx.org</md:email>
    </md:licensor>
  </md:licensorlist>
  <md:keywordlist>
    <md:keyword>alternate hypothesis</md:keyword>
    <md:keyword>ANOVA</md:keyword>
    <md:keyword>degrees of freedom</md:keyword>
    <md:keyword>F Distribution</md:keyword>
    <md:keyword>F Ratio</md:keyword>
    <md:keyword>hypothesis test</md:keyword>
    <md:keyword>means square</md:keyword>
    <md:keyword>null hypothesis</md:keyword>
    <md:keyword>One-Way Analysis of Variance</md:keyword>
    <md:keyword>population</md:keyword>
    <md:keyword>sample</md:keyword>
    <md:keyword>Sir Ronald Fisher</md:keyword>
    <md:keyword>statistics</md:keyword>
    <md:keyword>sum of squares</md:keyword>
    <md:keyword>variance</md:keyword>
  </md:keywordlist>
  <md:subjectlist>
    <md:subject>Mathematics and Statistics</md:subject>
  </md:subjectlist>
  <md:abstract>This module describes how to calculate the F Ratio and F Distribution based on the hypothesis test for the ANOVA.</md:abstract>
  <md:language>en</md:language>
  <!-- WARNING! The 'metadata' section is read only. Do not edit above.
       Changes to the metadata section in the source will not be saved. -->
</metadata>
  <content>
    <para id="delete_me">The distribution used for the hypothesis test is a new one. It is called the F distribution,
named after Sir Ronald Fisher, an English statistician. The F statistic is a ratio (a
fraction). There are two sets of degrees of freedom: one for the numerator and one for
the denominator.</para><para id="element-674">For example, if 
<m:math>
<m:mi>F</m:mi>
</m:math> follows an <m:math>
<m:mi>F</m:mi>
</m:math> distribution and the degrees of freedom for the
numerator are 4 and the degrees of freedom for the denominator are 10, then
<m:math>
<m:mi>F</m:mi></m:math> ~
<m:math>
<m:msub>
<m:mi>F</m:mi>
<m:mrow>
<m:mn>4</m:mn>
<m:mo>,</m:mo>
<m:mn>10</m:mn>
</m:mrow>
</m:msub>
</m:math>.</para><para id="element-967">To calculate the <m:math>
<m:mi>F</m:mi>
</m:math> 
ratio, two estimates of the variance are made.</para><list id="element-236" list-type="enumerated"><item><emphasis>Variance between samples:</emphasis> An estimate of <m:math>
<m:msup>
<m:mi>σ</m:mi>
<m:mn>2</m:mn>
</m:msup>
</m:math> that is the variance of the sample
means multiplied by the common sample size <m:math><m:mi>n</m:mi></m:math> (when the samples are the same size). If the samples are different sizes, the variance between samples is weighted to
account for the different sample sizes. This variance is also called <emphasis>variation due to treatment or
explained
variation.</emphasis></item>
<item><emphasis>Variance within samples:</emphasis> An estimate of <m:math>
<m:msup>
<m:mi>σ</m:mi>
<m:mn>2</m:mn>
</m:msup>
</m:math> that is the average of the sample
variances (also known as a pooled variance). When the sample sizes are different, the
variance within samples is weighted. This variance is also called the <emphasis>variation due to error or
unexplained variation.</emphasis></item>
</list><list id="eip-197"><item><m:math>
<m:msub>
<m:mi>SS</m:mi>
<m:mtext>between</m:mtext>
</m:msub>
<m:mo>=</m:mo>
</m:math> the sum of squares that represents the variation among the different
samples.</item>

<item><m:math>
<m:msub>
<m:mi>SS</m:mi>
<m:mtext>within</m:mtext>
</m:msub>
<m:mo>=</m:mo>
</m:math> the sum of squares that represents the variation within samples that is
due to chance.</item></list><para id="eip-668">To find a "sum of squares" means to add together squared quantities which, in some
cases, may be weighted. We used sums of squares to calculate the sample variance and
the sample standard deviation in <emphasis>Descriptive Statistics</emphasis>.</para><para id="eip-39"><m:math><m:mi>MS</m:mi></m:math> means "mean square." 
<m:math>
<m:msub>
<m:mi>MS</m:mi>
<m:mtext>between</m:mtext>
</m:msub>
</m:math> is the variance between groups and
<m:math>
<m:msub>
<m:mi>MS</m:mi>
<m:mtext>within</m:mtext>
</m:msub>
</m:math> is the variance within groups.</para><list id="eip-198"><title>Calculation of Sum of Squares and Mean Square</title><item><m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mi>k</m:mi></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{k} {}</m:annotation></m:semantics></m:math> = the number of different groups</item>

<item><m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:msub><m:mi>n</m:mi><m:mstyle fontsize="8pt"><m:mrow><m:mi>j</m:mi></m:mrow></m:mstyle></m:msub></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{n rSub { size 8{j} } } {}</m:annotation></m:semantics></m:math> = the size of the 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mstyle fontstyle="italic"><m:mrow><m:mtext>jth</m:mtext></m:mrow></m:mstyle></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{ ital "jth"} {}</m:annotation></m:semantics></m:math> group</item>

<item><m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:msub><m:mi>s</m:mi><m:mstyle fontsize="8pt"><m:mrow><m:mi>j</m:mi></m:mrow></m:mstyle></m:msub></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{s rSub { size 8{j} } } {}</m:annotation></m:semantics></m:math> = the sum of the values in the 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mstyle fontstyle="italic"><m:mrow><m:mtext>jth</m:mtext></m:mrow></m:mstyle></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{ ital "jth"} {}</m:annotation></m:semantics></m:math> group</item>

<item><m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mi>N</m:mi></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{N} {}</m:annotation></m:semantics></m:math> = the total number of values in all groups combined (total sample size: <m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mo stretchy="false">∑</m:mo><m:msub><m:mi>n</m:mi><m:mstyle fontsize="8pt"><m:mrow><m:mi>j</m:mi></m:mrow></m:mstyle></m:msub></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{ Sum {n rSub { size 8{j} } } } {}</m:annotation></m:semantics></m:math>)
</item>



<item><m:math><m:mi>x</m:mi></m:math> = one value:  
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mrow><m:mo stretchy="false">∑</m:mo><m:mi>x</m:mi></m:mrow><m:mo stretchy="false">=</m:mo><m:mrow><m:mo stretchy="false">∑</m:mo><m:msub><m:mi>s</m:mi><m:mstyle fontsize="8pt"><m:mrow><m:mi>j</m:mi></m:mrow></m:mstyle></m:msub></m:mrow></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{ Sum {x} = Sum {s rSub { size 8{j} } } } {}</m:annotation></m:semantics></m:math></item>

<item>Sum of squares of all values from every group combined: <m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mrow><m:mo stretchy="false">∑</m:mo><m:msup><m:mi>x</m:mi><m:mstyle fontsize="8pt"><m:mrow><m:mn>2</m:mn></m:mrow></m:mstyle></m:msup></m:mrow><m:mrow/></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{ Sum {x rSup { size 8{2} } } ={}} {}</m:annotation></m:semantics></m:math></item>

<item>Total variation: <m:math>
        <m:semantics>
          <m:mrow>
            <m:mstyle fontsize="12pt">
              <m:mrow>
                <m:mrow>
                  <m:mstyle fontstyle="italic">
                    <m:mrow>
                      <m:msub>
                        <m:mtext>SS</m:mtext>
                        <m:mstyle fontsize="8pt">
                          <m:mrow>
                            <m:mstyle fontstyle="italic">
                              <m:mrow>
                                <m:mtext>total</m:mtext>
                              </m:mrow>
                            </m:mstyle>
                          </m:mrow>
                        </m:mstyle>
                      </m:msub>
                    </m:mrow>
                  </m:mstyle>
                  <m:mo stretchy="false">=</m:mo>
                  <m:mrow>
                    <m:mrow>
                      <m:mo stretchy="false">∑</m:mo>
                      <m:msup>
                        <m:mi>x</m:mi>
                        <m:mstyle fontsize="8pt">
                          <m:mrow>
                            <m:mn>2</m:mn>
                          </m:mrow>
                        </m:mstyle>
                      </m:msup>
                    </m:mrow>
                    <m:mo stretchy="false">−</m:mo>
                    <m:mfrac>
                      <m:msup>
                        <m:mfenced open="(" close=")">
                          <m:mrow>
                            <m:mo stretchy="false">∑</m:mo>
                            <m:mi>x</m:mi>
                          </m:mrow>
                        </m:mfenced>
                        <m:mstyle fontsize="8pt">
                          <m:mrow>
                            <m:mn>2</m:mn>
                          </m:mrow>
                        </m:mstyle>
                      </m:msup>
                      <m:mi>N</m:mi>
                    </m:mfrac>
                  </m:mrow>
                </m:mrow>
              </m:mrow>
            </m:mstyle>
            <m:mrow/>
          </m:mrow>
          <m:annotation encoding="StarMath 5.0"> size 12{ ital "SS" rSub { size 8{ ital "total"} } = Sum {x rSup { size 8{2} } }  -  {  { left ( Sum {x}  right ) rSup { size 8{2} } }  over  {N} } } {}</m:annotation>
        </m:semantics>
      </m:math>
</item>

<item>Total sum of squares: 
      <m:math>
        <m:semantics>
          <m:mrow>
            <m:mstyle fontsize="12pt">
              <m:mrow>
                <m:mrow>
                  <m:mrow>
                    <m:mo stretchy="false">∑</m:mo>
                    <m:msup>
                      <m:mi>x</m:mi>
                      <m:mstyle fontsize="8pt">
                        <m:mrow>
                          <m:mn>2</m:mn>
                        </m:mrow>
                      </m:mstyle>
                    </m:msup>
                  </m:mrow>
                  <m:mo stretchy="false">−</m:mo>
                  <m:mfrac>
                    <m:mrow>
                      <m:mo stretchy="false">(</m:mo>
                      <m:mrow>
                        <m:mo stretchy="false">∑</m:mo>
                        <m:mi>x</m:mi>
                      </m:mrow>
                      <m:msup>
                        <m:mo stretchy="false">)</m:mo>
                        <m:mstyle fontsize="8pt">
                          <m:mrow>
                            <m:mn>2</m:mn>
                          </m:mrow>
                        </m:mstyle>
                      </m:msup>
                    </m:mrow>
                    <m:mi>N</m:mi>
                  </m:mfrac>
                </m:mrow>
              </m:mrow>
            </m:mstyle>
            <m:mrow/>
          </m:mrow>
          <m:annotation encoding="StarMath 5.0"> size 12{ Sum {x rSup { size 8{2} } }  -  {  { \(  Sum {x}  \)  rSup { size 8{2} } }  over  {N} } } {}</m:annotation>
        </m:semantics>
      </m:math>
    </item>

<item>Explained variation: sum of squares representing variation among the different samples:
      <m:math>
        <m:semantics>
          <m:mrow>
            <m:mstyle fontsize="12pt">
              <m:mrow>
                <m:mrow>
                  <m:mstyle fontstyle="italic">
                    <m:mrow>
                      <m:msub>
                        <m:mtext>SS</m:mtext>
                        <m:mstyle fontsize="8pt">
                          <m:mrow>
                            <m:mstyle fontstyle="italic">
                              <m:mrow>
                                <m:mtext>between</m:mtext>
                              </m:mrow>
                            </m:mstyle>
                          </m:mrow>
                        </m:mstyle>
                      </m:msub>
                    </m:mrow>
                  </m:mstyle>
                  <m:mo stretchy="false">=</m:mo>
                  <m:mrow>
                    <m:mrow>
                      <m:mo stretchy="false">∑</m:mo>
                      <m:mrow>
                        <m:mo stretchy="false">[</m:mo>
                        <m:mfrac>
                          <m:mrow>
                            <m:mo stretchy="false">(</m:mo>
                            <m:msub>
                              <m:mi>s</m:mi>
                              <m:mstyle fontsize="8pt">
                                <m:mrow>
                                  <m:mi>j</m:mi>
                                </m:mrow>
                              </m:mstyle>
                            </m:msub>
                            <m:msup>
                              <m:mo stretchy="false">)</m:mo>
                              <m:mstyle fontsize="8pt">
                                <m:mrow>
                                  <m:mn>2</m:mn>
                                </m:mrow>
                              </m:mstyle>
                            </m:msup>
                          </m:mrow>
                          <m:msub>
                            <m:mi>n</m:mi>
                            <m:mstyle fontsize="8pt">
                              <m:mrow>
                                <m:mi>j</m:mi>
                              </m:mrow>
                            </m:mstyle>
                          </m:msub>
                        </m:mfrac>
                        <m:mo stretchy="false">]</m:mo>
                      </m:mrow>
                    </m:mrow>
                    <m:mo stretchy="false">−</m:mo>
                    <m:mfrac>
                      <m:mrow>
                        <m:mo stretchy="false">(</m:mo>
                        <m:mrow>
                          <m:mo stretchy="false">∑</m:mo>
                          <m:mrow>
                            <m:msub>
                              <m:mi>s</m:mi>
                              <m:mstyle fontsize="8pt">
                                <m:mrow>
                                  <m:mi>j</m:mi>
                                </m:mrow>
                              </m:mstyle>
                            </m:msub>
                            <m:msup>
                              <m:mo stretchy="false">)</m:mo>
                              <m:mstyle fontsize="8pt">
                                <m:mrow>
                                  <m:mn>2</m:mn>
                                </m:mrow>
                              </m:mstyle>
                            </m:msup>
                          </m:mrow>
                        </m:mrow>
                      </m:mrow>
                      <m:mi>N</m:mi>
                    </m:mfrac>
                  </m:mrow>
                </m:mrow>
              </m:mrow>
            </m:mstyle>
            <m:mrow/>
          </m:mrow>
          <m:annotation encoding="StarMath 5.0"> size 12{ ital "SS" rSub { size 8{ ital "between"} } = Sum { \[  {  { \(  s rSub { size 8{j} } \)  rSup { size 8{2} } }  over  {n rSub { size 8{j} } } }  \] }  -  {  { \(  Sum {s rSub { size 8{j} }  \)  rSup { size 8{2} } } }  over  {N} } } {}</m:annotation>
        </m:semantics>
      </m:math>
</item>

<item>Unexplained variation: sum of squares representing variation within samples due to chance: 
      <m:math>
        <m:semantics>
          <m:mrow>
            <m:mstyle fontsize="12pt">
              <m:mrow>
                <m:mrow>
                  <m:mstyle fontstyle="italic">
                    <m:mrow>
                      <m:msub>
                        <m:mtext>SS</m:mtext>
                        <m:mstyle fontsize="8pt">
                          <m:mrow>
                            <m:mstyle fontstyle="italic">
                              <m:mrow>
                                <m:mtext>within</m:mtext>
                              </m:mrow>
                            </m:mstyle>
                          </m:mrow>
                        </m:mstyle>
                      </m:msub>
                    </m:mrow>
                  </m:mstyle>
                  <m:mo stretchy="false">=</m:mo>
                  <m:mrow>
                    <m:mstyle fontstyle="italic">
                      <m:mrow>
                        <m:msub>
                          <m:mtext>SS</m:mtext>
                          <m:mstyle fontsize="8pt">
                            <m:mrow>
                              <m:mstyle fontstyle="italic">
                                <m:mrow>
                                  <m:mtext>total</m:mtext>
                                </m:mrow>
                              </m:mstyle>
                            </m:mrow>
                          </m:mstyle>
                        </m:msub>
                      </m:mrow>
                    </m:mstyle>
                    <m:mo stretchy="false">−</m:mo>
                    <m:mstyle fontstyle="italic">
                      <m:mrow>
                        <m:msub>
                          <m:mtext>SS</m:mtext>
                          <m:mstyle fontsize="8pt">
                            <m:mrow>
                              <m:mstyle fontstyle="italic">
                                <m:mrow>
                                  <m:mtext>between</m:mtext>
                                </m:mrow>
                              </m:mstyle>
                            </m:mrow>
                          </m:mstyle>
                        </m:msub>
                      </m:mrow>
                    </m:mstyle>
                  </m:mrow>
                </m:mrow>
              </m:mrow>
            </m:mstyle>
            <m:mrow/>
          </m:mrow>
          <m:annotation encoding="StarMath 5.0"> size 12{ ital "SS" rSub { size 8{ ital "within"} } = ital "SS" rSub { size 8{ ital "total"} }  -  ital "SS" rSub { size 8{ ital "between"} } } {}</m:annotation>
        </m:semantics>
      </m:math>  
</item>

<item>Degrees of freedom for the different groups (df for the numerator):
      <m:math>
        <m:semantics>
          <m:mrow>
            <m:mstyle fontsize="12pt">
              <m:mrow>
                <m:mrow>
                  <m:mstyle fontstyle="italic">
                    <m:mrow>
                      <m:msub>
                        <m:mtext>df</m:mtext>
                        <m:mstyle fontsize="8pt">
                          <m:mrow>
                            <m:mstyle fontstyle="italic">
                              <m:mrow>
                                <m:mtext>between</m:mtext>
                              </m:mrow>
                            </m:mstyle>
                          </m:mrow>
                        </m:mstyle>
                      </m:msub>
                    </m:mrow>
                  </m:mstyle>
                  <m:mo stretchy="false">=</m:mo>
                  <m:mrow>
                    <m:mi>k</m:mi>
                    <m:mo stretchy="false">−</m:mo>
                    <m:mn>1</m:mn>
                  </m:mrow>
                </m:mrow>
              </m:mrow>
            </m:mstyle>
            <m:mrow/>
          </m:mrow>
          <m:annotation encoding="StarMath 5.0"> size 12{ ital "df" rSub { size 8{ ital "between"} } =k - 1} {}</m:annotation>
        </m:semantics>
      </m:math>
    </item>

<item>
Degrees of freedom for errors within samples (df for the denominator):
      <m:math>
        <m:semantics>
          <m:mrow>
            <m:mstyle fontsize="12pt">
              <m:mrow>
                <m:mrow>
                  <m:mstyle fontstyle="italic">
                    <m:mrow>
                      <m:msub>
                        <m:mtext>df</m:mtext>
                        <m:mstyle fontsize="8pt">
                          <m:mrow>
                            <m:mstyle fontstyle="italic">
                              <m:mrow>
                                <m:mtext>within</m:mtext>
                              </m:mrow>
                            </m:mstyle>
                          </m:mrow>
                        </m:mstyle>
                      </m:msub>
                    </m:mrow>
                  </m:mstyle>
                  <m:mo stretchy="false">=</m:mo>
                  <m:mrow>
                    <m:mi>N</m:mi>
                    <m:mo stretchy="false">−</m:mo>
                    <m:mi>k</m:mi>
                  </m:mrow>
                </m:mrow>
              </m:mrow>
            </m:mstyle>
            <m:mrow/>
          </m:mrow>
          <m:annotation encoding="StarMath 5.0"> size 12{ ital "df" rSub { size 8{ ital "within"} } =N - k} {}</m:annotation>
        </m:semantics>
      </m:math></item>

<item>Mean square (variance estimate) explained by the different groups: 
      <m:math>
        <m:semantics>
          <m:mrow>
            <m:mstyle fontsize="12pt">
              <m:mrow>
                <m:mrow>
                  <m:mstyle fontstyle="italic">
                    <m:mrow>
                      <m:msub>
                        <m:mtext>MS</m:mtext>
                        <m:mstyle fontsize="8pt">
                          <m:mrow>
                            <m:mstyle fontstyle="italic">
                              <m:mrow>
                                <m:mtext>between</m:mtext>
                              </m:mrow>
                            </m:mstyle>
                          </m:mrow>
                        </m:mstyle>
                      </m:msub>
                    </m:mrow>
                  </m:mstyle>
                  <m:mo stretchy="false">=</m:mo>
                  <m:mfrac>
                    <m:mstyle fontstyle="italic">
                      <m:mrow>
                        <m:msub>
                          <m:mtext>SS</m:mtext>
                          <m:mstyle fontsize="8pt">
                            <m:mrow>
                              <m:mstyle fontstyle="italic">
                                <m:mrow>
                                  <m:mtext>between</m:mtext>
                                </m:mrow>
                              </m:mstyle>
                            </m:mrow>
                          </m:mstyle>
                        </m:msub>
                      </m:mrow>
                    </m:mstyle>
                    <m:mstyle fontstyle="italic">
                      <m:mrow>
                        <m:msub>
                          <m:mtext>df</m:mtext>
                          <m:mstyle fontsize="8pt">
                            <m:mrow>
                              <m:mstyle fontstyle="italic">
                                <m:mrow>
                                  <m:mtext>between</m:mtext>
                                </m:mrow>
                              </m:mstyle>
                            </m:mrow>
                          </m:mstyle>
                        </m:msub>
                      </m:mrow>
                    </m:mstyle>
                  </m:mfrac>
                </m:mrow>
              </m:mrow>
            </m:mstyle>
            <m:mrow/>
          </m:mrow>
          <m:annotation encoding="StarMath 5.0"> size 12{ ital "MS" rSub { size 8{ ital "between"} } = {  { ital "SS" rSub { size 8{ ital "between"} } }  over  { ital "df" rSub { size 8{ ital "between"} } } } } {}</m:annotation>
        </m:semantics>
      </m:math>
    
</item>

<item>
 Mean square (variance estimate) that is due to chance (unexplained): <m:math>
        <m:semantics>
          <m:mrow>
            <m:mstyle fontsize="12pt">
              <m:mrow>
                <m:mrow>
                  <m:mstyle fontstyle="italic">
                    <m:mrow>
                      <m:msub>
                        <m:mtext>MS</m:mtext>
                        <m:mstyle fontsize="8pt">
                          <m:mrow>
                            <m:mstyle fontstyle="italic">
                              <m:mrow>
                                <m:mtext>within</m:mtext>
                              </m:mrow>
                            </m:mstyle>
                          </m:mrow>
                        </m:mstyle>
                      </m:msub>
                    </m:mrow>
                  </m:mstyle>
                  <m:mo stretchy="false">=</m:mo>
                  <m:mfrac>
                    <m:mstyle fontstyle="italic">
                      <m:mrow>
                        <m:msub>
                          <m:mtext>SS</m:mtext>
                          <m:mstyle fontsize="8pt">
                            <m:mrow>
                              <m:mstyle fontstyle="italic">
                                <m:mrow>
                                  <m:mtext>within</m:mtext>
                                </m:mrow>
                              </m:mstyle>
                            </m:mrow>
                          </m:mstyle>
                        </m:msub>
                      </m:mrow>
                    </m:mstyle>
                    <m:mstyle fontstyle="italic">
                      <m:mrow>
                        <m:msub>
                          <m:mtext>df</m:mtext>
                          <m:mstyle fontsize="8pt">
                            <m:mrow>
                              <m:mstyle fontstyle="italic">
                                <m:mrow>
                                  <m:mtext>within</m:mtext>
                                </m:mrow>
                              </m:mstyle>
                            </m:mrow>
                          </m:mstyle>
                        </m:msub>
                      </m:mrow>
                    </m:mstyle>
                  </m:mfrac>
                </m:mrow>
              </m:mrow>
            </m:mstyle>
            <m:mrow/>
          </m:mrow>
          <m:annotation encoding="StarMath 5.0"> size 12{ ital "MS" rSub { size 8{ ital "within"} } = {  { ital "SS" rSub { size 8{ ital "within"} } }  over  { ital "df" rSub { size 8{ ital "within"} } } } } {}</m:annotation>
        </m:semantics>
      </m:math>
   
</item>
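
<item>Illustrative sketch (not part of the original module): the quantities defined in this list can be computed step by step in Python. The function name one_way_anova, the dictionary keys, and the three small samples are assumptions chosen only for illustration. The final entry also forms the F ratio as the mean square between groups divided by the mean square within groups.
<code id="eip-anova-sketch" display="block">
```python
def one_way_anova(groups):
    """Return SS, df, MS and F for a list of samples (one list per group)."""
    k = len(groups)                       # k: number of different groups
    n = [len(g) for g in groups]          # n_j: size of the jth group
    s = [sum(g) for g in groups]          # s_j: sum of the values in the jth group
    N = sum(n)                            # N: total number of values
    sum_x = sum(s)                        # sum of all values (equals the sum of the s_j)
    sum_x2 = sum(x * x for g in groups for x in g)   # sum of squares of all values

    ss_total = sum_x2 - sum_x ** 2 / N
    ss_between = sum(sj ** 2 / nj for sj, nj in zip(s, n)) - sum_x ** 2 / N
    ss_within = ss_total - ss_between

    df_between = k - 1                    # numerator degrees of freedom
    df_within = N - k                     # denominator degrees of freedom

    ms_between = ss_between / df_between
    ms_within = ss_within / df_within

    return {
        "ss_total": ss_total,
        "ss_between": ss_between,
        "ss_within": ss_within,
        "df_between": df_between,
        "df_within": df_within,
        "ms_between": ms_between,
        "ms_within": ms_within,
        "F": ms_between / ms_within,
    }


# Hypothetical data: three groups of three values each.
result = one_way_anova([[1, 2, 3], [2, 3, 4], [5, 6, 7]])
print(result["ss_total"], result["ss_between"], result["ss_within"])  # 32.0 26.0 6.0
print(result["F"])  # 13.0
```
</code>
For these made-up samples the group means are 2, 3, and 6, so most of the total variation is between groups and the F ratio is well above 1.</item>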

</list><para id="eip-858"><m:math>
<m:msub>
<m:mi>MS</m:mi>
<m:mtext>between</m:mtext>
</m:msub>
</m:math> and 
<m:math>
<m:msub>
<m:mi>MS</m:mi>
<m:mtext>within</m:mtext>
</m:msub>
</m:math> can be written as follows:</para><list id="eip-822"><item><m:math>
<m:msub>
<m:mi>MS</m:mi>
<m:mtext>between</m:mtext>
</m:msub>
<m:mo>=</m:mo>
<m:mfrac>
<m:mrow>
<m:msub>
<m:mi>SS</m:mi>
<m:mtext>between</m:mtext>
</m:msub>
</m:mrow>
<m:mrow>
<m:msub>
<m:mi>df</m:mi>
<m:mtext>between</m:mtext>
</m:msub>
</m:mrow>
</m:mfrac>
<m:mo>=</m:mo>
<m:mfrac>
<m:mrow>
<m:msub>
<m:mi>SS</m:mi>
<m:mtext>between</m:mtext>
</m:msub>
</m:mrow>
<m:mrow>
<m:mi>k</m:mi>
<m:mo>−</m:mo>
<m:mn>1</m:mn>
</m:mrow>
</m:mfrac>
</m:math></item>
<item><m:math>
<m:msub>
<m:mi>MS</m:mi>
<m:mtext>within</m:mtext>
</m:msub>
<m:mo>=</m:mo>
<m:mfrac>
<m:mrow>
<m:msub>
<m:mi>SS</m:mi>
<m:mtext>within</m:mtext>
</m:msub>
</m:mrow>
<m:mrow>
<m:msub>
<m:mi>df</m:mi>
<m:mtext>within</m:mtext>
</m:msub>
</m:mrow>
</m:mfrac>
<m:mo>=</m:mo>
<m:mfrac>
<m:mrow>
<m:msub>
<m:mi>SS</m:mi>
<m:mtext>within</m:mtext>
</m:msub>
</m:mrow>
<m:mrow>
<m:mi>N</m:mi>
<m:mo>−</m:mo>
<m:mi>k</m:mi>
</m:mrow>
</m:mfrac>
</m:math></item>
</list><para id="element-138">The ANOVA test depends on the fact that 
<m:math>
<m:msub>
<m:mi>MS</m:mi>
<m:mtext>between</m:mtext>
</m:msub>
</m:math> can be influenced by population
differences among means of the several groups. Since <m:math>
<m:msub>
<m:mi>MS</m:mi>
<m:mtext>within</m:mtext>
</m:msub>
</m:math> compares values of
each group to its own group mean, the fact that group means might be different does
not affect <m:math>
<m:msub>
<m:mi>MS</m:mi>
<m:mtext>within</m:mtext>
</m:msub>
</m:math>.</para><para id="element-807">The null hypothesis says that all groups are samples from populations having the same
normal distribution. The alternate hypothesis says that at least two of the sample
groups come from populations with different normal distributions. If the null hypothesis
is true, 
<m:math>
<m:msub>
<m:mi>MS</m:mi>
<m:mtext>between</m:mtext>
</m:msub>
</m:math> and 
<m:math>
<m:msub>
<m:mi>MS</m:mi>
<m:mtext>within</m:mtext>
</m:msub>
</m:math> should both estimate the same value.</para><note id="eip-82">The null hypothesis says that all the group population means are equal.  The hypothesis of equal means implies that the populations have the same normal distribution because it is assumed that the populations are normal and that they have equal variances.</note><equation id="eip-787"><title>F-Ratio or F Statistic</title><m:math>
<m:mi>F</m:mi>
<m:mo>=</m:mo>
<m:mfrac>
<m:mrow>
<m:msub>
<m:mi>MS</m:mi>
<m:mtext>between</m:mtext>
</m:msub>
</m:mrow>
<m:mrow>
<m:msub>
<m:mi>MS</m:mi>
<m:mtext>within</m:mtext>
</m:msub>
</m:mrow>
</m:mfrac>
</m:math></equation><para id="element-712">If 
<m:math>
<m:msub>
<m:mi>MS</m:mi>
<m:mtext>between</m:mtext>
</m:msub>
</m:math> and 
<m:math>
<m:msub>
<m:mi>MS</m:mi>
<m:mtext>within</m:mtext>
</m:msub>
</m:math> estimate the same value (following the belief that 
<m:math>
<m:msub>
<m:mi>H</m:mi>
<m:mi>o</m:mi>
</m:msub>
</m:math> is
true), then the F-ratio should be approximately equal to 1. Only sampling errors
would contribute to variations away from 1. As it turns out, <m:math>
<m:msub>
<m:mi>MS</m:mi>
<m:mtext>between</m:mtext>
</m:msub>
</m:math> consists of
the population variance plus a variance produced from the differences between the
samples. <m:math>
<m:msub>
<m:mi>MS</m:mi>
<m:mtext>within</m:mtext>
</m:msub>
</m:math> is an estimate of the population variance. Since variances are
never negative, if the null hypothesis is false, <m:math>
<m:msub>
<m:mi>MS</m:mi>
<m:mtext>between</m:mtext>
</m:msub>
</m:math> will generally be larger than <m:math>
<m:msub>
<m:mi>MS</m:mi>
<m:mtext>within</m:mtext>
</m:msub>
</m:math>.
The F-ratio will then tend to be larger than 1.</para><para id="eip-611">The above calculations were done with groups of different sizes. If the groups are the same size, the calculations simplify somewhat and the F ratio can be written as: </para><equation id="eip-382"><title>F-Ratio Formula when the groups are the same size</title><m:math>
        <m:mrow>
          <m:mi>F</m:mi>
          <m:mo>=</m:mo>
          <m:mfrac>
            <m:mrow>
              <m:mi>n</m:mi>
              <m:mo>⋅</m:mo>
              <m:msup>
                <m:mrow>
                  <m:mo stretchy="false">(</m:mo>
                  <m:msub>
                    <m:mi>s</m:mi>
                    <m:mover accent="true">
                      <m:mi>x</m:mi>
                      <m:mo>¯</m:mo>
                    </m:mover>
                  </m:msub>
                  <m:mo stretchy="false">)</m:mo>
                </m:mrow>
                <m:mn>2</m:mn>
              </m:msup>
            </m:mrow>
            <m:msup>
              <m:mrow>
                <m:mo stretchy="false">(</m:mo>
                <m:msub>
                  <m:mi>s</m:mi>
                  <m:mtext>pooled</m:mtext>
                </m:msub>
                <m:mo stretchy="false">)</m:mo>
              </m:mrow>
              <m:mn>2</m:mn>
            </m:msup>
          </m:mfrac>
        </m:mrow>
      </m:math>
</equation><list id="eip-389"><title>where ...</title><item><m:math><m:mrow><m:msup><m:mrow><m:mo stretchy="false">(</m:mo><m:msub><m:mi>s</m:mi><m:mover accent="true"><m:mi>x</m:mi><m:mo>¯</m:mo></m:mover></m:msub><m:mo stretchy="false">)</m:mo></m:mrow><m:mn>2</m:mn></m:msup><m:mo>=</m:mo></m:mrow></m:math> the variance of the sample means</item>
<item><m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mi>n</m:mi><m:mo stretchy="false">=</m:mo><m:mrow/></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{n={}} {}</m:annotation></m:semantics></m:math>the sample size of each group</item>
<item><m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mo stretchy="false">(</m:mo><m:msub><m:mi>s</m:mi><m:mstyle fontsize="8pt"><m:mrow><m:mstyle fontstyle="italic"><m:mrow><m:mtext>pooled</m:mtext></m:mrow></m:mstyle></m:mrow></m:mstyle></m:msub><m:mrow><m:msup><m:mo stretchy="false">)</m:mo><m:mstyle fontsize="8pt"><m:mrow><m:mn>2</m:mn></m:mrow></m:mstyle></m:msup><m:mo stretchy="false">=</m:mo><m:mrow/></m:mrow></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{ \( s rSub { size 8{ ital "pooled"} }  \)  rSup { size 8{2} } ={}} {}</m:annotation></m:semantics></m:math>the mean of the sample variances (pooled variance)</item>

<item>
      <m:math>
        <m:semantics>
          <m:mrow>
            <m:mstyle fontsize="12pt">
              <m:mrow>
                <m:mrow>
                  <m:mstyle fontstyle="italic">
                    <m:mrow>
                      <m:msub>
                        <m:mtext>df</m:mtext>
                        <m:mstyle fontsize="8pt">
                          <m:mrow>
                            <m:mstyle fontstyle="italic">
                              <m:mrow>
                                <m:mtext>numerator</m:mtext>
                              </m:mrow>
                            </m:mstyle>
                          </m:mrow>
                        </m:mstyle>
                      </m:msub>
                    </m:mrow>
                  </m:mstyle>
                  <m:mo stretchy="false">=</m:mo>
                  <m:mrow>
                    <m:mi>k</m:mi>
                    <m:mo stretchy="false">−</m:mo>
                    <m:mn>1</m:mn>
                  </m:mrow>
                </m:mrow>
              </m:mrow>
            </m:mstyle>
            <m:mrow/>
          </m:mrow>
          <m:annotation encoding="StarMath 5.0"> size 12{ ital "df" rSub { size 8{ ital "numerator"} } =k - 1} {}</m:annotation>
        </m:semantics>
      </m:math>
    </item>

<item>
      <m:math>
        <m:semantics>
          <m:mrow>
            <m:mstyle fontsize="12pt">
              <m:mrow>
                <m:mrow>
                  <m:mstyle fontstyle="italic">
                    <m:mrow>
                      <m:msub>
                        <m:mtext>df</m:mtext>
                        <m:mstyle fontsize="8pt">
                          <m:mrow>
                            <m:mrow>
                              <m:mstyle fontstyle="italic">
                                <m:mrow>
                                  <m:mtext>denominator</m:mtext>
                                </m:mrow>
                              </m:mstyle>
                             
                              <m:mrow/>
                            </m:mrow>
                          </m:mrow>
                        </m:mstyle>
                      </m:msub>
 <m:mo stretchy="false">=</m:mo>
                    </m:mrow>
                  </m:mstyle>
                  <m:mi>k</m:mi>
                  <m:mo stretchy="false">(</m:mo>
                  <m:mrow>
                    <m:mi>n</m:mi>
                    <m:mo stretchy="false">−</m:mo>
                    <m:mn>1</m:mn>
                  </m:mrow>
                  <m:mrow>
                    <m:mo stretchy="false">)</m:mo>
                    <m:mo stretchy="false">=</m:mo>
                    <m:mrow>
                      <m:mi>N</m:mi>
                      <m:mo stretchy="false">−</m:mo>
                      <m:mi>k</m:mi>
                    </m:mrow>
                  </m:mrow>
                </m:mrow>
              </m:mrow>
            </m:mstyle>
            <m:mrow/>
          </m:mrow>
          <m:annotation encoding="StarMath 5.0"> size 12{ ital "df" rSub { size 8{ ital "denominator"={}} } k \( n - 1 \) =N - k} {}</m:annotation>
        </m:semantics>
      </m:math>
    </item>
</list><para id="element-260"><emphasis>The ANOVA hypothesis test is always right-tailed</emphasis> because large F-values fall
far out in the right tail of the F-distribution curve and tend to make us reject 
<m:math>
<m:msub>
<m:mi>H</m:mi>
<m:mi>o</m:mi>
</m:msub>
</m:math>.</para><section id="element-628"><title>Notation</title>
<para id="element-999">The notation for the F distribution is 
<m:math>
<m:mi>F</m:mi></m:math> ~
<m:math>
<m:msub>
<m:mi>F</m:mi>
<m:mrow>
<m:mtext>df(num)</m:mtext>
<m:mo>,</m:mo>
<m:mtext>df(denom)</m:mtext>
</m:mrow>
</m:msub>
</m:math></para><para id="element-966">where 
<m:math>
<m:mtext> df(num)</m:mtext>
<m:mo>=</m:mo>
<m:msub>
<m:mi>df</m:mi>
<m:mtext>between</m:mtext>
</m:msub></m:math>
and
<m:math>
<m:mtext> df(denom) </m:mtext>
<m:mo>=</m:mo>
<m:msub>
<m:mi>df</m:mi>
<m:mtext> within </m:mtext>
</m:msub></m:math></para><para id="element-181">The mean for the F distribution is 
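<m:math>
<m:mi>μ</m:mi>
<m:mo>=</m:mo>
<m:mfrac>
<m:mtext>df(denom)</m:mtext>
<m:mrow>
<m:mtext>df(denom)</m:mtext>
<m:mo>−</m:mo>
<m:mn>2</m:mn>
</m:mrow>
</m:mfrac>
</m:math>, provided that <m:math><m:mtext>df(denom)</m:mtext><m:mo>&gt;</m:mo><m:mn>2</m:mn></m:math>.</para></section>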
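<para id="eip-sketch-1">As a rough illustration of the equal-group-size formula, the following Python sketch computes the F-ratio two ways: from the variance of the sample means and the pooled variance, and from the general ratio of the between-group and within-group mean squares. The data values and the function names <code>f_ratio_equal_groups</code> and <code>f_ratio_general</code> are hypothetical and chosen only for illustration; this is a minimal sketch, not a full ANOVA routine.</para>
<code id="eip-sketch-2" display="block">
```python
from statistics import mean, variance


def f_ratio_equal_groups(groups):
    """Return (F, df_numerator, df_denominator) for k groups of equal size n.

    Uses F = n * (variance of the sample means) / (mean of the sample variances).
    """
    n = len(groups[0])
    if any(len(g) != n for g in groups):
        raise ValueError("all groups must have the same size")
    k = len(groups)
    var_of_means = variance([mean(g) for g in groups])  # variance of the sample means
    pooled_var = mean([variance(g) for g in groups])    # pooled variance (equal sizes)
    f = n * var_of_means / pooled_var
    return f, k - 1, k * (n - 1)


def f_ratio_general(groups):
    """Return F = MS_between / MS_within; group sizes may differ."""
    k = len(groups)
    total_n = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / total_n
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    ss_within = sum(sum((x - mean(g)) ** 2 for x in g) for g in groups)
    ms_between = ss_between / (k - 1)
    ms_within = ss_within / (total_n - k)
    return ms_between / ms_within


# Hypothetical data: k = 3 groups, n = 3 observations each.
data = [[4.0, 5.0, 6.0], [6.0, 7.0, 8.0], [8.0, 9.0, 10.0]]

f, df_num, df_denom = f_ratio_equal_groups(data)
print(f, df_num, df_denom)
print(f_ratio_general(data))
```
</code>
<para id="eip-sketch-3">For these made-up values the group means are 5, 7, and 9, so the variance of the sample means is 4; each sample variance is 1, so the pooled variance is 1. The formula gives F = 3 ⋅ 4 / 1 = 12 with 2 and 6 degrees of freedom, and the general calculation returns the same F.</para>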
  </content>
  
</document>
