<?xml version="1.0" encoding="utf-8" standalone="no"?>
<!DOCTYPE document PUBLIC "-//CNX//DTD CNXML 0.5 plus MathML//EN" "http://cnx.rice.edu/cnxml/0.5/DTD/cnxml_mathml.dtd">
<document xmlns="http://cnx.rice.edu/cnxml" xmlns:md="http://cnx.rice.edu/mdml/0.4" xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/" id="m11131">
  
  <name>Sampling Distribution of the Mean</name>
  
  <metadata>
  <md:version>2.4</md:version>
  <md:created>2003/04/28</md:created>
  <md:revised>2003/06/19 09:44:16.140 GMT-5</md:revised>
  <md:authorlist>
    <md:author id="dmlane">
      <md:firstname>David</md:firstname>
      
      <md:surname>Lane</md:surname>
      <md:email>lane@rice.edu</md:email>
    </md:author>
  </md:authorlist>

  <md:maintainerlist>
    <md:maintainer id="dmlane">
      <md:firstname>David</md:firstname>
      
      <md:surname>Lane</md:surname>
      <md:email>lane@rice.edu</md:email>
    </md:maintainer>
    <md:maintainer id="liqun">
      <md:firstname>Liqun</md:firstname>
      
      <md:surname>Wang</md:surname>
      <md:email>liqun@rice.edu</md:email>
    </md:maintainer>
  </md:maintainerlist>
  
  <md:keywordlist>
    <md:keyword>sampling distribution</md:keyword>
    <md:keyword>mean</md:keyword>
    <md:keyword>variance</md:keyword>
    <md:keyword>central limit theorem</md:keyword>
  </md:keywordlist>

  <md:abstract>This module discusses sampling distribution of the mean.</md:abstract>
</metadata>

  <content>
    <para id="para1">
     The sampling distribution of the mean was defined in our <cnxn document="m11130">Introduction to Sampling
     Distributions</cnxn>. This discussion reviews some important
     properties of the sampling distribution of the mean that were
     introduced in the demonstrations in this chapter (<cnxn document="m11203">Basic Demo</cnxn>, <cnxn document="m11205">Sample Size Demo</cnxn>, and <cnxn document="m11185">Central Limit Theorem Demo</cnxn>).
    </para>
    
    <section id="sec1">
      <name>Mean</name>
      <para id="sec1para1">
	The mean of the sampling distribution of the mean is the mean
	of the population from which the scores were
	sampled. Therefore, if a population has a mean,
	<m:math><m:ci>m</m:ci></m:math>, then the 
	sampling distribution of the mean is also 
	<m:math><m:ci>m</m:ci></m:math>. The symbol 
	<m:math>
	  <m:ci>
	    <m:msub><m:mi>m</m:mi><m:mi>M</m:mi></m:msub>
	  </m:ci>
	</m:math> is 
	used to refer to the mean of the sampling distribution of the
	mean. Therefore, the formula for the mean of the sampling
	distribution of the mean can be written as: 

	<m:math display="block">
	  <m:apply>
	    <m:eq/>
	    <m:ci>
	      <m:msub><m:mi>m</m:mi><m:mi>M</m:mi></m:msub>
	    </m:ci>
	    <m:ci>m</m:ci>
	  </m:apply>
	</m:math>
      </para>
    </section>

    <section id="sec2">
      <name>Variance</name>
      <para id="sec2para1">
	The variance of the sampling distribution of the mean is
	computed as follows:  

	<m:math display="block">
	  <m:apply>
	    <m:eq/>
	    <m:apply>
	      <m:power/>
	      <m:ci>
		<m:msub><m:mi>σ</m:mi><m:mi>M</m:mi></m:msub>
	      </m:ci>
	      <m:cn>2</m:cn>
	    </m:apply>
	    <m:apply>
	      <m:divide/>
	      <m:apply>
		<m:power/>
		<m:ci>σ</m:ci>
		<m:cn>2</m:cn>
	      </m:apply>
	      <m:ci>N</m:ci>
	    </m:apply>
	  </m:apply>
	</m:math>

	That is, the variance of the sampling distribution of the mean
	is the population variance divided by
	<m:math><m:ci>N</m:ci></m:math>, the sample size (the number 
	of scores used to compute a mean). Thus, the larger the sample
	size, the smaller the variance of the sampling distribution of
	the mean.
	<note type="aside">
	  This expression can be derived very easily from the variance
	  sum law. Let's begin by computing the variance of the
	  sampling distribution of the sum of three numbers sampled
	  from a population with variance
	  <m:math>
	    <m:apply>
	      <m:power/>
	      <m:ci>s</m:ci>
	      <m:cn>2</m:cn>
	    </m:apply>
	  </m:math>. The variance of the sum would be 
	  <m:math>
	    <m:apply>
	      <m:plus/>
	      <m:apply>
		<m:power/>
		<m:ci>s</m:ci>
		<m:cn>2</m:cn>
	      </m:apply>
	      <m:apply>
		<m:power/>
		<m:ci>s</m:ci>
		<m:cn>2</m:cn>
	      </m:apply>
	      <m:apply>
		<m:power/>
		<m:ci>s</m:ci>
		<m:cn>2</m:cn>
	      </m:apply>
	    </m:apply>
	  </m:math>. For <m:math><m:ci>N</m:ci></m:math> numbers, the
	  variance would be 
	  <m:math>
	    <m:apply>
	      <m:times/>
	      <m:ci>N</m:ci>
	      <m:apply>
		<m:power/>
		<m:ci>s</m:ci>
		<m:cn>2</m:cn>
	      </m:apply>
	    </m:apply>
	  </m:math>. Since the mean is 
	  <m:math>
	    <m:apply>
	      <m:divide/>
	      <m:cn>1</m:cn>
	      <m:ci>N</m:ci>
	    </m:apply>
	  </m:math> times the sum, the variance of the sampling
	  distribution of the mean would be
	  <m:math>
	    <m:apply>
	      <m:divide/>
	      <m:cn>1</m:cn>
	      <m:apply>
		<m:power/>
		<m:ci>N</m:ci>
		<m:cn>2</m:cn>
	      </m:apply>
	    </m:apply>
	  </m:math> times the variance of the sum, which equals 
	  <m:math>
	    <m:apply>
	      <m:divide/>
	      <m:apply>
		<m:power/>
		<m:ci>s</m:ci>
		<m:cn>2</m:cn>
	      </m:apply>
	      <m:ci>N</m:ci>
	    </m:apply>
	  </m:math>.
	</note>
      </para>

      <para id="sec2para3">
	The standard error of the mean is the standard deviation of
	the sampling distribution of the mean. It is therefore the
	square root of the variance of the sampling distribution of
	the mean and can be written as:

	<m:math display="block">
	  <m:apply>
	    <m:eq/>
	    <m:ci>
	      <m:msub><m:mi>σ</m:mi><m:mi>M</m:mi></m:msub>
	    </m:ci>
	    <m:apply>
	      <m:divide/>
	      <m:ci>σ</m:ci>
	      <m:apply>
		<m:root/>
		<m:ci>N</m:ci>
	      </m:apply>
	    </m:apply>
	  </m:apply>
	</m:math> 
	
	The standard error is represented by an 
	<m:math><m:ci>s</m:ci></m:math> because it is a standard 
	deviation. The subscript (<m:math><m:ci>M</m:ci></m:math>)
	indicates that the standard error in question is the standard
	error of the mean.  
      </para>
    </section>

    <section id="sec3">
      <name>Central Limit Theorem</name>

      <rule type="theorem" id="rule1">
	<name>Central Limit Theorem</name>
	<statement>
	  <para id="rule1para1">
	    Given a population with a mean 
	    <m:math><m:ci>m</m:ci></m:math> and a finite non-zero variance 
	    <m:math>
	      <m:apply>
		<m:power/>
		<m:ci>s</m:ci>
		<m:cn>2</m:cn>
	      </m:apply>
	    </m:math>, 
	    the sampling distribution of the mean approaches a normal
	    distribution with a mean of <m:math><m:ci>m</m:ci></m:math>
	    and a variance of 
	    <m:math>
	      <m:apply>
		<m:divide/>
		<m:apply>
		  <m:power/>
		  <m:ci>s</m:ci>
		  <m:cn>2</m:cn>
		</m:apply>
		<m:ci>N</m:ci>
	      </m:apply>
	    </m:math> as <m:math><m:ci>N</m:ci></m:math>, the sample size
	    increases.
	  </para>
	</statement>
      </rule>

      <para id="sec3para3">
	The expressions for the mean and variance of the sampling
	distribution of the mean are not new or remarkable. What is
	remarkable is that regardless of the shape of the parent
	population, the sampling distribution of the mean approaches a
	normal distribution as <m:math><m:ci>N</m:ci></m:math>
	increases. If you have used the <cnxn document="m11185">Central Limit Theorem Demo</cnxn>, you have
	already seen this for yourself. As a reminder, <cnxn target="figure1" strength="7"/> shows the results of the
	simulation for
	<m:math>
	  <m:apply>
	    <m:eq/>
	    <m:ci>N</m:ci>
	    <m:cn>2</m:cn>
	  </m:apply>
	</m:math> and 
	<m:math>
	  <m:apply>
	    <m:eq/>
	    <m:ci>N</m:ci>
	    <m:cn>10</m:cn>
	  </m:apply>
	</m:math>. The parent population was a
	<emphasis>uniform</emphasis> distribution. You can see that
	the distribution for
	<m:math>
	  <m:apply>
	    <m:eq/>
	    <m:ci>N</m:ci>
	    <m:cn>2</m:cn>
	  </m:apply>
	</m:math> is far from a normal distribution. Nonetheless, it
	does show that the scores are denser in the middle than in the
	tails. For 
	<m:math>
	  <m:apply>
	    <m:eq/>
	    <m:ci>N</m:ci>
	    <m:cn>10</m:cn>
	  </m:apply>
	</m:math> the distribution is quite close to a normal
	distribution. Notice that the means of the two distributions
	are the same, but that the spread of the distribution for 
	<m:math>
	  <m:apply>
	    <m:eq/>
	    <m:ci>N</m:ci>
	    <m:cn>10</m:cn>
	  </m:apply>
	</m:math> is smaller. 
      </para>

      <figure id="figure1">
	<media type="image/gif" src="figure1.gif"/>
	<caption>
	  A simulation of a sampling distribution. The parent
	  population is uniform. The blue line under "16" indicates
	  that 16 is the mean. The red line extends from the mean plus
	  and minus one standard deviation.
	</caption>
      </figure>

      <para id="sec3para4">
	<cnxn target="figure2" strength="7"/> shows how closely the
	sampling distribution of the mean approximates a normal
	distribution even when the parent population is very
	non-normal. If you look closely you can see that the sampling
	distributions do have a slight <emphasis>positive
	skew</emphasis>. The larger the sample size, the closer the
	sampling distribution of the mean would be to a normal
	distribution.
      </para>

      <figure id="figure2">
	<media type="image/gif" src="figure2.gif"/>
	<caption>
	  A simulation of a sampling distribution. The parent
	  population is very non-normal.
	</caption>
      </figure>

    </section>

  </content>  
</document>
