<?xml version="1.0" encoding="utf-8" standalone="no"?>
<!DOCTYPE document PUBLIC "-//CNX//DTD CNXML 0.5 plus MathML//EN" "http://cnx.rice.edu/technology/cnxml/schema/dtd/0.5/cnxml_mathml.dtd">
<document xmlns="http://cnx.rice.edu/cnxml" xmlns:md="http://cnx.rice.edu/mdml/0.4" xmlns:bib="http://bibtexml.sf.net/" xmlns:m="http://www.w3.org/1998/Math/MathML" id="new">
  <name>Descriptive Statistics: Measuring the Spread of the Data (edited: Teegarden)</name>
  <metadata>
  <md:version>1.1</md:version>
  <md:created>2008/08/12 14:17:01.928 GMT-5</md:created>
  <md:revised>2008/08/12 14:18:19.901 GMT-5</md:revised>
  <md:authorlist>
      <md:author id="tteegard">
      <md:firstname>Mary</md:firstname>
      <md:othername>T</md:othername>
      <md:surname>Teegarden</md:surname>
      <md:email>tteegard@sdccd.edu</md:email>
    </md:author>
  </md:authorlist>

  <md:maintainerlist>
    <md:maintainer id="tteegard">
      <md:firstname>Mary</md:firstname>
      <md:othername>T</md:othername>
      <md:surname>Teegarden</md:surname>
      <md:email>tteegard@sdccd.edu</md:email>
    </md:maintainer>
  </md:maintainerlist>
  
  <md:keywordlist>
    <md:keyword>elementary</md:keyword>
    <md:keyword>statistics</md:keyword>
  </md:keywordlist>

  <md:abstract>This module describes a number of statistical measures used to describe data, such as percentiles, spread, and skewness. Labs changed to include information on mini-tabs.</md:abstract>
</metadata>
  <content>
   <para id="element-683">The most common measure of spread is the standard deviation.  The <term src="#stddev">standard deviation</term> is a number that measures how far data values are from their mean.  For example, if the mean of a set of data containing 7 is 5 and the <emphasis>standard deviation</emphasis> is 2, then the value 7 is one (1) standard deviation from its mean because  5 + (1)(2) = 7. </para><para id="element-517">The number line may help you understand standard deviation.  If we were to put 5 and 7 on a number line, 7 is to the right of 5.  We say, then, that 7 is <emphasis>one</emphasis> standard deviation to the <emphasis>right</emphasis> of 5.  If 1 were also part of the data set, then 1 is <emphasis>two</emphasis> standard deviations to the <emphasis>left</emphasis> of 5 because 5 +(-2)(2) = 1.  </para><para id="element-251">1=5+(-2)(2) ; 7=5+(1)(2)</para><para id="element-980"><media type="image/png" src="desc_stat_numline.png">
 <param name="alt" value="A number line labeled from 0 to 7."/>

 <param name="print-width" value="4in"/>
</media></para><para id="element-586"><emphasis>Formula:</emphasis> value = <m:math>
<m:apply>
  <m:conjugate/>
  <m:ci>x</m:ci>
</m:apply>
</m:math> + (#ofSTDEVs)(s)</para><para id="element-247">Generally, a value = mean + (#ofSTDEVs)(standard deviation), where #ofSTDEVs = the number of standard deviations.</para><para id="element-315">If <m:math><m:mi>x</m:mi></m:math> is a value and  <m:math>
<m:apply>
  <m:conjugate/>
  <m:ci>x</m:ci>
</m:apply>
</m:math>  is the sample mean, then  <m:math><m:mi>x</m:mi><m:mo>-</m:mo></m:math> <m:math>
<m:apply>
  <m:conjugate/>
  <m:ci>x</m:ci>
</m:apply>
</m:math> is called a deviation.  In a data set, there are as many deviations as there are data values.  Deviations are used to calculate the sample standard deviation.  </para><para id="element-121">To calculate the standard deviation, calculate the variance first.  The <term src="#variance">variance</term> is the average of the squares of the deviations.  The standard deviation is the square root of the variance.  You can think of the standard deviation as a special average of the deviations (the <m:math><m:mi>x</m:mi><m:mo>-</m:mo></m:math> <m:math>
<m:apply>
  <m:conjugate/>
  <m:ci>x</m:ci>
</m:apply>
</m:math> values).  The lower case letter <m:math><m:mi>s</m:mi></m:math> represents the sample standard deviation and the Greek letter <m:math><m:mi>σ</m:mi></m:math> (sigma) represents the population standard deviation.  We use <m:math>
 <m:msup>
    <m:mi>s</m:mi>
    <m:mn>2</m:mn>
  </m:msup>
</m:math> to represent the sample variance and <m:math>
 <m:msup>
    <m:mi>σ</m:mi>
    <m:mn>2</m:mn>
  </m:msup>
</m:math> to represent the population variance.  If the sample has the same characteristics as the population, then s should be a good estimate of <m:math><m:mi>σ</m:mi></m:math>.</para><note>In practice, use either a calculator or computer software to calculate the standard deviation.  However, please study the following step-by-step example.</note>

<example id="element-655">

<para id="element-440">In a fifth grade class, the teacher was interested in the average age and the standard deviation of the ages of her students.  What follows are the ages of her students to the nearest half year: 

</para><para id="element-433"><list id="element-124125" type="inline"><item>9  </item><item>  9.5  </item><item>  9.5  </item><item>  10  </item><item>  10  </item><item>  10  </item><item>  10  </item><item>  10.5  </item><item>  10.5  </item><item>  10.5  </item><item>  10.5  </item><item>  11  </item><item>  11  </item><item>  11  </item><item>  11  </item><item>  11  </item><item>  11  </item><item>  11.5  </item><item>  11.5  </item><item>  11.5</item></list></para><equation id="element-320"><m:math>
<m:mover>
    <m:mi>x</m:mi>
    <m:mo>¯</m:mo>
  </m:mover>
  <m:mo>=</m:mo>
  <m:mfrac>
    <m:mrow>
      <m:mn>9</m:mn>
      <m:mo>+</m:mo>
      <m:mrow>
        <m:mn>9.5</m:mn>
        <m:mo>×</m:mo>
        <m:mrow>
          <m:mn>2</m:mn>
          <m:mo>+</m:mo>
          <m:mrow>
            <m:mn>10</m:mn>
            <m:mo>×</m:mo>
            <m:mn>4</m:mn>
          </m:mrow>
          <m:mo>+</m:mo>
          <m:mrow>
            <m:mn>10.5</m:mn>
            <m:mo>×</m:mo>
            <m:mn>4</m:mn>
          </m:mrow>
          <m:mo>+</m:mo>
          <m:mrow>
            <m:mn>11</m:mn>
            <m:mo>×</m:mo>
            <m:mn>6</m:mn>
          </m:mrow>
        </m:mrow>
      </m:mrow>
      <m:mo>+</m:mo>
      <m:mrow>
        <m:mn>11.5</m:mn>
        <m:mo>×</m:mo>
        <m:mn>3</m:mn>
      </m:mrow>
    </m:mrow>
    <m:mn>20</m:mn>
  </m:mfrac>
  <m:mo>=</m:mo>
  <m:mn>10.525</m:mn>
</m:math></equation><para id="element-642">The average age is 10.53 years, rounded to 2 places.</para><para id="element-442">The variance may be calculated by using a table.  Then the standard deviation is calculated by taking the square root of the variance.  We will explain the parts of the table after calculating <m:math><m:mi>s</m:mi></m:math>.</para><table id="element-978">
<?table-summary This table presents the formulas and calculations of various values. The first column has the data, second column has frequency, third column has deviations, fourth column has deviations squared, fifth column has frequency times deviations squared. There are 6 rows of values.?>
<tgroup cols="5"><thead>
  <row>
    <entry>Data</entry>
    <entry>Freq.</entry>
    <entry>Deviations</entry>
    <entry><m:math>
 <m:msup>
    <m:mi>Deviations</m:mi>
    <m:mn>2</m:mn>
  </m:msup>
</m:math></entry>
    <entry>(Freq.)(<m:math>
 <m:msup>
    <m:mi>Deviations</m:mi>
    <m:mn>2</m:mn>
  </m:msup>
</m:math>)</entry>
  </row>
  </thead>
  <tbody>
  <row>
    <entry><m:math><m:mi>x</m:mi></m:math></entry>
    <entry><m:math><m:mi>f</m:mi></m:math></entry>
    <entry><m:math><m:mo>(</m:mo><m:mi>x</m:mi><m:mo>-</m:mo><m:apply>
  <m:conjugate/>
  <m:ci>x</m:ci>
</m:apply><m:mo>)</m:mo></m:math></entry>
    <entry><m:math>
<m:msup>
    <m:mrow>
      <m:mo>(</m:mo>
      <m:mi>x</m:mi>
      <m:mo>-</m:mo>
      <m:mover>
        <m:mi>x</m:mi>
        <m:mo>¯</m:mo>
      </m:mover>
      <m:mo>)</m:mo>
    </m:mrow>
    <m:mn>2</m:mn>
  </m:msup>
</m:math></entry>
    <entry><m:math>
 <m:mrow>
    <m:mo>(</m:mo>
    <m:mi>f</m:mi>
    <m:mo>)</m:mo>
  </m:mrow>
  <m:msup>
    <m:mrow>
      <m:mo>(</m:mo>
      <m:mi>x</m:mi>
      <m:mo>-</m:mo>
      <m:mover>
        <m:mi>x</m:mi>
        <m:mo>¯</m:mo>
      </m:mover>
      <m:mo>)</m:mo>
    </m:mrow>
    <m:mn>2</m:mn>
  </m:msup>
</m:math></entry>
  </row>
  <row>
    <entry><m:math><m:mi>9</m:mi></m:math></entry>
    <entry><m:math><m:mi>1</m:mi></m:math></entry>
    <entry><m:math>
 <m:mn>9</m:mn>
  <m:mo>-</m:mo>
  <m:mn>10.525</m:mn>
  <m:mo>=</m:mo>
  <m:mo>-</m:mo>
  <m:mn>1.525</m:mn>
</m:math></entry>
    <entry><m:math>
<m:msup>
    <m:mrow>
      <m:mo>(</m:mo>
      <m:mo>-</m:mo>
      <m:mn>1.525</m:mn>
      <m:mo>)</m:mo>
    </m:mrow>
    <m:mn>2</m:mn>
  </m:msup>
  <m:mo>=</m:mo>
  <m:mn>2.325625</m:mn>
</m:math></entry>
    <entry><m:math>
<m:mrow>
    <m:mn>1</m:mn>
    <m:mo>×</m:mo>
    <m:mrow>
      <m:mn>2.325625</m:mn>
      <m:mo>=</m:mo>
      <m:mn>2.325625</m:mn>
    </m:mrow>
  </m:mrow>
</m:math></entry>
  </row>
  <row>
    <entry><m:math><m:mi>9.5</m:mi></m:math></entry>
    <entry><m:math><m:mi>2</m:mi></m:math></entry>
    <entry><m:math>
 <m:mn>9.5</m:mn>
  <m:mo>-</m:mo>
  <m:mn>10.525</m:mn>
  <m:mo>=</m:mo>
  <m:mo>-</m:mo>
  <m:mn>1.025</m:mn>
</m:math></entry>
    <entry><m:math>
<m:msup>
    <m:mrow>
      <m:mo>(</m:mo>
      <m:mo>-</m:mo>
      <m:mn>1.025</m:mn>
      <m:mo>)</m:mo>
    </m:mrow>
    <m:mn>2</m:mn>
  </m:msup>
  <m:mo>=</m:mo>
  <m:mn>1.050625</m:mn>
</m:math></entry>
    <entry><m:math>
<m:mrow>
    <m:mn>2</m:mn>
    <m:mo>×</m:mo>
    <m:mrow>
      <m:mn>1.050625</m:mn>
      <m:mo>=</m:mo>
      <m:mn>2.101250</m:mn>
    </m:mrow>
  </m:mrow>
</m:math></entry>
  </row>
  <row>
    <entry><m:math><m:mi>10</m:mi></m:math></entry>
    <entry><m:math><m:mi>4</m:mi></m:math></entry>
    <entry><m:math>
 <m:mn>10</m:mn>
  <m:mo>-</m:mo>
  <m:mn>10.525</m:mn>
  <m:mo>=</m:mo>
  <m:mo>-</m:mo>
  <m:mn>0.525</m:mn>
</m:math></entry>
    <entry><m:math>
<m:msup>
    <m:mrow>
      <m:mo>(</m:mo>
      <m:mo>-</m:mo>
      <m:mn>0.525</m:mn>
      <m:mo>)</m:mo>
    </m:mrow>
    <m:mn>2</m:mn>
  </m:msup>
  <m:mo>=</m:mo>
  <m:mn>0.275625</m:mn>
</m:math></entry>
    <entry><m:math>
<m:mrow>
    <m:mn>4</m:mn>
    <m:mo>×</m:mo>
    <m:mrow>
      <m:mn>.275625</m:mn>
      <m:mo>=</m:mo>
      <m:mn>1.1025</m:mn>
    </m:mrow>
  </m:mrow>
</m:math></entry>
  </row>
  <row>
    <entry><m:math><m:mi>10.5</m:mi></m:math></entry>
    <entry><m:math><m:mi>4</m:mi></m:math></entry>
    <entry><m:math>
 <m:mn>10.5</m:mn>
  <m:mo>-</m:mo>
  <m:mn>10.525</m:mn>
  <m:mo>=</m:mo>
  <m:mo>-</m:mo>
  <m:mn>0.025</m:mn>
</m:math></entry>
    <entry><m:math>
<m:msup>
    <m:mrow>
      <m:mo>(</m:mo>
      <m:mo>-</m:mo>
      <m:mn>0.025</m:mn>
      <m:mo>)</m:mo>
    </m:mrow>
    <m:mn>2</m:mn>
  </m:msup>
  <m:mo>=</m:mo>
  <m:mn>0.000625</m:mn>
</m:math></entry>
    <entry><m:math>
<m:mrow>
    <m:mn>4</m:mn>
    <m:mo>×</m:mo>
    <m:mrow>
      <m:mn>.000625</m:mn>
      <m:mo>=</m:mo>
      <m:mn>.0025</m:mn>
    </m:mrow>
  </m:mrow>
</m:math></entry>
  </row>
  <row>
    <entry><m:math><m:mi>11</m:mi></m:math></entry>
    <entry><m:math><m:mi>6</m:mi></m:math></entry>
    <entry><m:math>
 <m:mn>11</m:mn>
  <m:mo>-</m:mo>
  <m:mn>10.525</m:mn>
  <m:mo>=</m:mo>
  <m:mn>0.475</m:mn>
</m:math></entry>
    <entry><m:math>
<m:msup>
    <m:mrow>
      <m:mo>(</m:mo>
      <m:mn>0.475</m:mn>
      <m:mo>)</m:mo>
    </m:mrow>
    <m:mn>2</m:mn>
  </m:msup>
  <m:mo>=</m:mo>
  <m:mn>0.225625</m:mn>
</m:math></entry>
    <entry><m:math>
<m:mrow>
    <m:mn>6</m:mn>
    <m:mo>×</m:mo>
    <m:mrow>
      <m:mn>.225625</m:mn>
      <m:mo>=</m:mo>
      <m:mn>1.35375</m:mn>
    </m:mrow>
  </m:mrow>
</m:math></entry>
  </row>
  <row>
    <entry><m:math><m:mi>11.5</m:mi></m:math></entry>
    <entry><m:math><m:mi>3</m:mi></m:math></entry>
    <entry><m:math>
 <m:mn>11.5</m:mn>
  <m:mo>-</m:mo>
  <m:mn>10.525</m:mn>
  <m:mo>=</m:mo>
  <m:mn>0.975</m:mn>
</m:math></entry>
    <entry><m:math>
<m:msup>
    <m:mrow>
      <m:mo>(</m:mo>
      <m:mn>0.975</m:mn>
      <m:mo>)</m:mo>
    </m:mrow>
    <m:mn>2</m:mn>
  </m:msup>
  <m:mo>=</m:mo>
  <m:mn>0.950625</m:mn>
</m:math></entry>
    <entry><m:math>
<m:mrow>
    <m:mn>3</m:mn>
    <m:mo>×</m:mo>
    <m:mrow>
      <m:mn>.950625</m:mn>
      <m:mo>=</m:mo>
      <m:mn>2.851875</m:mn>
    </m:mrow>
  </m:mrow>
</m:math></entry>
  </row>
</tbody>




</tgroup>
</table>

<para id="element-367">The sample variance, <m:math>
 <m:msup>
    <m:mi>s</m:mi>
    <m:mn>2</m:mn>
  </m:msup>
</m:math>, is equal to the last sum (9.7375) divided by the total number of data values minus one (20 - 1):</para><para id="element-631"><m:math>
<m:msup>
    <m:mi>s</m:mi>
    <m:mn>2</m:mn>
  </m:msup>
  <m:mo>=</m:mo>
  <m:mfrac>
    <m:mn>9.7375</m:mn>
    <m:mrow>
      <m:mn>20</m:mn>
      <m:mo>-</m:mo>
      <m:mn>1</m:mn>
    </m:mrow>
  </m:mfrac>
  <m:mo>=</m:mo>
  <m:mn>0.5125</m:mn>
</m:math></para><para id="element-424">The sample standard deviation, <m:math><m:mi>s</m:mi></m:math>,  is equal to the square root of the sample variance:</para><para id="element-127"><m:math>
<m:mi>s</m:mi>
  <m:mo>=</m:mo>
  <m:msqrt>
    <m:mn>0.5125</m:mn>
  </m:msqrt>
  <m:mo>=</m:mo>
  <m:mo>.</m:mo>
  <m:mn>0715891</m:mn>
</m:math> Rounded to two decimal places,  <m:math><m:mi>s</m:mi>
  <m:mo>=</m:mo>
  <m:mn>0.72</m:mn>
</m:math></para><para id="element-917">Typically, you do the calculation for the standard deviation on your calculator or computer. The intermediate results are not rounded.  This is done for accuracy.</para>




  <exercise id="element-397"><problem>
  <para id="element-670">
    Verify the mean and standard deviation calculated above on your calculator or computer.  Find the median and mode.
  </para>
</problem>

<solution>
 <list id="element-3535" type="bulleted">
   <item>Median = 10.5</item>
   <item>Mode = 11</item>
 </list>
</solution>
</exercise><exercise id="element-554"><problem>
  <para id="element-265">Find the value that is 1 standard deviation above the mean. Find <m:math>
<m:mrow>
    <m:mo>(</m:mo>
    <m:mover>
      <m:mi>x</m:mi>
      <m:mo>¯</m:mo>
    </m:mover>
    <m:mo>+</m:mo>
    <m:mrow>
      <m:mn>1</m:mn>
      <m:mi>s</m:mi>
    </m:mrow>
    <m:mo>)</m:mo>
  </m:mrow>
</m:math>.
  </para>
</problem>

<solution>
  <para id="element-330"><m:math>
<m:mrow>
    <m:mo>(</m:mo>
    <m:mover>
      <m:mi>x</m:mi>
      <m:mo>¯</m:mo>
    </m:mover>
    <m:mo>+</m:mo>
    <m:mrow>
      <m:mn>1</m:mn>
      <m:mi>s</m:mi>
    </m:mrow>
    <m:mo>)</m:mo>
  </m:mrow>
  <m:mo>=</m:mo>
  <m:mn>10.53</m:mn>
  <m:mo>+</m:mo>
  <m:mo>(</m:mo>
  <m:mn>1</m:mn>
  <m:mo>)</m:mo>
  <m:mo>(</m:mo>
  <m:mn>0.72</m:mn>
  <m:mo>)</m:mo>
  <m:mo>=</m:mo>
  <m:mn>11.25</m:mn>
</m:math></para>
</solution>
</exercise><exercise id="element-126"><problem>
  <para id="element-88">
 Find the value that is two standard deviations below the mean.  Find <m:math>
<m:mrow>
    <m:mo>(</m:mo>
    <m:mover>
      <m:mi>x</m:mi>
      <m:mo>¯</m:mo>
    </m:mover>
    <m:mo>-</m:mo>
    <m:mrow>
      <m:mn>2</m:mn>
      <m:mi>s</m:mi>
    </m:mrow>
    <m:mo>)</m:mo>
  </m:mrow>
</m:math>.
  </para>
</problem>

<solution>
  <para id="element-421"><m:math>
<m:mrow>
    <m:mo>(</m:mo>
    <m:mover>
      <m:mi>x</m:mi>
      <m:mo>¯</m:mo>
    </m:mover>
    <m:mo>-</m:mo>
    <m:mrow>
      <m:mn>2</m:mn>
      <m:mi>s</m:mi>
    </m:mrow>
    <m:mo>)</m:mo>
  </m:mrow>
  <m:mo>=</m:mo>
  <m:mn>10.53</m:mn>
  <m:mo>-</m:mo>
  <m:mo>(</m:mo>
  <m:mn>2</m:mn>
  <m:mo>)</m:mo>
  <m:mo>(</m:mo>
  <m:mn>0.72</m:mn>
  <m:mo>)</m:mo>
  <m:mo>=</m:mo>
  <m:mn>9.09</m:mn>
</m:math></para>
</solution>
</exercise><exercise id="element-992"><problem>
  <para id="element-792">
Find the values that are 1.5 standard deviations <emphasis>from</emphasis> (below and above) the mean.
  </para>
</problem>

<solution>
  <list id="element-364" type="bulleted"><item><m:math>
<m:mrow>
    <m:mo>(</m:mo>
    <m:mover>
      <m:mi>x</m:mi>
      <m:mo>¯</m:mo>
    </m:mover>
    <m:mo>-</m:mo>
    <m:mrow>
      <m:mn>1.5</m:mn>
      <m:mi>s</m:mi>
    </m:mrow>
    <m:mo>)</m:mo>
  </m:mrow>
  <m:mo>=</m:mo>
  <m:mn>10.53</m:mn>
  <m:mo>-</m:mo>
  <m:mo>(</m:mo>
  <m:mn>1.5</m:mn>
  <m:mo>)</m:mo>
  <m:mo>(</m:mo>
  <m:mn>0.72</m:mn>
  <m:mo>)</m:mo>
  <m:mo>=</m:mo>
  <m:mn>9.45</m:mn>
</m:math>
</item>
<item><m:math>
<m:mrow>
    <m:mo>(</m:mo>
    <m:mover>
      <m:mi>x</m:mi>
      <m:mo>¯</m:mo>
    </m:mover>
    <m:mo>+</m:mo>
    <m:mrow>
      <m:mn>1.5</m:mn>
      <m:mi>s</m:mi>
    </m:mrow>
    <m:mo>)</m:mo>
  </m:mrow>
  <m:mo>=</m:mo>
  <m:mn>10.53</m:mn>
  <m:mo>+</m:mo>
  <m:mo>(</m:mo>
  <m:mn>1.5</m:mn>
  <m:mo>)</m:mo>
  <m:mo>(</m:mo>
  <m:mn>0.72</m:mn>
  <m:mo>)</m:mo>
  <m:mo>=</m:mo>
  <m:mn>11.61</m:mn>
</m:math></item>
</list>
</solution>
</exercise>

  
</example>

<para id="element-877"><emphasis>Explanation of the table:</emphasis> The deviations show how spread out the data are about the mean.  The value 11.5 is farther from the mean than 11.  The deviations  0.97 and 0.47 indicate that. <emphasis>If you add the deviations, the sum is always zero</emphasis>.  (For this example, there are 20 deviations.)  So you cannot simply add the deviations to get the spread of the data.  By squaring the deviations, you make them positive numbers.  The variance, then, is the average squared deviation.  It is small if the values are close to the mean and large if the values are far from the mean.</para><para id="element-969">The variance is a squared measure and does not have the same units as the data.  Taking the square root solves the problem.  The standard deviation measures the spread in the same units as the data.</para><para id="element-561">For the sample variance, we divide by the total number of data values minus one (<m:math><m:mi>n-1</m:mi></m:math>).  Why not divide by <m:math><m:mi>n</m:mi></m:math>?  The answer has to do with the population variance.  <emphasis>The sample variance is an estimate of the population variance.</emphasis>  By dividing by <m:math><m:mi>(n-1)</m:mi></m:math>, we get a better estimate of the population variance.</para><para id="element-628">Your concentration should be on what the standard deviation does, not on the arithmetic.  The standard deviation is a number which measures how far the data are spread from the mean.  Let a calculator or computer do the arithmetic.</para><para id="element-60">The sample standard deviation, <m:math><m:mi>s</m:mi></m:math>
, is either zero or larger than zero.  When   <m:math><m:mi>s</m:mi>
  <m:mo>=</m:mo>
  <m:mn>0</m:mn>
</m:math>, there is no spread.  When  <m:math><m:mi>s</m:mi></m:math>
  is a lot larger than zero, the data values are very spread out about the mean.  Outliers can make <m:math><m:mi>s</m:mi></m:math> very large.</para><para id="element-789">The standard deviation, when first presented, can seem unclear.   By graphing your data, you can get a better "feel" for the deviations and the standard deviation.  You will find that in symmetrical distributions, the standard deviation can be very helpful but in skewed distributions, the standard deviation may not be much help.  The reason is that the two sides of a skewed distribution have different spreads.  In a skewed distribution, it is better to look at the first quartile, the median, the third quartile, the smallest value, and the largest value.  Because numbers can be confusing, <emphasis>always graph your data</emphasis>.</para><note>The formula for the standard deviation is at the end of the chapter.</note><example id="element-649"><exercise id="exer2"><problem><para id="element-592">Use the following data (first exam scores) from Susan Dean's spring pre-calculus class:
</para><para id="element-961"><list id="set-432" type="inline"><item>33</item><item>42</item><item>49</item><item>49</item><item>53</item><item>55</item><item>55</item><item>61</item><item>  63</item><item>67</item><item>68</item><item>68</item><item>69</item><item>69</item><item>72</item><item>73</item><item>  74</item><item>78</item><item>80</item><item>83</item><item>88</item><item>88</item><item>88</item><item>90</item><item>  92</item><item>94</item><item>94</item><item>94</item><item>94</item><item>96</item><item>100  </item></list></para><list id="element-744" type="named-item"><?mark .?><item><name>a</name>Create a chart containing the data, frequencies, relative frequencies, and cumulative relative frequencies to three decimal places.</item>

<item><name>b</name>Calculate the following to one decimal place using a TI-83+ or TI-84 calculator:


<list id="element-9170" type="named-item"><?mark .?>
<item><name>i</name>The sample mean </item>
<item><name>ii</name>The sample standard deviation </item>
<item><name>iii</name>The median </item>
<item><name>iv</name>The first quartile </item>
<item><name>v</name>The third quartile </item>
<item><name>vi</name>IQR </item>
</list>
</item>

<item><name>c</name>Construct a box plot and a histogram on the same set of axes.  Make comments about the box plot, the histogram, and the chart.</item></list></problem><solution><list id="list1" type="named-item"><?mark .?><item><name>a</name>
<table id="id6947804">
<?table-summary This table presents the values listed above arranged with the data in the first column, frequency in the second column, relative frequency in the third column, and cumulative relative frequency in the fourth column.?>
<tgroup cols="4"><colspec colnum="1" colname="c1"/>
        <colspec colnum="2" colname="c2"/>
        <colspec colnum="3" colname="c3"/>
        <colspec colnum="4" colname="c4"/>
        <thead>
          <row>
            <entry>Data</entry>
            <entry>Frequency</entry>
            <entry>Relative Frequency</entry>
            <entry>Cumulative Relative Frequency</entry>
          </row>
         </thead>
         <tbody>
          <row>
            <entry>33</entry>
            <entry>1</entry>
            <entry>0.032</entry>
            <entry>0.032</entry>
          </row>
          <row>
            <entry>42</entry>
            <entry>1</entry>
            <entry>0.032</entry>
            <entry>0.064</entry>
          </row>
          <row>
            <entry>49</entry>
            <entry>2</entry>
            <entry>0.065</entry>
            <entry>0.129</entry>
          </row>
          <row>
            <entry>53</entry>
            <entry>1</entry>
            <entry>0.032</entry>
            <entry>0.161</entry>
          </row>
          <row>
            <entry>55</entry>
            <entry>2</entry>
            <entry>0.065</entry>
            <entry>0.226</entry>
          </row>
          <row>
            <entry>61</entry>
            <entry>1</entry>
            <entry>0.032</entry>
            <entry>0.258</entry>
          </row>
          <row>
            <entry>63</entry>
            <entry>1</entry>
            <entry>0.032</entry>
            <entry>0.29</entry>
          </row>
          <row>
            <entry>67</entry>
            <entry>1</entry>
            <entry>0.032</entry>
            <entry>0.322</entry>
          </row>
          <row>
            <entry>68</entry>
            <entry>2</entry>
            <entry>0.065</entry>
            <entry>0.387</entry>
          </row>
          <row>
            <entry>69</entry>
            <entry>2</entry>
            <entry>0.065</entry>
            <entry>0.452</entry>
          </row>
          <row>
            <entry>72</entry>
            <entry>1</entry>
            <entry>0.032</entry>
            <entry>0.484</entry>
          </row>
          <row>
            <entry>73</entry>
            <entry>1</entry>
            <entry>0.032</entry>
            <entry>0.516</entry>
          </row>
          <row>
            <entry>74</entry>
            <entry>1</entry>
            <entry>0.032</entry>
            <entry>0.548</entry>
          </row>
          <row>
            <entry>78</entry>
            <entry>1</entry>
            <entry>0.032</entry>
            <entry>0.580</entry>
          </row>
          <row>
            <entry>80</entry>
            <entry>1</entry>
            <entry>0.032</entry>
            <entry>0.612</entry>
          </row>
          <row>
            <entry>83</entry>
            <entry>1</entry>
            <entry>0.032</entry>
            <entry>0.644</entry>
          </row>
          <row>
            <entry>88</entry>
            <entry>3</entry>
            <entry>0.097</entry>
            <entry>0.741</entry>
          </row>
          <row>
            <entry>90</entry>
            <entry>1</entry>
            <entry>0.032</entry>
            <entry>0.773</entry>
          </row>
          <row>
            <entry>92</entry>
            <entry>1</entry>
            <entry>0.032</entry>
            <entry>0.805</entry>
          </row>
          <row>
            <entry>94</entry>
            <entry>4</entry>
            <entry>0.129</entry>
            <entry>0.934</entry>
          </row>
          <row>
            <entry>96</entry>
            <entry>1</entry>
            <entry>0.032</entry>
            <entry>0.966</entry>
          </row>
          <row>
            <entry>100</entry>
            <entry>1</entry>
            <entry>0.032</entry>
            <entry><emphasis>0.998</emphasis>  (Why isn't this value 1?)</entry>
          </row>
        </tbody>


</tgroup>
</table></item>

<item><name>b</name>
<list id="element-904534" type="named-item"><?mark .?><item><name>i</name>The sample mean = 73.5 </item>
<item><name>ii</name>The sample standard deviation = 17.9 </item>
<item><name>iii</name>The median = 73 </item>
<item><name>iv</name>The first quartile = 61 </item>
<item><name>v</name>The third quartile = 90 </item>
<item><name>vi</name>IQR = 90 - 61 = 29 </item>
</list></item>

</list> 
</solution></exercise><para id="element-242"><name>Chebyshev's Theorem</name>The proportion (or fraction) of any data set lying withing K standard deviations of the mean is always at least 1 - 1/K^2, where K is any positive number greater then 1. (Why is this?)

For K = 2, the proportion is 1 - 1/4 = 3/4, hence 3/4ths or 75% of the data falls within 2 standard deviations of the mean.

For K = 3, the proportion is 1 - 1/9 = 8/9, hence 8/9ths or 89% or the data falls within 3 standard deviations of the mean.

Using the data from the pre-calculus class exams and K = 2, this means that at least 75% of the scores fall between 73.5 - 2(17.9) and 73.5 + 2(17.9), or between 37.7 and 109.3.

In actual fact all but one data value falls in this range, however Chevyshev's Theorem gives the worst case scenario.</para></example><example id="element-387"><exercise id="exer1">
  <problem><para id="element-928">Two students, John and Ali, from different high schools, wanted to find out who had the highest G.P.A. when compared to his school.  Which student had the highest G.P.A. when compared to his school?
</para><table id="element-798">
<?table-summary This table provides two students and their GPAs. The first row represents John and the second row represents Ali. The first column lists students, second column lists GPA, third column lists school mean GPA, and the fourth column list the school standard deviation.?>
<tgroup cols="4"><thead>
  <row>
    <entry>Student</entry>
    <entry>GPA</entry>
    <entry>School Mean GPA</entry>
    <entry>School Standard Deviation</entry>
  </row>
</thead>
<tbody>
  <row>
    <entry>John</entry>
    <entry>2.85</entry>
    <entry>3.0</entry>
    <entry>0.7</entry>
  </row>
  <row>
    <entry>Ali</entry>
    <entry>77</entry>
    <entry>80</entry>
    <entry>10</entry>
  </row>
</tbody>
</tgroup>
</table></problem><solution><para id="element-71">Use the formula <emphasis>value = mean + (#ofSTDEVs)(stdev)</emphasis> and solve for #ofSTDEVs for each student (stdev = standard deviation):</para><para id="element-590"><m:math>
<m:mo>#</m:mo>
  <m:mi>ofSTDEVs</m:mi>
  <m:mo>=</m:mo>
  <m:mfrac>
    <m:mrow>
      <m:mi>value</m:mi>
      <m:mo>-</m:mo>
      <m:mi>mean</m:mi>
    </m:mrow>
    <m:mi>stdev</m:mi>
  </m:mfrac>
</m:math> :</para><para id="element-378">For John, <m:math>
<m:mo>#</m:mo>
  <m:mi>ofSTDEVs</m:mi>
  <m:mo>=</m:mo>
  <m:mfrac>
    <m:mrow>
      <m:mn>2.85</m:mn>
      <m:mo>-</m:mo>
      <m:mn>3.0</m:mn>
    </m:mrow>
    <m:mn>0.7</m:mn>
  </m:mfrac>
  <m:mo>=</m:mo>
  <m:mo>-</m:mo>
  <m:mn>0.21</m:mn>
</m:math></para><para id="element-712">For Ali, <m:math>
<m:mo>#</m:mo>
  <m:mtext>ofSTDEVs</m:mtext>
  <m:mo>=</m:mo>
  <m:mfrac>
    <m:mrow>
      <m:mn>77</m:mn>
      <m:mo>-</m:mo>
      <m:mn>80</m:mn>
    </m:mrow>
    <m:mn>10</m:mn>
  </m:mfrac>
  <m:mo>=</m:mo>
  <m:mo>-</m:mo>
  <m:mn>0.3</m:mn>
</m:math></para><para id="element-344">John has the better G.P.A. when compared to his school because his G.P.A. is 0.21 standard deviations <emphasis>below his </emphasis>mean while Ali's G.P.A. is 0.3 standard deviations <emphasis>below his</emphasis> mean. </para></solution></exercise>
</example><para id="element-769">Your concentration should be on what the standard deviation does, not on the arithmetic.  The standard deviation is a number which measures how far the data are spread from the mean.  Let a calculator or computer do the arithmetic.</para>  
  </content>
  <glossary>

<definition id="stddev">
    <term>Standard Deviation</term>
    <meaning>
A number that is equal to the square root of the variance and measures how far data values are from their mean. Notations: s for sample standard deviation and   <m:math><m:ci>σ</m:ci></m:math>for population standard deviation.
    </meaning>
  </definition>

<definition id="variance">
    <term>Variance</term>
    <meaning>
Mean of the squared deviations from the mean. Square of the standard deviation.
    </meaning>
  </definition>
</glossary>
</document>
