<?xml version="1.0" encoding="utf-8" standalone="no"?>
<!DOCTYPE document PUBLIC "-//CNX//DTD CNXML 0.5 plus MathML//EN" "http://cnx.rice.edu/technology/cnxml/schema/dtd/0.5/cnxml_mathml.dtd">
<document xmlns="http://cnx.rice.edu/cnxml" xmlns:md="http://cnx.rice.edu/mdml/0.4" xmlns:bib="http://bibtexml.sf.net/" xmlns:m="http://www.w3.org/1998/Math/MathML" id="new">
  <name>Descriptive Statistics: Measuring the Spread of the Data</name>
  <metadata>
  <md:version>1.5</md:version>
  <md:created>2008/06/26 11:36:57 GMT-5</md:created>
  <md:revised>2008/08/20 11:14:28.167 GMT-5</md:revised>
  <md:authorlist>
      <md:author id="billowsky">
      <md:firstname>Barbara</md:firstname>
      
      <md:surname>Illowsky</md:surname>
      <md:email>illowskybarbara@deanza.edu</md:email>
    </md:author>
      <md:author id="sdean">
      <md:firstname>Susan</md:firstname>
      
      <md:surname>Dean</md:surname>
      <md:email>deansusan@deanza.edu</md:email>
    </md:author>
  </md:authorlist>

  <md:maintainerlist>
    <md:maintainer id="cnxorg">
      <md:firstname/>
      
      <md:surname>Connexions</md:surname>
      <md:email>cnx@cnx.org</md:email>
    </md:maintainer>
  </md:maintainerlist>
  
  <md:keywordlist>
    <md:keyword>elementary</md:keyword>
    <md:keyword>statistics</md:keyword>
  </md:keywordlist>

  <md:abstract>This module describes a number of statistical measures used to describe data, such as percentiles, spread, and skewness.</md:abstract>
</metadata>
  <content>
   <para id="element-683">The most common measure of spread is the standard deviation.  The <term src="#stddev">standard deviation</term> is a number that measures how far data values are from their mean.  For example, if the mean of a set of data containing 7 is 5 and the <emphasis>standard deviation</emphasis> is 2, then the value 7 is one (1) standard deviation from its mean because  5 + (1)(2) = 7. </para><para id="element-517">The number line may help you understand standard deviation.  If we were to put 5 and 7 on a number line, 7 is to the right of 5.  We say, then, that 7 is <emphasis>one</emphasis> standard deviation to the <emphasis>right</emphasis> of 5.  If 1 were also part of the data set, then 1 is <emphasis>two</emphasis> standard deviations to the <emphasis>left</emphasis> of 5 because 5 +(-2)(2) = 1.  </para><para id="element-251">1=5+(-2)(2) ; 7=5+(1)(2)</para><para id="element-980"><media type="image/png" src="desc_stat_numline.png">
 <param name="alt" value="A number line labeled from 0 to 7."/>

 <param name="print-width" value="4in"/>
</media></para><para id="element-586"><emphasis>Formula:</emphasis> value = <m:math>
<m:apply>
  <m:conjugate/>
  <m:ci>x</m:ci>
</m:apply>
</m:math> + (#ofSTDEVs)(s)</para><para id="element-247">Generally, a value = mean + (#ofSTDEVs)(standard deviation), where #ofSTDEVs = the number of standard deviations.</para><para id="element-315">If <m:math><m:mi>x</m:mi></m:math> is a value and  <m:math>
<m:apply>
  <m:conjugate/>
  <m:ci>x</m:ci>
</m:apply>
</m:math>  is the sample mean, then  <m:math><m:mi>x</m:mi><m:mo>-</m:mo></m:math> <m:math>
<m:apply>
  <m:conjugate/>
  <m:ci>x</m:ci>
</m:apply>
</m:math> is called a deviation.  In a data set, there are as many deviations as there are data values.  Deviations are used to calculate the sample standard deviation.  </para><para id="element-121">To calculate the standard deviation, calculate the variance first.  The <term src="#variance">variance</term> is the average of the squares of the deviations.  The standard deviation is the square root of the variance.  You can think of the standard deviation as a special average of the deviations (the <m:math><m:mi>x</m:mi><m:mo>-</m:mo></m:math> <m:math>
<m:apply>
  <m:conjugate/>
  <m:ci>x</m:ci>
</m:apply>
</m:math> values).  The lower case letter <m:math><m:mi>s</m:mi></m:math> represents the sample standard deviation and the Greek letter <m:math><m:mi>σ</m:mi></m:math> (sigma) represents the population standard deviation.  We use <m:math>
 <m:msup>
    <m:mi>s</m:mi>
    <m:mn>2</m:mn>
  </m:msup>
</m:math> to represent the sample variance and <m:math>
 <m:msup>
    <m:mi>σ</m:mi>
    <m:mn>2</m:mn>
  </m:msup>
</m:math> to represent the population variance.  If the sample has the same characteristics as the population, then s should be a good estimate of <m:math><m:mi>σ</m:mi></m:math>.</para><note>In practice, use either a calculator or computer software to calculate the standard deviation.  However, please study the following step-by-step example.</note>

<example id="element-655">

<para id="element-440">In a fifth grade class, the teacher was interested in the average age and the standard deviation of the ages of her students.  What follows are the ages of her students to the nearest half year: 

</para><para id="element-433"><list id="element-124125" type="inline"><item>9  </item><item>  9.5  </item><item>  9.5  </item><item>  10  </item><item>  10  </item><item>  10  </item><item>  10  </item><item>  10.5  </item><item>  10.5  </item><item>  10.5  </item><item>  10.5  </item><item>  11  </item><item>  11  </item><item>  11  </item><item>  11  </item><item>  11  </item><item>  11  </item><item>  11.5  </item><item>  11.5  </item><item>  11.5</item></list></para><equation id="element-320"><m:math>
<m:mover>
    <m:mi>x</m:mi>
    <m:mo>¯</m:mo>
  </m:mover>
  <m:mo>=</m:mo>
  <m:mfrac>
    <m:mrow>
      <m:mn>9</m:mn>
      <m:mo>+</m:mo>
      <m:mrow>
        <m:mn>9.5</m:mn>
        <m:mo>×</m:mo>
        <m:mrow>
          <m:mn>2</m:mn>
          <m:mo>+</m:mo>
          <m:mrow>
            <m:mn>10</m:mn>
            <m:mo>×</m:mo>
            <m:mn>4</m:mn>
          </m:mrow>
          <m:mo>+</m:mo>
          <m:mrow>
            <m:mn>10.5</m:mn>
            <m:mo>×</m:mo>
            <m:mn>4</m:mn>
          </m:mrow>
          <m:mo>+</m:mo>
          <m:mrow>
            <m:mn>11</m:mn>
            <m:mo>×</m:mo>
            <m:mn>6</m:mn>
          </m:mrow>
        </m:mrow>
      </m:mrow>
      <m:mo>+</m:mo>
      <m:mrow>
        <m:mn>11.5</m:mn>
        <m:mo>×</m:mo>
        <m:mn>3</m:mn>
      </m:mrow>
    </m:mrow>
    <m:mn>20</m:mn>
  </m:mfrac>
  <m:mo>=</m:mo>
  <m:mn>10.525</m:mn>
</m:math></equation><para id="element-642">The average age is 10.53 years, rounded to 2 places.</para><para id="element-442">The variance may be calculated by using a table.  Then the standard deviation is calculated by taking the square root of the variance.  We will explain the parts of the table after calculating <m:math><m:mi>s</m:mi></m:math>.</para><table id="element-978">
<?table-summary This table presents the formulas and calculations of various values. The first column has the data, second column has frequency, third column has deviations, fourth column has deviations squared, fifth column has frequency times deviations squared. There are 6 rows of values.?>
<tgroup cols="5"><thead>
  <row>
    <entry>Data</entry>
    <entry>Freq.</entry>
    <entry>Deviations</entry>
    <entry><m:math>
 <m:msup>
    <m:mi>Deviations</m:mi>
    <m:mn>2</m:mn>
  </m:msup>
</m:math></entry>
    <entry>(Freq.)(<m:math>
 <m:msup>
    <m:mi>Deviations</m:mi>
    <m:mn>2</m:mn>
  </m:msup>
</m:math>)</entry>
  </row>
  </thead>
  <tbody>
  <row>
    <entry><m:math><m:mi>x</m:mi></m:math></entry>
    <entry><m:math><m:mi>f</m:mi></m:math></entry>
    <entry><m:math><m:mo>(</m:mo><m:mi>x</m:mi><m:mo>-</m:mo><m:apply>
  <m:conjugate/>
  <m:ci>x</m:ci>
</m:apply><m:mo>)</m:mo></m:math></entry>
    <entry><m:math>
<m:msup>
    <m:mrow>
      <m:mo>(</m:mo>
      <m:mi>x</m:mi>
      <m:mo>-</m:mo>
      <m:mover>
        <m:mi>x</m:mi>
        <m:mo>¯</m:mo>
      </m:mover>
      <m:mo>)</m:mo>
    </m:mrow>
    <m:mn>2</m:mn>
  </m:msup>
</m:math></entry>
    <entry><m:math>
 <m:mrow>
    <m:mo>(</m:mo>
    <m:mi>f</m:mi>
    <m:mo>)</m:mo>
  </m:mrow>
  <m:msup>
    <m:mrow>
      <m:mo>(</m:mo>
      <m:mi>x</m:mi>
      <m:mo>-</m:mo>
      <m:mover>
        <m:mi>x</m:mi>
        <m:mo>¯</m:mo>
      </m:mover>
      <m:mo>)</m:mo>
    </m:mrow>
    <m:mn>2</m:mn>
  </m:msup>
</m:math></entry>
  </row>
  <row>
    <entry><m:math><m:mi>9</m:mi></m:math></entry>
    <entry><m:math><m:mi>1</m:mi></m:math></entry>
    <entry><m:math>
 <m:mn>9</m:mn>
  <m:mo>-</m:mo>
  <m:mn>10.525</m:mn>
  <m:mo>=</m:mo>
  <m:mo>-</m:mo>
  <m:mn>1.525</m:mn>
</m:math></entry>
    <entry><m:math>
<m:msup>
    <m:mrow>
      <m:mo>(</m:mo>
      <m:mo>-</m:mo>
      <m:mn>1.525</m:mn>
      <m:mo>)</m:mo>
    </m:mrow>
    <m:mn>2</m:mn>
  </m:msup>
  <m:mo>=</m:mo>
  <m:mn>2.325625</m:mn>
</m:math></entry>
    <entry><m:math>
<m:mrow>
    <m:mn>1</m:mn>
    <m:mo>×</m:mo>
    <m:mrow>
      <m:mn>2.325625</m:mn>
      <m:mo>=</m:mo>
      <m:mn>2.325625</m:mn>
    </m:mrow>
  </m:mrow>
</m:math></entry>
  </row>
  <row>
    <entry><m:math><m:mi>9.5</m:mi></m:math></entry>
    <entry><m:math><m:mi>2</m:mi></m:math></entry>
    <entry><m:math>
 <m:mn>9.5</m:mn>
  <m:mo>-</m:mo>
  <m:mn>10.525</m:mn>
  <m:mo>=</m:mo>
  <m:mo>-</m:mo>
  <m:mn>1.025</m:mn>
</m:math></entry>
    <entry><m:math>
<m:msup>
    <m:mrow>
      <m:mo>(</m:mo>
      <m:mo>-</m:mo>
      <m:mn>1.025</m:mn>
      <m:mo>)</m:mo>
    </m:mrow>
    <m:mn>2</m:mn>
  </m:msup>
  <m:mo>=</m:mo>
  <m:mn>1.050625</m:mn>
</m:math></entry>
    <entry><m:math>
<m:mrow>
    <m:mn>2</m:mn>
    <m:mo>×</m:mo>
    <m:mrow>
      <m:mn>1.050625</m:mn>
      <m:mo>=</m:mo>
      <m:mn>2.101250</m:mn>
    </m:mrow>
  </m:mrow>
</m:math></entry>
  </row>
  <row>
    <entry><m:math><m:mi>10</m:mi></m:math></entry>
    <entry><m:math><m:mi>4</m:mi></m:math></entry>
    <entry><m:math>
 <m:mn>10</m:mn>
  <m:mo>-</m:mo>
  <m:mn>10.525</m:mn>
  <m:mo>=</m:mo>
  <m:mo>-</m:mo>
  <m:mn>0.525</m:mn>
</m:math></entry>
    <entry><m:math>
<m:msup>
    <m:mrow>
      <m:mo>(</m:mo>
      <m:mo>-</m:mo>
      <m:mn>0.525</m:mn>
      <m:mo>)</m:mo>
    </m:mrow>
    <m:mn>2</m:mn>
  </m:msup>
  <m:mo>=</m:mo>
  <m:mn>0.275625</m:mn>
</m:math></entry>
    <entry><m:math>
<m:mrow>
    <m:mn>4</m:mn>
    <m:mo>×</m:mo>
    <m:mrow>
      <m:mn>.275625</m:mn>
      <m:mo>=</m:mo>
      <m:mn>1.1025</m:mn>
    </m:mrow>
  </m:mrow>
</m:math></entry>
  </row>
  <row>
    <entry><m:math><m:mi>10.5</m:mi></m:math></entry>
    <entry><m:math><m:mi>4</m:mi></m:math></entry>
    <entry><m:math>
 <m:mn>10.5</m:mn>
  <m:mo>-</m:mo>
  <m:mn>10.525</m:mn>
  <m:mo>=</m:mo>
  <m:mo>-</m:mo>
  <m:mn>0.025</m:mn>
</m:math></entry>
    <entry><m:math>
<m:msup>
    <m:mrow>
      <m:mo>(</m:mo>
      <m:mo>-</m:mo>
      <m:mn>0.025</m:mn>
      <m:mo>)</m:mo>
    </m:mrow>
    <m:mn>2</m:mn>
  </m:msup>
  <m:mo>=</m:mo>
  <m:mn>0.000625</m:mn>
</m:math></entry>
    <entry><m:math>
<m:mrow>
    <m:mn>4</m:mn>
    <m:mo>×</m:mo>
    <m:mrow>
      <m:mn>.000625</m:mn>
      <m:mo>=</m:mo>
      <m:mn>.0025</m:mn>
    </m:mrow>
  </m:mrow>
</m:math></entry>
  </row>
  <row>
    <entry><m:math><m:mi>11</m:mi></m:math></entry>
    <entry><m:math><m:mi>6</m:mi></m:math></entry>
    <entry><m:math>
 <m:mn>11</m:mn>
  <m:mo>-</m:mo>
  <m:mn>10.525</m:mn>
  <m:mo>=</m:mo>
  <m:mn>0.475</m:mn>
</m:math></entry>
    <entry><m:math>
<m:msup>
    <m:mrow>
      <m:mo>(</m:mo>
      <m:mn>0.475</m:mn>
      <m:mo>)</m:mo>
    </m:mrow>
    <m:mn>2</m:mn>
  </m:msup>
  <m:mo>=</m:mo>
  <m:mn>0.225625</m:mn>
</m:math></entry>
    <entry><m:math>
<m:mrow>
    <m:mn>6</m:mn>
    <m:mo>×</m:mo>
    <m:mrow>
      <m:mn>.225625</m:mn>
      <m:mo>=</m:mo>
      <m:mn>1.35375</m:mn>
    </m:mrow>
  </m:mrow>
</m:math></entry>
  </row>
  <row>
    <entry><m:math><m:mi>11.5</m:mi></m:math></entry>
    <entry><m:math><m:mi>3</m:mi></m:math></entry>
    <entry><m:math>
 <m:mn>11.5</m:mn>
  <m:mo>-</m:mo>
  <m:mn>10.525</m:mn>
  <m:mo>=</m:mo>
  <m:mn>0.975</m:mn>
</m:math></entry>
    <entry><m:math>
<m:msup>
    <m:mrow>
      <m:mo>(</m:mo>
      <m:mn>0.975</m:mn>
      <m:mo>)</m:mo>
    </m:mrow>
    <m:mn>2</m:mn>
  </m:msup>
  <m:mo>=</m:mo>
  <m:mn>0.950625</m:mn>
</m:math></entry>
    <entry><m:math>
<m:mrow>
    <m:mn>3</m:mn>
    <m:mo>×</m:mo>
    <m:mrow>
      <m:mn>.950625</m:mn>
      <m:mo>=</m:mo>
      <m:mn>2.851875</m:mn>
    </m:mrow>
  </m:mrow>
</m:math></entry>
  </row>
</tbody>




</tgroup>
</table>

<para id="element-367">The sample variance, <m:math>
 <m:msup>
    <m:mi>s</m:mi>
    <m:mn>2</m:mn>
  </m:msup>
</m:math>, is equal to the last sum (9.7375) divided by the total number of data values minus one (20 - 1):</para><para id="element-631"><m:math>
<m:msup>
    <m:mi>s</m:mi>
    <m:mn>2</m:mn>
  </m:msup>
  <m:mo>=</m:mo>
  <m:mfrac>
    <m:mn>9.7375</m:mn>
    <m:mrow>
      <m:mn>20</m:mn>
      <m:mo>-</m:mo>
      <m:mn>1</m:mn>
    </m:mrow>
  </m:mfrac>
  <m:mo>=</m:mo>
  <m:mn>0.5125</m:mn>
</m:math></para><para id="element-424">The sample standard deviation, <m:math><m:mi>s</m:mi></m:math>,  is equal to the square root of the sample variance:</para><para id="element-127"><m:math>
<m:mi>s</m:mi>
  <m:mo>=</m:mo>
  <m:msqrt>
    <m:mn>0.5125</m:mn>
  </m:msqrt>
  <m:mo>=</m:mo>
  <m:mo>.</m:mo>
  <m:mn>0715891</m:mn>
</m:math> Rounded to two decimal places,  <m:math><m:mi>s</m:mi>
  <m:mo>=</m:mo>
  <m:mn>0.72</m:mn>
</m:math></para><para id="element-917">Typically, you do the calculation for the standard deviation on your calculator or computer. The intermediate results are not rounded.  This is done for accuracy.</para>




  <exercise id="element-397"><problem>
  <para id="element-670">
    Verify the mean and standard deviation calculated above on your calculator or computer.  Find the median and mode.
  </para>
</problem>

<solution>
 <list id="element-3535" type="bulleted">
   <item>Median = 10.5</item>
   <item>Mode = 11</item>
 </list>
</solution>
</exercise><exercise id="element-554"><problem>
  <para id="element-265">Find the value that is 1 standard deviation above the mean. Find <m:math>
<m:mrow>
    <m:mo>(</m:mo>
    <m:mover>
      <m:mi>x</m:mi>
      <m:mo>¯</m:mo>
    </m:mover>
    <m:mo>+</m:mo>
    <m:mrow>
      <m:mn>1</m:mn>
      <m:mi>s</m:mi>
    </m:mrow>
    <m:mo>)</m:mo>
  </m:mrow>
</m:math>.
  </para>
</problem>

<solution>
  <para id="element-330"><m:math>
<m:mrow>
    <m:mo>(</m:mo>
    <m:mover>
      <m:mi>x</m:mi>
      <m:mo>¯</m:mo>
    </m:mover>
    <m:mo>+</m:mo>
    <m:mrow>
      <m:mn>1</m:mn>
      <m:mi>s</m:mi>
    </m:mrow>
    <m:mo>)</m:mo>
  </m:mrow>
  <m:mo>=</m:mo>
  <m:mn>10.53</m:mn>
  <m:mo>+</m:mo>
  <m:mo>(</m:mo>
  <m:mn>1</m:mn>
  <m:mo>)</m:mo>
  <m:mo>(</m:mo>
  <m:mn>0.72</m:mn>
  <m:mo>)</m:mo>
  <m:mo>=</m:mo>
  <m:mn>11.25</m:mn>
</m:math></para>
</solution>
</exercise><exercise id="element-126"><problem>
  <para id="element-88">
 Find the value that is two standard deviations below the mean.  Find <m:math>
<m:mrow>
    <m:mo>(</m:mo>
    <m:mover>
      <m:mi>x</m:mi>
      <m:mo>¯</m:mo>
    </m:mover>
    <m:mo>-</m:mo>
    <m:mrow>
      <m:mn>2</m:mn>
      <m:mi>s</m:mi>
    </m:mrow>
    <m:mo>)</m:mo>
  </m:mrow>
</m:math>.
  </para>
</problem>

<solution>
  <para id="element-421"><m:math>
<m:mrow>
    <m:mo>(</m:mo>
    <m:mover>
      <m:mi>x</m:mi>
      <m:mo>¯</m:mo>
    </m:mover>
    <m:mo>-</m:mo>
    <m:mrow>
      <m:mn>2</m:mn>
      <m:mi>s</m:mi>
    </m:mrow>
    <m:mo>)</m:mo>
  </m:mrow>
  <m:mo>=</m:mo>
  <m:mn>10.53</m:mn>
  <m:mo>-</m:mo>
  <m:mo>(</m:mo>
  <m:mn>2</m:mn>
  <m:mo>)</m:mo>
  <m:mo>(</m:mo>
  <m:mn>0.72</m:mn>
  <m:mo>)</m:mo>
  <m:mo>=</m:mo>
  <m:mn>9.09</m:mn>
</m:math></para>
</solution>
</exercise><exercise id="element-992"><problem>
  <para id="element-792">
Find the values that are 1.5 standard deviations <emphasis>from</emphasis> (below and above) the mean.
  </para>
</problem>

<solution>
  <list id="element-364" type="bulleted"><item><m:math>
<m:mrow>
    <m:mo>(</m:mo>
    <m:mover>
      <m:mi>x</m:mi>
      <m:mo>¯</m:mo>
    </m:mover>
    <m:mo>-</m:mo>
    <m:mrow>
      <m:mn>1.5</m:mn>
      <m:mi>s</m:mi>
    </m:mrow>
    <m:mo>)</m:mo>
  </m:mrow>
  <m:mo>=</m:mo>
  <m:mn>10.53</m:mn>
  <m:mo>-</m:mo>
  <m:mo>(</m:mo>
  <m:mn>1.5</m:mn>
  <m:mo>)</m:mo>
  <m:mo>(</m:mo>
  <m:mn>0.72</m:mn>
  <m:mo>)</m:mo>
  <m:mo>=</m:mo>
  <m:mn>9.45</m:mn>
</m:math>
</item>
<item><m:math>
<m:mrow>
    <m:mo>(</m:mo>
    <m:mover>
      <m:mi>x</m:mi>
      <m:mo>¯</m:mo>
    </m:mover>
    <m:mo>+</m:mo>
    <m:mrow>
      <m:mn>1.5</m:mn>
      <m:mi>s</m:mi>
    </m:mrow>
    <m:mo>)</m:mo>
  </m:mrow>
  <m:mo>=</m:mo>
  <m:mn>10.53</m:mn>
  <m:mo>+</m:mo>
  <m:mo>(</m:mo>
  <m:mn>1.5</m:mn>
  <m:mo>)</m:mo>
  <m:mo>(</m:mo>
  <m:mn>0.72</m:mn>
  <m:mo>)</m:mo>
  <m:mo>=</m:mo>
  <m:mn>11.61</m:mn>
</m:math></item>
</list>
</solution>
</exercise>

  
</example>

<para id="element-877"><emphasis>Explanation of the table:</emphasis> The deviations show how spread out the data are about the mean.  The value 11.5 is farther from the mean than 11.  The deviations  0.97 and 0.47 indicate that. <emphasis>If you add the deviations, the sum is always zero</emphasis>.  (For this example, there are 20 deviations.)  So you cannot simply add the deviations to get the spread of the data.  By squaring the deviations, you make them positive numbers.  The variance, then, is the average squared deviation.  It is small if the values are close to the mean and large if the values are far from the mean.</para><para id="element-969">The variance is a squared measure and does not have the same units as the data.  Taking the square root solves the problem.  The standard deviation measures the spread in the same units as the data.</para><para id="element-561">For the sample variance, we divide by the total number of data values minus one (<m:math><m:mi>n-1</m:mi></m:math>).  Why not divide by <m:math><m:mi>n</m:mi></m:math>?  The answer has to do with the population variance.  <emphasis>The sample variance is an estimate of the population variance.</emphasis>  By dividing by <m:math><m:mi>(n-1)</m:mi></m:math>, we get a better estimate of the population variance.</para><para id="element-628">Your concentration should be on what the standard deviation does, not on the arithmetic.  The standard deviation is a number which measures how far the data are spread from the mean.  Let a calculator or computer do the arithmetic.</para><para id="element-60">The sample standard deviation, <m:math><m:mi>s</m:mi></m:math>
, is either zero or larger than zero.  When   <m:math><m:mi>s</m:mi>
  <m:mo>=</m:mo>
  <m:mn>0</m:mn>
</m:math>, there is no spread.  When  <m:math><m:mi>s</m:mi></m:math>
  is a lot larger than zero, the data values are very spread out about the mean.  Outliers can make <m:math><m:mi>s</m:mi></m:math> very large.</para><para id="element-789">The standard deviation, when first presented, can seem unclear.   By graphing your data, you can get a better "feel" for the deviations and the standard deviation.  You will find that in symmetrical distributions, the standard deviation can be very helpful but in skewed distributions, the standard deviation may not be much help.  The reason is that the two sides of a skewed distribution have different spreads.  In a skewed distribution, it is better to look at the first quartile, the median, the third quartile, the smallest value, and the largest value.  Because numbers can be confusing, <emphasis>always graph your data</emphasis>.</para><note>The formula for the standard deviation is at the end of the chapter.</note><example id="element-649"><exercise id="exer2"><problem><para id="element-592">Use the following data (first exam scores) from Susan Dean's spring pre-calculus class:
</para><para id="element-961"><list id="set-432" type="inline"><item>33</item><item>42</item><item>49</item><item>49</item><item>53</item><item>55</item><item>55</item><item>61</item><item>  63</item><item>67</item><item>68</item><item>68</item><item>69</item><item>69</item><item>72</item><item>73</item><item>  74</item><item>78</item><item>80</item><item>83</item><item>88</item><item>88</item><item>88</item><item>90</item><item>  92</item><item>94</item><item>94</item><item>94</item><item>94</item><item>96</item><item>100  </item></list></para><list id="element-744" type="named-item"><?mark .?><item><name>a</name>Create a chart containing the data, frequencies, relative frequencies, and cumulative relative frequencies to three decimal places.</item>

<item><name>b</name>Calculate the following to one decimal place using a TI-83+ or TI-84 calculator:


<list id="element-9170" type="named-item"><?mark .?>
<item><name>i</name>The sample mean </item>
<item><name>ii</name>The sample standard deviation </item>
<item><name>iii</name>The median </item>
<item><name>iv</name>The first quartile </item>
<item><name>v</name>The third quartile </item>
<item><name>vi</name>IQR </item>
</list>
</item>

<item><name>c</name>Construct a box plot and a histogram on the same set of axes.  Make comments about the box plot, the histogram, and the chart.</item></list></problem><solution><list id="list1" type="named-item"><?mark .?><item><name>a</name>
<table id="id6947804">
<?table-summary This table presents the values listed above arranged with the data in the first column, frequency in the second column, relative frequency in the third column, and cumulative relative frequency in the fourth column.?>
<tgroup cols="4"><colspec colnum="1" colname="c1"/>
        <colspec colnum="2" colname="c2"/>
        <colspec colnum="3" colname="c3"/>
        <colspec colnum="4" colname="c4"/>
        <thead>
          <row>
            <entry>Data</entry>
            <entry>Frequency</entry>
            <entry>Relative Frequency</entry>
            <entry>Cumulative Relative Frequency</entry>
          </row>
         </thead>
         <tbody>
          <row>
            <entry>33</entry>
            <entry>1</entry>
            <entry>0.032</entry>
            <entry>0.032</entry>
          </row>
          <row>
            <entry>42</entry>
            <entry>1</entry>
            <entry>0.032</entry>
            <entry>0.064</entry>
          </row>
          <row>
            <entry>49</entry>
            <entry>2</entry>
            <entry>0.065</entry>
            <entry>0.129</entry>
          </row>
          <row>
            <entry>53</entry>
            <entry>1</entry>
            <entry>0.032</entry>
            <entry>0.161</entry>
          </row>
          <row>
            <entry>55</entry>
            <entry>2</entry>
            <entry>0.065</entry>
            <entry>0.226</entry>
          </row>
          <row>
            <entry>61</entry>
            <entry>1</entry>
            <entry>0.032</entry>
            <entry>0.258</entry>
          </row>
          <row>
            <entry>63</entry>
            <entry>1</entry>
            <entry>0.032</entry>
            <entry>0.29</entry>
          </row>
          <row>
            <entry>67</entry>
            <entry>1</entry>
            <entry>0.032</entry>
            <entry>0.322</entry>
          </row>
          <row>
            <entry>68</entry>
            <entry>2</entry>
            <entry>0.065</entry>
            <entry>0.387</entry>
          </row>
          <row>
            <entry>69</entry>
            <entry>2</entry>
            <entry>0.065</entry>
            <entry>0.452</entry>
          </row>
          <row>
            <entry>72</entry>
            <entry>1</entry>
            <entry>0.032</entry>
            <entry>0.484</entry>
          </row>
          <row>
            <entry>73</entry>
            <entry>1</entry>
            <entry>0.032</entry>
            <entry>0.516</entry>
          </row>
          <row>
            <entry>74</entry>
            <entry>1</entry>
            <entry>0.032</entry>
            <entry>0.548</entry>
          </row>
          <row>
            <entry>78</entry>
            <entry>1</entry>
            <entry>0.032</entry>
            <entry>0.580</entry>
          </row>
          <row>
            <entry>80</entry>
            <entry>1</entry>
            <entry>0.032</entry>
            <entry>0.612</entry>
          </row>
          <row>
            <entry>83</entry>
            <entry>1</entry>
            <entry>0.032</entry>
            <entry>0.644</entry>
          </row>
          <row>
            <entry>88</entry>
            <entry>3</entry>
            <entry>0.097</entry>
            <entry>0.741</entry>
          </row>
          <row>
            <entry>90</entry>
            <entry>1</entry>
            <entry>0.032</entry>
            <entry>0.773</entry>
          </row>
          <row>
            <entry>92</entry>
            <entry>1</entry>
            <entry>0.032</entry>
            <entry>0.805</entry>
          </row>
          <row>
            <entry>94</entry>
            <entry>4</entry>
            <entry>0.129</entry>
            <entry>0.934</entry>
          </row>
          <row>
            <entry>96</entry>
            <entry>1</entry>
            <entry>0.032</entry>
            <entry>0.966</entry>
          </row>
          <row>
            <entry>100</entry>
            <entry>1</entry>
            <entry>0.032</entry>
            <entry><emphasis>0.998</emphasis>  (Why isn't this value 1?)</entry>
          </row>
        </tbody>


</tgroup>
</table></item>

<item><name>b</name>
<list id="element-904534" type="named-item"><?mark .?><item><name>i</name>The sample mean = 73.5 </item>
<item><name>ii</name>The sample standard deviation = 17.9 </item>
<item><name>iii</name>The median = 73 </item>
<item><name>iv</name>The first quartile = 61 </item>
<item><name>v</name>The third quartile = 90 </item>
<item><name>vi</name>IQR = 90 - 61 = 29 </item>
</list></item>

<item><name>c</name>The x-axis goes from 32.5 to 100.5;  y-axis goes from -2.4 to 15 for the histogram;  number of intervals is 5 for the histogram so the width of an interval is (100.5 - 32.5) divided by 5 which is equal to 13.6.   Endpoints of the intervals:  starting point is 32.5,  32.5+13.6 = 46.1,  46.1+13.6 = 59.7,  59.7+13.6 = 73.3,  73.3+13.6 = 86.9,  86.9+13.6 = 100.5 = the ending value;  No data values fall on an interval boundary.

<figure><media type="image/jpeg" src="Ch2_boxplot_4.png">
 <param name="alt" value="A hybrid image displaying both a histogram and box plot described in detail in the answer solution above."/>


 <param name="print-width" value="5in"/>

</media></figure>
</item></list> 
</solution></exercise><para id="element-242">The long left whisker in the box plot is reflected in the left side of the histogram.  The spread of the exam scores in the lower 50% is greater (73 - 33 = 40) than the spread in the upper 50% (100 - 73 = 27).  The histogram, box plot, and chart all reflect this.  There are a substantial number of A and B grades (80s, 90s, and 100).  The histogram clearly shows this.  The box plot shows us that the middle 50% of the exam scores  (IQR = 29) are Ds, Cs, and Bs.  The box plot also shows us that the lower 25% of the exam scores are Ds and Fs.</para></example><example id="element-387"><exercise id="exer1">
  <problem><para id="element-928">Two students, John and Ali, from different high schools, wanted to find out who had the highest G.P.A. when compared to his school.  Which student had the highest G.P.A. when compared to his school?
</para><table id="element-798">
<?table-summary This table provides two students and their GPAs. The first row represents John and the second row represents Ali. The first column lists students, second column lists GPA, third column lists school mean GPA, and the fourth column list the school standard deviation.?>
<tgroup cols="4"><thead>
  <row>
    <entry>Student</entry>
    <entry>GPA</entry>
    <entry>School Mean GPA</entry>
    <entry>School Standard Deviation</entry>
  </row>
</thead>
<tbody>
  <row>
    <entry>John</entry>
    <entry>2.85</entry>
    <entry>3.0</entry>
    <entry>0.7</entry>
  </row>
  <row>
    <entry>Ali</entry>
    <entry>77</entry>
    <entry>80</entry>
    <entry>10</entry>
  </row>
</tbody>
</tgroup>
</table></problem><solution><para id="element-71">Use the formula <emphasis>value = mean + (#ofSTDEVs)(stdev)</emphasis> and solve for #ofSTDEVs for each student (stdev = standard deviation):</para><para id="element-590"><m:math>
<m:mo>#</m:mo>
  <m:mi>ofSTDEVs</m:mi>
  <m:mo>=</m:mo>
  <m:mfrac>
    <m:mrow>
      <m:mi>value</m:mi>
      <m:mo>-</m:mo>
      <m:mi>mean</m:mi>
    </m:mrow>
    <m:mi>stdev</m:mi>
  </m:mfrac>
</m:math> :</para><para id="element-378">For John, <m:math>
<m:mo>#</m:mo>
  <m:mi>ofSTDEVs</m:mi>
  <m:mo>=</m:mo>
  <m:mfrac>
    <m:mrow>
      <m:mn>2.85</m:mn>
      <m:mo>-</m:mo>
      <m:mn>3.0</m:mn>
    </m:mrow>
    <m:mn>0.7</m:mn>
  </m:mfrac>
  <m:mo>=</m:mo>
  <m:mo>-</m:mo>
  <m:mn>0.21</m:mn>
</m:math></para><para id="element-712">For Ali, <m:math>
<m:mo>#</m:mo>
  <m:mtext>ofSTDEVs</m:mtext>
  <m:mo>=</m:mo>
  <m:mfrac>
    <m:mrow>
      <m:mn>77</m:mn>
      <m:mo>-</m:mo>
      <m:mn>80</m:mn>
    </m:mrow>
    <m:mn>10</m:mn>
  </m:mfrac>
  <m:mo>=</m:mo>
  <m:mo>-</m:mo>
  <m:mn>0.3</m:mn>
</m:math></para><para id="element-344">John has the better G.P.A. when compared to his school because his G.P.A. is 0.21 standard deviations <emphasis>below his </emphasis>mean while Ali's G.P.A. is 0.3 standard deviations <emphasis>below his</emphasis> mean. </para></solution></exercise>
</example>  
  </content>
  <glossary>

<definition id="stddev">
    <term>Standard Deviation</term>
    <meaning>
A number that is equal to the square root of the variance and measures how far data values are from their mean. Notations: s for sample standard deviation and   <m:math><m:ci>σ</m:ci></m:math>for population standard deviation.
    </meaning>
  </definition>

<definition id="variance">
    <term>Variance</term>
    <meaning>
Mean of the squared deviations from the mean. Square of the standard deviation.
    </meaning>
  </definition>
</glossary>
</document>
