<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE document PUBLIC "-//CNX//DTD CNXML 0.5 plus MathML//EN" "http://cnx.rice.edu/cnxml/0.5/DTD/cnxml_mathml.dtd">
<document xmlns="http://cnx.rice.edu/cnxml" xmlns:md="http://cnx.rice.edu/mdml/0.4" xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/" id="id8439828">
  <name>What's Normal?</name>
  <metadata>
  <md:version>1.1</md:version>
  <md:created>2008/06/11 12:25:02.234 GMT-5</md:created>
  <md:revised>2008/06/26 16:07:29.832 GMT-5</md:revised>
  <md:authorlist>
      <md:author id="IMP2">
      <md:firstname/>
      
      <md:surname>IMP</md:surname>
      <md:email>cosborne@keypress.com</md:email>
    </md:author>
  </md:authorlist>

  <md:maintainerlist>
    <md:maintainer id="IMP2">
      <md:firstname/>
      
      <md:surname>IMP</md:surname>
      <md:email>cosborne@keypress.com</md:email>
    </md:maintainer>
    <md:maintainer id="KCP">
      <md:firstname/>
      
      <md:surname>Key</md:surname>
      <md:email>cosborne@keypress.com</md:email>
    </md:maintainer>
  </md:maintainerlist>
  
  <md:keywordlist>
    <md:keyword>IMP Year 1</md:keyword>
    <md:keyword>The Pit and the Pendulum</md:keyword>
  </md:keywordlist>

  <md:abstract/>
</metadata>
  <content>
    <section id="id-658820557855">
      <name>Intent</name>
      <para id="id8249956">This activity, and the discussion that leads into it, introduces students to the normal distribution.</para>
    </section>
    <section id="id-638327438331">
      <name>Mathematics</name>
      <para id="id8309180">The <term><cnxn document="m15620">normal distribution</cnxn></term> (also called the <emphasis>Gaussian distribution</emphasis>) is the technical name for what many call the <emphasis>bell curve</emphasis>. Of the many ways that data may be distributed, the normal distribution is of particular interest and is useful in many statistical situations. For example, many types of data related to people—such as the heights or shoe sizes of adult men or women—are approximately normally distributed. The normal distribution is a specific type of bell-shaped frequency pattern, with a precise, technical mathematical definition. Additionally, <term><cnxn document="m15620">measurement variation</cnxn></term> is approximately normally distributed. It is for this last reason that the normal distribution is introduced in this unit.</para>
    </section>
    <section id="id-715915222865">
      <name>Progression</name>
      <para id="id8415439">After a teacher-led introduction to the normal distribution, students work individually to create graphs of surmised data from several situations, including labeled axes and their own choices for intervals, units of measurement, and frequency of data within each interval. They then share their results in groups.</para>
    </section>
    <section id="id-479271076518">
      <name>Approximate Time</name>
      <para id="id7445403">30 minutes for introduction </para>
      <para id="id8702645">20 minutes for activity (at home or in class)</para>
      <para id="id8702650">10 minutes for discussion </para>
    </section>
    <section id="id-492829436763">
      <name>Classroom Organization</name>
      <para id="id7189438">Whole-class introduction, then individuals, then groups, followed by whole-class discussion</para>
    </section>
    <section id="id-6926994666">
      <name>Materials</name>
      <para id="id8776366">Frequency bar graphs from <emphasis>Time Is Relative, What’s Your Stride?</emphasis>, and <emphasis>Pulse Analysis</emphasis></para>
      <para id="id6569021">Transparencies of the graphs [link to pdf of What’s Normal, p. 1–2]</para>
      
    </section>
    <section id="id-17612788819">
      <name>Doing the Activity</name>
      <para id="id8328403">Before assigning the activity, lead a discussion to introduce the normal distribution. To begin, draw students’ attention to the frequency bar graphs made earlier of the following data sets.</para>
      <list type="bulleted" id="id8574920">
        <item>Timing of five seconds (from <emphasis>Time Is Relative</emphasis>)</item>
        <item>Stride length (from <emphasis>What’s Your Stride?</emphasis>)</item>
        <item>Pulse rates (from <emphasis>Pulse Analysis</emphasis>)</item>
      </list>
      <para id="id7172747">Ask students, <term>What features do these graphs have in common? </term>They will probably focus on two key features.</para>
      <list type="bulleted" id="id8763241">
        <item>The graphs are highest “in the middle.” (Students may or may not use the term <term>mean</term><emphasis>.</emphasis>)</item>
        <item>The graphs gradually go down toward the ends.</item>
      </list>
      <para id="id7987963">Using a diagram like the one below, explain that curves with this general appearance are called <emphasis>bell shaped</emphasis> and that there is a very special bell-shaped curve called the <term>normal distribution</term>. [link to Blackline Masters.doc.]</para>
      <figure id="id7175155">
        <media type="image/jpg" src="graphics1.jpg">
          <param name="height" value="182"/>
          <param name="width" value="346"/>
        </media>
      </figure>
      <para id="id9476796">Bring out the connection between the area under such a curve and the probability of various results. For example, if the shaded area on the next diagram is, say, 20 percent of the total area under the curve, then 20 percent of all measurements are between points <emphasis>a</emphasis> and <emphasis>b</emphasis>. </para>
      <para id="id8352964">Students may recognize that a similar idea applies to frequency bar graphs. Point out the similarity between this shaded area under the curve and the area of a bar in a frequency bar graph. It’s as if the tops of all the bars in a frequency bar graph were connected to draw a smooth curve.</para>
      <figure id="id8413419">
        <media type="image/jpg" src="graphics2.jpg">
          <param name="height" value="197"/>
          <param name="width" value="346"/>
        </media>
      </figure>
      <para id="id7768837">You can also show the next diagram, which depicts three different normal curves on the same set of axes, and ask students what they think the differences indicate. The goal is for them to recognize that the amount of variation from one measurement to another is different in each graph. The exact shape of a normal curve depends on the scales being used and the specific situation.</para>
      <figure id="id8502146">
        <media type="image/jpg" src="graphics3.jpg">
          <param name="height" value="171"/>
          <param name="width" value="333"/>
        </media>
      </figure>
      <para id="id8309376">Tell students that you are giving them a simplified description of the normal distribution. They will not be able to determine for sure whether a data set is normally distributed. The precise definition involves a complex formula for the graph—one that most people encounter only if they study statistics in college. [Link to math maps]</para>
      <para id="id7890891">You may want to clarify that in the term <emphasis>normal distribution,</emphasis> the word “normal” is being used in a special, technical sense. It does not mean “ordinary,” although the normal distribution is one that occurs in many situations.</para>
      <para id="id8080038">Ask, <term>What features do these normal curves have in common? </term>Students should see that, as with the frequency bar graphs under consideration, the normal curves are highest in the middle and decrease gradually toward both ends. Make sure they note one more specific phenomenon:</para>
      <para id="id8642082">
        <emphasis>The normal curve is symmetric.</emphasis>
      </para>
      <para id="id8651212">Introduce the term <term><cnxn document="m15620">line of symmetry</cnxn></term> for the vertical line that divides a normal curve into two equal parts. Then ask, <term>What does the location of the line of symmetry represent?</term> Students should realize that values to the right of the line of symmetry “balance out” values to the left. Help them as needed to use this observation to reach an important conclusion:</para>
      
      <para id="id7806913">
        <emphasis>The line of symmetry represents the mean of the data.</emphasis>
      </para>
      
      <para id="id4910636">If students mention the median (in addition to or instead of the mean), explain that for symmetric data, the <term><cnxn document="m15620">mean</cnxn></term> and <term><cnxn document="m15620">median</cnxn></term> are the same, and perhaps ask why.</para>
      <para id="id3283629">A more subtle observation on the shape of the graph concerns <emphasis>concavity</emphasis>. You can bring this out by asking, <term>What changes in the way the normal curve “curves”? </term>You might use the following diagram to illustrate the ideas. Introduce the terms <emphasis>concave up</emphasis> and <emphasis>concave down</emphasis> to describe the different portions of the curve. </para>
      <figure id="id9531255">
        <media type="image/jpg" src="graphics4.jpg">
          <param name="height" value="224"/>
          <param name="width" value="353"/>
        </media>
      </figure>
      <para id="id7805287">Note that the change of concavity provides an important visual image of standard deviation. For example, the point on the horizontal axis that corresponds to the first point of concavity to the right of the mean is one standard deviation above the mean. The significance of concavity in relation to standard deviation will be discussed in the activity <emphasis>The Best Spread</emphasis>.</para>
      <para id="id8648842">Ask, <term>Do you think the frequency bar graphs of our experimental data resemble the normal distribution?</term> Students’ response may depend on how much data they collected for each experiment and on how they grouped the results. Whatever their response, tell them that if they were to record more and more data, their graphs would probably begin to look more and more like the normal distribution. The normal curve is generally considered a reasonable expectation for results of measurement variation. Then tell them that, based on this general experimental phenomenon, this unit makes the following assumption:</para>
      <para id="id8880396">Normality assumption:<emphasis> If you make many measurements of the period of any given pendulum, the data will closely fit a normal distribution.</emphasis></para>
      
      <para id="id6719266">Post this assumption in the room, as it will be referred to later in the unit.</para>
      <para id="id8714023">Now ask, <term>How does the idea of normal distribution relate to the unit problem?</term> Students should recognize that, according to the normality assumption, the normal distribution describes the kind of measurement variation they should expect in pendulum experiments. Therefore, familiarity with the normal distribution is moving them along in the process of determining which variables are important.</para>
    </section>
    <section id="id-859254999285">
      <name>Discussing and Debriefing the Activity</name>
      <para id="id7118354">In their groups, have students compare the frequency bar graphs they sketched. Then discuss, as a class, which situations they think are normally distributed.</para>
      <para id="id7905428">For Question 1, some students may have arranged the categories so that the tallest frequency bars are in the middle and then concluded that the distribution is approximately normal. If so, point out that a normal distribution requires that the data items be numeric in nature. Tell the class that nonnumeric data, like shoe type, is sometimes called <emphasis>categorical</emphasis><emphasis>data</emphasis>.</para>
      <para id="id8525335">Of Questions 2 through 4, only the situation in Question 2 might be approximately normally distributed (and even that might not be), although students may not have the facts on which to make this judgment.</para>
      <para id="id3576745">For Question 3, bring out the fact that far more people have incomes below the mean than above it (due to the effect on the mean of a small number of people with very high incomes). In particular, this means that the distribution of incomes is not symmetric around the mean. As needed, review that symmetry is one of the key characteristics of the normal distribution. However, income distribution does resemble the normal distribution in at least one respect: It trails off toward the extremes (at least at the upper end). You can review here that in the normal distribution, values farther from the mean are less likely (that is, occur less often) than values closer to the mean.</para>
      <para id="id8414322">For Question 4, help students to see that, assuming either constant or increasing birthrates, the population of different age groups decreases gradually toward the higher age groups. For example, there are generally more people between the ages of 0 and 10 than between 10 and 20, more between 10 and 20 than between 20 and 30, and so on. (States like Hawaii and Florida, which attract many retirees, might be exceptions to this pattern.) </para>
      <para id="id9419944">You might mention that many properties of people and objects are distributed normally or close to normally, but many are not. It isn’t necessarily easy to decide, in theory, which are which.</para>
      <para id="id9419948">You may want to summarize several key aspects of the normal distribution that were brought out in this activity.</para>
      <list type="bulleted" id="id9405314">
        <item>Normally distributed data must be numeric.</item>
        <item>Normally distributed data are symmetric about the mean.</item>
        <item>For normally distributed data, results farther from the mean are less likely than results closer to the mean.</item>
      </list>
      <para id="id8653384">Finally, review the assumption that is being made in this unit: that measurements of a given pendulum’s period are normally distributed.</para>
    </section>
    <section id="id-0575511078456">
      <name>Key Questions</name>
      <para id="id8406002">
        <term>What features do these graphs have in common?</term>
      </para>
      <para id="id8763534">
        <term>What does the location of the line of symmetry represent?</term>
      </para>
      <para id="id8388369">
        <term>What changes in the way the normal curve “curves”?</term>
      </para>
      <para id="id8546368">
        <term>Do you think the frequency bar graphs of our experimental data resemble the normal distribution? </term>
      </para>
      <para id="id8729126">
        <term>How does the idea of normal distribution relate to the unit problem?</term>
      </para>
      <para id="id8521097">
        <term>Which situations do you think are normally distributed? Why?</term>
      </para>
    </section>
  </content>
</document>
