<?xml version="1.0" encoding="utf-8" standalone="no"?>
<!DOCTYPE document PUBLIC "-//CNX//DTD CNXML 0.5 plus MathML//EN" "http://cnx.rice.edu/cnxml/0.5/DTD/cnxml_mathml.dtd">
<document xmlns="http://cnx.rice.edu/cnxml" xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:md="http://cnx.rice.edu/mdml/0.4" xmlns:bib="http://bibtexml.sf.net/" id="m10214">
  <name>Frequency Polygons</name>

  <metadata>
  <md:version>2.2</md:version>
  <md:created>2001/07/18</md:created>
  <md:revised>2002/11/07 00:00:00.002 US/Central</md:revised>
  <md:authorlist>
      <md:author id="dmlane">
      <md:firstname>David</md:firstname>
      
      <md:surname>Lane</md:surname>
      <md:email>lane@rice.edu</md:email>
    </md:author>
  </md:authorlist>

  <md:maintainerlist>
    <md:maintainer id="jago">
      <md:firstname>Adan</md:firstname>
      
      <md:surname>Galvan</md:surname>
      <md:email>jago@rice.edu</md:email>
    </md:maintainer>
  </md:maintainerlist>
  
  <md:keywordlist>
    <md:keyword>frequency polygons</md:keyword>
    <md:keyword>statistics</md:keyword>
  </md:keywordlist>

  <md:abstract>Introduction to frequency polygons.</md:abstract>
</metadata>


  <content>

    <para id="intro">




      Frequency polygons are a graphical device for understanding the
      shapes of distributions. They serve the same purpose as
      histograms, but are especially helpful in comparing sets of
      data. Frequency polygons are also a good choice for
      displaying<!-- should eventually contain a link to a definition
      --> cumulative frequency distributions.

    </para>

    <para id="first">




      To create a frequency polygon, start just as for histograms, by
      choosing a class interval. Then draw an X-axis representing the
      values of the scores in your data. Mark the middle of each class
      interval with a tick mark, and label it with the middle value
      represented by the class. Draw the Y-axis to indicate the
      frequency of each class. Place a point in the middle of each
      class interval at the height corresponding to its frequency.
      Finally, connect the points. You should include one class
      interval below the lowest value in your data and one above the
      highest value. The graph will then touch the X-axis on both
      sides.

    </para>

    <para id="second">

      A frequency polygon for 642 psychology test scores is shown in
      <cnxn target="figure1"/>. The first label on the X-axis is
      35. This represents an interval extending from 29.5 to
      39.5. Since the lowest test score is 46, this interval has a
      frequency of 0. The point labeled 45 represents the interval
      from 39.5 to 49.5. There are three scores in this
      interval. There are 150 scores in the interval that surrounds
      85.

    </para>

    <para id="third">
      You can easily discern the shape of the distribution from <cnxn target="figure1"/>. Most of the scores are between 65 and
      115. It is clear that the distribution is not symmetric inasmuch
      as good scores (to the right) trail off more gradually than poor
      scores (to the left). In the terminology of Chapter 3 (where we
      will study shapes of distributions more systematically), the
      distribution is <term>skewed</term>.


    </para>

    <figure id="figure1">
      <!-- one of (subfigure media table codeblock) -->
      <media type="image/gif" src="image001.gif"/>
      <caption>Frequency polygon for the psychology test scores.</caption>

    </figure>

    <para id="fourth">




      A cumulative frequency polygon for the same test scores is shown
      in <cnxn target="figure2"/>. The graph is the same as before
      except that the <m:math><m:ci>Y</m:ci></m:math> value for each
      point is the number of students in the corresponding class
      interval plus all numbers in lower intervals. For example, there
      are no scores in the interval labeled "35," three in the
      interval "45,"and 10 in the interval "55."Therefore the
      <m:math><m:ci>Y</m:ci></m:math> value corresponding to "55" is
      13. Since 642 students took the test, the cumulative frequency
      for the last interval is 642.

    </para>

    <figure id="figure2">
      <!-- one of (subfigure media table codeblock) -->
      <media type="image/gif" src="image002.gif"/>
      <caption>Cumulative frequency polygon for the psychology test scores.</caption>
    </figure>


    <para id="fifth">



      Frequency polygons are useful for comparing distributions. This
      is achieved by overlaying the frequency polygons drawn for
      different data sets. <cnxn target="figure3"/> provides an
      example. The data come from a task in which the goal is to move
      a computer mouse to a target on the screen as fast as
      possible. On 20 of the trials, the target was a small rectangle;
      on the other 20, the target was a large rectangle. Time to reach
      the target was recorded on each trial. The two distributions
      (one for each target) are plotted together in <cnxn target="figure3"/>. The figure shows that although there is some
      overlap in times, it generally took longer to move the mouse to
      the small target than to the large one.

    </para>

    <figure id="figure3">

      <media type="image/gif" src="image003.gif"/>
      <caption>Overlaid frequency polygons.</caption>

    </figure>

    <para id="sixth">




      It is also possible to plot two cumulative frequency
      distributions in the same graph. This is illustrated in <cnxn target="figure4"/> using the same data from the mouse task. The
      difference in distributions for the two targets is again
      evident.

    </para>

    <figure id="figure4">
      <media type="image/gif" src="image004.gif"/>
      <caption>Overlaid cumulative frequency polygons.</caption>

    </figure>
<!--
    <section id='demo'>
      <para id='demo1'>

	You might be curious about your own performance in the mouse
	task. Click here to try the task yourself, and to compare your
	times with ours.

      </para>

    </section>
-->
  </content>
</document>
