<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE document PUBLIC "-//CNX//DTD CNXML 0.5 plus MathML//EN" "http://cnx.rice.edu/cnxml/0.5/DTD/cnxml_mathml.dtd">
<document xmlns="http://cnx.rice.edu/cnxml" xmlns:md="http://cnx.rice.edu/mdml/0.4" xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/" id="id6723023">
  <name>Sampling and Data: Statistics (edited: Teegarden)</name>
  <metadata>
  <md:version>1.1</md:version>
  <md:created>2008/08/12 16:44:43.356 GMT-5</md:created>
  <md:revised>2008/08/12 16:46:05.009 GMT-5</md:revised>
  <md:authorlist>
      <md:author id="tteegard">
      <md:firstname>Mary</md:firstname>
      <md:othername>T</md:othername>
      <md:surname>Teegarden</md:surname>
      <md:email>tteegard@sdccd.edu</md:email>
    </md:author>
  </md:authorlist>

  <md:maintainerlist>
    <md:maintainer id="tteegard">
      <md:firstname>Mary</md:firstname>
      <md:othername>T</md:othername>
      <md:surname>Teegarden</md:surname>
      <md:email>tteegard@sdccd.edu</md:email>
    </md:maintainer>
  </md:maintainerlist>
  
  <md:keywordlist>
    <md:keyword>data</md:keyword>
    <md:keyword>descriptive</md:keyword>
    <md:keyword>dot plot</md:keyword>
    <md:keyword>inferential</md:keyword>
    <md:keyword>statistics</md:keyword>
  </md:keywordlist>

  <md:abstract>This module introduces the concept of statistics, specifically the ability to use statistics to describe data (descriptive statistics) as well as draw conclusions (inferential statistics).  An optional classroom exercise is included. Labs changed to incorporate mini-tabs.</md:abstract>
</metadata>
  <content>
    
      <para id="id8112751">The science of <term src="#stat">statistics</term> deals with the collection, analysis, interpretation, and presentation of <term src="#data">data</term>. We see and use data in our everyday lives. To be able to use data correctly is essential to many professions and is in your own best self-interest.</para>
      <section id="id-652244882543">
        <name>Example</name>
        <para id="id7763788">Suppose class members write down the average time (in hours, to the nearest half-hour) they sleep per night. Then create a simple graph (called a <emphasis>dot plot</emphasis>) of the data. A dot plot consists of a number line and dots (or points) positioned above the number line. For example, consider the following data:</para>
        <para id="element-128"><list id="set-element-266" type="inline"><item>5</item>
<item>5.5</item>
<item>6</item>
<item>6</item>
<item>6</item>
<item>6.5</item>
<item>6.5</item>
<item>6.5</item>
<item>6.5</item>
<item>7</item>
<item>7</item>
<item>8</item>
<item>8</item>
<item>9</item></list></para><para id="element-534">The dot plot for this data would be as follows:</para><figure id="element-970"><name>Frequency of Average Time (in Hours) Spent Sleeping per Night</name><media type="image/jpeg" src="m16020_DotPlot.png">
<param name="alt" value="Dot plot with hours of sleep on the X-axis and frequency on Y-axis"/>
<param name="longdesc" value="m16020_DotPlot_description.html"/>

<param name="print-width" value="3in"/>

</media></figure>
        
</section>
        
        <para id="id6043607">In this course, you will learn how to organize and summarize data. Organizing and summarizing data is called <emphasis>descriptive statistics</emphasis>. Two ways to summarize data are by graphing and by numbers (for example, finding an average). After you have studied probability and probability distributions, you will use formal methods for drawing conclusions from "good" data. The formal methods are called <emphasis>inferential statistics</emphasis>. Statistical inference uses probability to determine if conclusions drawn are reliable or not.</para>
        <para id="id9131775">Effective interpretation of data (inference) is based on good procedures for producing data and thoughtful examination of the data.  You will encounter what will seem to be too many mathematical formulas for interpreting data.  The goal of statistics is not to perform numerous calculations using the formulas, but to gain an understanding of your data.  The calculations can be done using a calculator or a computer.  The understanding must come from you.  If you can thoroughly grasp the basics of statistics, you can be more confident in the decisions you make in life.</para>
    
  </content>
<glossary>
<definition id="data">
    <term>Data</term>
    <meaning>
      A set of observations (a set of possible outcomes). Most data can be put into two groups: <emphasis>qualitative</emphasis> (hair color, ethnic groups and many other <emphasis>attributes</emphasis> of population) and <emphasis>quantitative</emphasis> (distance traveled to college, number of children in a family, etc.). In its turn quantitative data can be separated into two subgroups: <emphasis>discrete</emphasis> and <emphasis>continuous</emphasis>. Roughly speaking, data is discrete if it is result of counting (a number of student of the given ethnic group in a class, a number of books on a shelf, etc.), and data is continuous if it is result of measuring (distance traveled, weight of luggage, etc.)
    </meaning>
  </definition>
<definition id="stat">
    <term>Statistic</term>
    <meaning>
A numerical characteristic of the sample. Statistic estimates the corresponding population parameter. For example, the average number of full-time students in a 7:30 a.m. class for this term (statistic) is an estimate for the average number of full-time students in any class this term (parameter).
    </meaning>
  </definition>

</glossary>
</document>
