<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE document PUBLIC "-//CNX//DTD CNXML 0.5 plus MathML//EN" "http://cnx.rice.edu/cnxml/0.5/DTD/cnxml_mathml.dtd">
<document xmlns="http://cnx.rice.edu/cnxml" xmlns:md="http://cnx.rice.edu/mdml/0.4" xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/" id="id3710054">
  <name>Sampling and Data: Data</name>
  <metadata>
  <md:version>1.7</md:version>
  <md:created>2008/04/09 13:23:45 GMT-5</md:created>
  <md:revised>2008/07/08 20:41:56.404 GMT-5</md:revised>
  <md:authorlist>
      <md:author id="billowsky">
      <md:firstname>Barbara</md:firstname>
      
      <md:surname>Illowsky</md:surname>
      <md:email>cnx@cnx.org</md:email>
    </md:author>
      <md:author id="sdean">
      <md:firstname>Susan</md:firstname>
      
      <md:surname>Dean</md:surname>
      <md:email>cnx@cnx.org</md:email>
    </md:author>
  </md:authorlist>

  <md:maintainerlist>
    <md:maintainer id="cnxorg">
      <md:firstname/>
      
      <md:surname>Connexions</md:surname>
      <md:email>cnx@cnx.org</md:email>
    </md:maintainer>
  </md:maintainerlist>
  
  <md:keywordlist>
    <md:keyword>Continuous</md:keyword>
    <md:keyword>Counting</md:keyword>
    <md:keyword>Data</md:keyword>
    <md:keyword>Discrete</md:keyword>
    <md:keyword>Measuring</md:keyword>
    <md:keyword>Qualitative</md:keyword>
    <md:keyword>Quantitative</md:keyword>
    <md:keyword>Statistics</md:keyword>
  </md:keywordlist>

  <md:abstract>This module introduces the concepts of qualitative data, quantitative continuous data, and quantitative discrete data as used in statistics.  Sample problems are included.

Note: This module is currently under revision, and its content is subject to change.  This module is being prepared as part of a statistics textbook that will be available for the Fall 2008 semester.</md:abstract>
</metadata>
  <content>
    

      <para id="id7862377">Data may come from a population or from a sample. Small letters like 
<m:math>
  <m:mi>x</m:mi>
</m:math> or <m:math>
  <m:mi>y</m:mi>
</m:math> generally are used to represent data values. Most data can be put into the following categories:  </para>
      <list id="id10607985" type="bulleted"><item>
          Qualitative
        </item>
        <item>Quantitative</item>
      </list>
      <para id="id9602938"><term src="#qual">Qualitative data</term> are the result of categorizing or describing attributes of a population. Hair color, blood type, ethnic group, the car a person drives, and the street a person lives on are examples of qualitative data. Qualitative data are generally described by words or letters. For instance, hair color might be black, dark brown, light brown, blonde, gray, and red. Blood type might be AB+, O-, or B+. Qualitative data are not as widely used as quantitative data because many numerical techniques do not apply to the qualitative data. For example, it does not make sense to find an average hair color or blood type.</para>
      <para id="id3365343"><term src="#quant">Quantitative data</term> are always numbers and are usually the data of choice because there are many methods available for analyzing the data. Quantitative data are the result of <emphasis>counting</emphasis> or <emphasis>measuring</emphasis> attributes of a population. Amount of money, pulse rate, weight, number of people living in your town, and the number of students who take statistics are examples of quantitative data. Quantitative data may be either <term src="#discrrv">discrete</term> or <term src="#continrv">continuous</term>.</para>
      <para id="id9750754">All data that are the result of counting are called <emphasis>quantitative discrete data</emphasis>. These data take on only certain numerical values. If you count the number of phone calls you receive for each day of the week, you might get 0, 1, 2, 3, etc. </para>
      <para id="id5023881">All data that are the result of measuring are <emphasis>quantitative continuous data</emphasis> assuming that we can measure accurately. Measuring angles in radians might result in the numbers <m:math><m:mfrac><m:mi>π</m:mi><m:mn>6</m:mn></m:mfrac> </m:math>, <m:math><m:mfrac><m:mi>π</m:mi><m:mn>2</m:mn></m:mfrac></m:math> , <m:math><m:mfrac><m:mi>3π</m:mi><m:mn>4</m:mn></m:mfrac></m:math> , etc. If you and your friends carry backpacks with books in them to school, the numbers of books in the backpacks are discrete data and the weights of the backpacks are continuous data.</para>
     <example id="onetwo"><name>Data Sample of Quantitative Discrete Data</name><para id="id3406867">The data are the number of books students carry in their backpacks. You sample five students. Two students carry 3 books, one student carries 4 books, one student carries 2 books, and one student carries 1 book. The numbers of books (3, 4, 2, and 1) are the quantitative discrete data.</para>
     </example>
     <example id="onethree"><name>Data Sample of Quantitative Continuous Data</name><para id="id3944554">The data are the weights of the backpacks with the books in it. You sample the same five students. The weights (in pounds) of their backpacks are 6.2, 7, 6.8, 9.1, 4.3. Notice that backpacks carrying three books can have different weights. Weights are quantitative continuous data because weights are measured.</para>
     </example>
     <example id="onefour"><name>Data Sample of Qualitative Data</name><para id="id11979238">The data are the colors of backpacks. Again, you sample the same five students. One student has a red backpack, two students have black backpacks, one student has a green backpack, and one student has a gray backpack. The colors red, black, black, green, and gray are qualitative data. </para>
</example>
      
<note>You may collect data as numbers and report it categorically. For example, the quiz scores for each student are recorded throughout the term. At the end of the term, the quiz scores are reported as A, B, C, D, or F.</note><example id="element-783"><exercise id="element-652"><?solution_in_back?><problem>
		<para id="element-318">Work collaboratively to determine the correct data type (quantitative or qualitative). Indicate whether quantitative data are continuous or discrete. Hint: Data that are discrete often start with the words "the number of."</para><list id="element-861" type="enumerated"><item>The number of pairs of shoes you own.</item>
        <item>The type of car you drive. </item>
        <item>Where you go on vacation.</item>
        <item>The distance it is from your home to the nearest grocery store. </item>
        <item>The number of classes you take per school year.</item>
        <item>The tuition for your classes</item>
        <item>The type of calculator you use. </item>
        <item>Movie ratings.</item>
        <item>Political party preferences.</item>
        <item>Weight of sumo wrestlers.</item>
        <item>Amount of money (in dollars) won playing poker.</item>
        <item>Number of correct answers on a quiz.</item>
        <item>Peoples' attitudes toward the government.</item>
        <item>IQ scores. (This may cause some discussion.)</item>
      </list>
	</problem>
<solution>
<para id="exercise-solutions-1">
Items 1, 5, 11, and 12 are quantitative discrete; items 4, 6, 10, and 14 are quantitative continuous; and items 2, 3, 7, 8, 9, and 13 are qualitative.
</para>
</solution>
</exercise></example>
  </content>

<glossary>

<definition id="continrv">
    <term>Continuous RV</term>
    <meaning>
     A RV with continuous domain. Ex.: height of trees in the forest.
    </meaning>
  </definition>

   <definition id="data">
    <term>Data</term>
    <meaning>
      A set of observations (a set of possible outcomes). Most data can be put into two groups: <emphasis>qualitative</emphasis> (hair color, ethnic groups and many other <emphasis>attributes</emphasis> of population) and <emphasis>quantitative</emphasis> (distance traveled to college, number of children in a family, etc.). In its turn quantitative data can be separated into two subgroups: <emphasis>discrete</emphasis> and <emphasis>continuous</emphasis>. Roughly speaking, data is discrete if it is result of counting (a number of student of the given ethnic group in a class, a number of books on a shelf, etc.), and data is continuous if it is result of measuring (distance traveled, weight of luggage, etc.)
    </meaning>
  </definition>


<definition id="discrrv">
    <term>Discrete RV</term>
    <meaning>
 A RV that can assume only countable set of values. (Ex.’s.: (1). Face nominations of cubic die 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mrow/><m:mo stretchy="false">=</m:mo><m:mrow><m:mo stretchy="false">{</m:mo><m:mn>1,2,3,4,5,6</m:mn><m:mo stretchy="false">}</m:mo></m:mrow></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{ {}= lbrace 1,2,3,4,5,6 rbrace } {}</m:annotation></m:semantics></m:math>, (2). a number of accidents on HW280 at Thanksgiving Holidays).
    </meaning>
  </definition>


<definition id="qual">
    <term>Qualitative Data</term>
    <meaning>

    See <term src="#data">Data</term>.
    </meaning>
  </definition>


<definition id="quant">
    <term>Quantitative</term>
    <meaning>
See <term src="#data">Data</term>.
    </meaning>
  </definition>


</glossary>

</document>
