<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE document PUBLIC "-//CNX//DTD CNXML 0.5 plus MathML//EN" "http://cnx.rice.edu/cnxml/0.5/DTD/cnxml_mathml.dtd">
<document xmlns="http://cnx.rice.edu/cnxml" xmlns:md="http://cnx.rice.edu/mdml/0.4" xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/" id="id6812780">
  <name>Sampling and Data: Practice 1</name>
  <metadata>
  <md:version>1.9</md:version>
  <md:created>2008/03/31 15:50:37 GMT-5</md:created>
  <md:revised>2008/07/08 20:55:34.448 GMT-5</md:revised>
  <md:authorlist>
      <md:author id="billowsky">
      <md:firstname>Barbara</md:firstname>
      
      <md:surname>Illowsky</md:surname>
      <md:email>cnx@cnx.org</md:email>
    </md:author>
      <md:author id="sdean">
      <md:firstname>Susan</md:firstname>
      
      <md:surname>Dean</md:surname>
      <md:email>cnx@cnx.org</md:email>
    </md:author>
  </md:authorlist>

  <md:maintainerlist>
    <md:maintainer id="cnxorg">
      <md:firstname/>
      
      <md:surname>Connexions</md:surname>
      <md:email>cnx@cnx.org</md:email>
    </md:maintainer>
  </md:maintainerlist>
  
  <md:keywordlist>
    <md:keyword>data</md:keyword>
    <md:keyword>frequency</md:keyword>
    <md:keyword>practice</md:keyword>
    <md:keyword>sampling</md:keyword>
    <md:keyword>statistics</md:keyword>
  </md:keywordlist>

  <md:abstract>This module provides an opportunity for students to practice concepts related to statistical sampling and data.  Given a sample data set, the student will practice constructing frequency tables, differentiating between key terms, and comparing sampling techniques.

Note: This module is currently under revision, and its content is subject to change.  This module is being prepared as part of a statistics textbook that will be available for the Fall 2008 semester.</md:abstract>
</metadata>
  <content>
    

 

        <section id="element-634"><name>Student Learning Outcomes</name>
<list id="id6060503" type="bulleted"><item>The student will practice constructing frequency tables.</item>
          <item>The student will differentiate between key terms.</item>
          <item>The student will compare sampling techniques.</item>
        </list>
 </section>
      <section id="id-675245136435">
        <name>Given</name>
        <para id="id6060529">Studies are often done by pharmaceutical companies to determine the effectiveness of a treatment program. Suppose that a new AIDS antibody drug is currently under study. It is given to patients once the AIDS symptoms have revealed themselves. Of interest is the average length of time in months patients live once starting the treatment. Two researchers each follow a different set of 40 AIDS patients from the start of treatment until their deaths. The following data (in months) are collected.</para>
        <para id="element-388"><list id="set-element-743" type="inline"><name>Researcher 1</name><item>3</item>
<item>4</item>
<item>11</item>
<item>15</item>
<item>16</item>
<item>17</item>
<item>22</item>
<item>44</item>
<item>37</item>
<item>16</item>
<item>14</item>
<item>24</item>
<item>25</item>
<item>15</item>
<item>26</item>
<item>27</item>
<item>33</item>
<item>29</item>
<item>35</item>
<item>44</item>
<item>13</item>
<item>21</item>
<item>22</item>
<item>10</item>
<item>12</item>
<item>8</item>
<item>40</item>
<item>32</item>
<item>26</item>
<item>27</item>
<item>31</item>
<item>34</item>
<item>29</item>
<item>17</item>
<item>8</item>
<item>24</item>
<item>18</item>
<item>47</item>
<item>33</item>
<item>34</item>
</list></para><para id="element-792"><list id="set-element-556" type="inline"><name>Researcher 2</name><item>3</item>
<item>14</item>
<item>11</item>
<item>5</item>
<item>16</item>
<item>17</item>
<item>28</item>
<item>41</item>
<item>31</item>
<item>18</item>
<item>14</item>
<item>14</item>
<item>26</item>
<item>25</item>
<item>21</item>
<item>22</item>
<item>31</item>
<item>2</item>
<item>35</item>
<item>44</item>
<item>23</item>
<item>21</item>
<item>21</item>
<item>16</item>
<item>12</item>
<item>18</item>
<item>41</item>
<item>22</item>
<item>16</item>
<item>25</item>
<item>33</item>
<item>34</item>
<item>29</item>
<item>13</item>
<item>18</item>
<item>24</item>
<item>23</item>
<item>42</item>
<item>33</item>
<item>29</item>
</list></para>
      </section>


      <section id="id-257214317573">
        <name>Organize the Data</name>
        <para id="id6060582">Complete the tables below using the data provided.</para>
        <table id="id6060586">
<?table-summary This table provides a blank template for calculating the results of a study using the data set provided.  For each survival length range provided in the first column, students are to calculate and write down the frequency (second column), relative frequency (third column), and cumulative relative frequency (fourth column).?>
<name>Researcher 1</name>
<tgroup cols="4"><colspec colnum="1" colname="header_c1" colwidth="2*"/>
            <colspec colnum="2" colname="c2" colwidth="1*"/>
            <colspec colnum="3" colname="c3" colwidth="2*"/>
            <colspec colnum="4" colname="c4" colwidth="2*"/>
            <thead>
              <row>
                <entry>Survival Length (in months)</entry>
                <entry>Frequency</entry>
                <entry>Relative Frequency</entry>
                <entry>Cumulative Rel. Frequency</entry>
              </row>

            </thead>
            <tbody>
              <row>
                <entry>0.5 - 6.5</entry>
                <entry/>
                <entry/>
                <entry/>
              </row>
              <row>
                <entry>6.5 - 12.5</entry>
                <entry/>
                <entry/>
                <entry/>
              </row>
              <row>
                <entry>12.5 - 18.5</entry>
                <entry/>
                <entry/>
                <entry/>
              </row>
              <row>
                <entry>18.5 - 24.5</entry>
                <entry/>
                <entry/>
                <entry/>
              </row>
              <row>
                <entry>24.5 - 30.5</entry>
                <entry/>
                <entry/>
                <entry/>
              </row>
              <row>
                <entry>30.5 - 36.5</entry>
                <entry/>
                <entry/>
                <entry/>
              </row>
              <row>
                <entry>36.5 - 42.5</entry>
                <entry/>
                <entry/>
                <entry/>
              </row>
              <row>
                <entry>42.5 - 48.5</entry>
                <entry/>
                <entry/>
                <entry/>
              </row>
            </tbody>
          




</tgroup>
</table>
        <table id="id6990964">
<?table-summary A duplicate of the previous table, this table provides a blank template for calculating the results of a study using the data set provided.  For each survival length range provided in the first column, students are to calculate and write down the frequency (second column), relative frequency (third column), and cumulative relative frequency (fourth column).?>
<name>Researcher 2</name>
<tgroup cols="4"><colspec colnum="1" colname="header_c1" colwidth="2*"/>
            <colspec colnum="2" colname="c2" colwidth="1*"/>
            <colspec colnum="3" colname="c3" colwidth="2*"/>
            <colspec colnum="4" colname="c4" colwidth="2*"/>
            <thead>
              <row>
                <entry>Survival Length (in months)</entry>
                <entry>Frequency</entry>
                <entry>Relative Frequency</entry>
                <entry>Cumulative Rel. Frequency</entry>
              </row>

             </thead>
             <tbody>
              <row>
                <entry>0.5 - 6.5</entry>
                <entry/>
                <entry/>
                <entry/>
              </row>
              <row>
                <entry>6.5 - 12.5</entry>
                <entry/>
                <entry/>
                <entry/>
              </row>
              <row>
                <entry>12.5 - 18.5</entry>
                <entry/>
                <entry/>
                <entry/>
              </row>
              <row>
                <entry>18.5 - 24.5</entry>
                <entry/>
                <entry/>
                <entry/>
              </row>
              <row>
                <entry>24.5 - 30.5</entry>
                <entry/>
                <entry/>
                <entry/>
              </row>
              <row>
                <entry>30.5 - 36.5</entry>
                <entry/>
                <entry/>
                <entry/>
              </row>
              <row>
                <entry>36.5 - 42.5</entry>
                <entry/>
                <entry/>
                <entry/>
              </row>
              <row>
                <entry>42.5 - 48.5</entry>
                <entry/>
                <entry/>
                <entry/>
              </row>
            </tbody>
          


</tgroup>
</table>
      </section>



      <section id="id-235615853556"><name>Key Terms</name>

        <para id="id606052900">Define the key terms based upon the above example for Researcher 1.</para>

          <exercise id="exerciseone">
<problem>
<para id="prob_1">Population</para>
</problem>
</exercise>
          <exercise id="exercisetwo">
<problem>
<para id="prob_2">Sample</para>
</problem>
</exercise>
          <exercise id="exercisethree">
<problem>
<para id="prob_3">Parameter</para>
</problem>
</exercise>
          <exercise id="exercisefour">
<problem>
<para id="prob_4">Statistic</para>
</problem>
</exercise>
          <exercise id="exercisefive">
<problem>
<para id="prob_5">Variable</para>
</problem>
</exercise>
          <exercise id="exercisesix">
<problem>
<para id="prob_6">Data</para>
</problem>
</exercise>

      </section>

      <section id="id-448369511773"><name>Discussion Questions</name>

        <para id="id606052901">Discuss the following questions and then answer in complete sentences.</para>
        
          <exercise id="exerciseseven">
<problem>
<para id="prob_7">List two reasons why the data may differ.</para></problem></exercise>
          <exercise id="exerciseeight">
<problem>
<para id="prob_8">Can you tell if one researcher is correct and the other one is incorrect? Why?</para></problem></exercise>
          <exercise id="exercisenine">
<problem>
<para id="prob_9">Would you expect the data to be identical? Why or why not?</para></problem></exercise>
          <exercise id="exerciseten">
<problem>
<para id="prob_10">How could the researchers gather random data?</para></problem></exercise>



          <exercise id="exerciseeleven">
<problem>
<para id="prob_11">Suppose that the first researcher conducted his survey by randomly choosing one state in the nation and then randomly picking 40 patients from that state. What sampling method would that researcher have used?</para></problem></exercise>
          <exercise id="exercisetwelve">
<problem>
<para id="prob_12">Suppose that the second researcher conducted his survey by choosing 40 patients he knew. What sampling method would that researcher have used? What concerns would you have about this data set, based upon the data collection method?</para></problem></exercise>
        
      </section>
  </content>
</document>
