<?xml version="1.0" encoding="utf-8" standalone="no"?>
<!DOCTYPE document PUBLIC "-//CNX//DTD CNXML 0.5 plus MathML//EN" "http://cnx.rice.edu/technology/cnxml/schema/dtd/0.5/cnxml_mathml.dtd">
<document xmlns="http://cnx.rice.edu/cnxml" xmlns:md="http://cnx.rice.edu/mdml/0.4" xmlns:bib="http://bibtexml.sf.net/" xmlns:m="http://www.w3.org/1998/Math/MathML" id="new">
  <name>Confidence Intervals: Introduction</name>
  <metadata>
  <md:version>1.5</md:version>
  <md:created>2008/06/06 15:20:55 GMT-5</md:created>
  <md:revised>2008/10/01 03:54:28.088 GMT-5</md:revised>
  <md:authorlist>
      <md:author id="billowsky">
      <md:firstname>Barbara</md:firstname>
      
      <md:surname>Illowsky</md:surname>
      <md:email>illowskybarbara@deanza.edu</md:email>
    </md:author>
      <md:author id="sdean">
      <md:firstname>Susan</md:firstname>
      
      <md:surname>Dean</md:surname>
      <md:email>deansusan@deanza.edu</md:email>
    </md:author>
  </md:authorlist>

  <md:maintainerlist>
    <md:maintainer id="cnxorg">
      <md:firstname/>
      
      <md:surname>Connexions</md:surname>
      <md:email>cnx@cnx.org</md:email>
    </md:maintainer>
  </md:maintainerlist>
  
  <md:keywordlist>
    <md:keyword>elementary</md:keyword>
    <md:keyword>statistics</md:keyword>
  </md:keywordlist>

  <md:abstract/>
</metadata>
  <content>
<section id="element-819"><name>Student Learning Objectives</name>
<para id="element-281">
By the end of this chapter, the student should be able to:
</para>

<list id="list12315">
<item>Calculate and interpret confidence intervals for one population
average and one population proportion.</item>
<item>Interpret the student-t probability distribution as the sample size
changes.</item>
<item>Discriminate between problems applying the normal and the
student-t distributions.</item></list></section><section><name>Introduction</name>
    <para id="delete_me">Suppose you are trying to determine the average rent of a two-bedroom apartment
in your town. You might look in the classified section of the newspaper, write
down several rents listed, and average them together. You would have obtained a
point estimate of the true mean. If you are trying to determine the percent of times
you make a basket when shooting a basketball, you might count the number of
shots you make and divide that by the number of shots you attempted. In this
case, you would have obtained a point estimate for the true proportion.</para><para id="element-550">We use sample data to make generalizations about an unknown population. This
part of statistics is called <term>"inferential statistics."</term> The sample data help us to
make estimates of population parameters. We realize that the point estimate is
most likely not the exact value of the <emphasis>population parameter</emphasis>, but close to it. After
calculating point estimates, we construct confidence intervals in which we believe
the parameter lies.</para><para id="element-667">In this chapter, you will learn to construct and interpret confidence intervals. You
will also learn a new distribution, the Student-t, and how it is used with these
intervals.</para><para id="element-560">If you worked in the marketing department of an entertainment company, you
might be interested in the average number of compact discs (CD's) a consumer
buys per month. If so, you could conduct a survey and calculate the sample
average, <m:math><m:apply>
  <m:conjugate/>
  <m:ci>x</m:ci>
</m:apply></m:math>, and the sample standard deviation, <m:math><m:mi>s</m:mi></m:math>. You would use <m:math><m:apply>
  <m:conjugate/>
  <m:ci>x</m:ci>
</m:apply></m:math>

to estimate
the population mean and <m:math><m:mi>s</m:mi></m:math> to estimate the population standard deviation. The
sample mean, <m:math><m:apply>
  <m:conjugate/>
  <m:ci>x</m:ci>
</m:apply></m:math>, is the <term>point estimate</term> for the population mean, <m:math><m:mi>μ</m:mi></m:math>. The sample
standard deviation, <m:math><m:mi>s</m:mi></m:math>, is the point estimate for the population standard deviation,
<m:math><m:mi>σ</m:mi></m:math>.</para><para id="element-510">A <term src="#coninter">confidence interval</term> is another type of estimate but, instead of being just one
number, it is an interval of numbers. Suppose for the CD example we do not
know the population mean <m:math><m:mi>μ</m:mi></m:math> but we do know that the population standard
deviation is <m:math><m:mi>σ</m:mi><m:mo>=</m:mo><m:mn>1</m:mn></m:math> and our sample size is 100. Then by the Central Limit
Theorem, the standard deviation for the sample mean is</para><para id="element-624"><m:math>
<m:mfrac>
<m:mi>σ</m:mi>
<m:mrow>
<m:msqrt>
<m:mi>n</m:mi>
</m:msqrt>
</m:mrow>
</m:mfrac>
<m:mo>=</m:mo>
<m:mfrac>
<m:mn>1</m:mn>
<m:mrow>
<m:msqrt>
<m:mn>100</m:mn>
</m:msqrt>
</m:mrow>
</m:mfrac>
<m:mo>=</m:mo>
<m:mn>0.1</m:mn>
</m:math>.</para><para id="element-730">The <emphasis>Empirical Rule</emphasis>, which applies to bell-shaped distributions, says that in
approximately 95% of the samples, the sample mean, <m:math>
<m:apply>
  <m:conjugate/>
  <m:ci>x</m:ci>
</m:apply>
</m:math>, will be within two standard
deviations of the population mean <m:math><m:mi>μ</m:mi></m:math>. For our CD example, two standard deviations
is <m:math><m:mi>(2)(0.1) = 0.2</m:mi></m:math>. The sample mean <m:math>
<m:apply>
  <m:conjugate/>
  <m:ci>x</m:ci>
</m:apply>
</m:math>

is within 0.2 units of <m:math><m:mi>μ</m:mi></m:math>.</para><para id="element-457">Because <m:math>
<m:apply>
  <m:conjugate/>
  <m:ci>x</m:ci>
</m:apply>
</m:math>

is within 0.2 units of <m:math><m:mi>μ</m:mi></m:math>, which is unknown, then <m:math><m:mi>μ</m:mi></m:math> is within 0.2 units
of <m:math>
<m:apply>
  <m:conjugate/>
  <m:ci>x</m:ci>
</m:apply>
</m:math>

in 95% of the samples. The population mean <m:math><m:mi>μ</m:mi></m:math> is contained in an interval
whose lower number is calculated by taking the sample mean and subtracting
two standard deviations (<m:math><m:mo>(</m:mo><m:mn>2</m:mn><m:mo>)</m:mo><m:mo>(</m:mo><m:mn>0.1</m:mn><m:mo>)</m:mo></m:math>) and whose upper number is calculated by
taking the sample mean and adding two standard deviations. In other words, <m:math><m:mi>μ</m:mi></m:math>
is between <m:math>
<m:apply>
  <m:conjugate/>
  <m:ci>x</m:ci>
</m:apply>
<m:mo>-</m:mo>
<m:mn>0.2</m:mn>
</m:math>

and 
<m:math>
<m:apply>
  <m:conjugate/>
  <m:ci>x</m:ci>
</m:apply>
<m:mo>+</m:mo>
<m:mn>0.2</m:mn>
</m:math>
in 95% of all the samples.</para><para id="element-433">For the CD example, suppose that a sample produced a sample mean <m:math>
<m:apply>
  <m:conjugate/>
  <m:ci>x</m:ci>
</m:apply>
<m:mo>=</m:mo>
<m:mn>2</m:mn>
</m:math>. Then the
unknown population mean <m:math><m:mi>μ</m:mi></m:math> is between</para><para id="element-501"><m:math>
<m:apply>
  <m:conjugate/>
  <m:ci>x</m:ci>
</m:apply>
<m:mo>-</m:mo>
<m:mn>0.2</m:mn>
<m:mo>=</m:mo>
<m:mn>2</m:mn>
<m:mo>-</m:mo>
<m:mn>0.2</m:mn>
<m:mo>=</m:mo>
<m:mn>1.8</m:mn>
</m:math>

and



<m:math>
<m:apply>
  <m:conjugate/>
  <m:ci>x</m:ci>
</m:apply>
<m:mo>+</m:mo>
<m:mn>0.2</m:mn>
<m:mo>=</m:mo>
<m:mn>2</m:mn>
<m:mo>+</m:mo>
<m:mn>0.2</m:mn>
<m:mo>=</m:mo>
<m:mn>2.2</m:mn>
</m:math>

</para><para id="element-567">We say that we are <emphasis>95% confident</emphasis> that the unknown population mean number of CDs
is between 1.8 and 2.2. <emphasis>The 95% confidence interval is (1.8, 2.2).</emphasis></para><para id="element-484">The 95% confidence interval implies two possibilities. Either the interval (1.8, 2.2)
contains the true mean <m:math><m:mi>μ</m:mi></m:math> or our sample produced an <m:math>
<m:apply>
  <m:conjugate/>
  <m:ci>x</m:ci>
</m:apply>
</m:math>

that is not within 0.2 units of
the true mean <m:math><m:mi>μ</m:mi></m:math>. The second possibility happens for only 5% of all the samples
(100% - 95%).</para><para id="element-39">Remember that a confidence interval is created for an unknown population parameter
like the population mean, <m:math><m:mi>μ</m:mi></m:math>. A confidence interval has the form</para><para id="element-75"><emphasis>(point estimate - margin of error, point estimate + margin of error)</emphasis></para><para id="element-485">The margin of error depends on the confidence level or percentage of confidence.</para></section><section id="element-637"><name>Optional Collaborative Classroom Activity</name>
<para id="element-906">Have your instructor record the number of meals each student in your class eats
out in a week. Assume that the standard deviation is known to be 3 meals.
Construct an approximate 95% confidence interval for the true average number of
meals students eat out each week.
</para><list id="element-997" type="enumerated"><item>Calculate the sample mean.</item>

<item>
<m:math>
<m:mi>σ</m:mi>
<m:mo>=</m:mo>
<m:mn>3</m:mn>
</m:math> and <m:math>
<m:mi>n</m:mi>
<m:mo>=</m:mo>
</m:math> the number of students surveyed.</item>

<item>Construct the interval 
<m:math>
<m:mo>(</m:mo>
<m:apply>
  <m:conjugate/>
  <m:ci>x</m:ci>
</m:apply>
<m:mo>-</m:mo>
<m:mn>2</m:mn>
<m:mo>⋅</m:mo>
<m:mfrac>
<m:mi>σ</m:mi>
<m:mrow>
<m:msqrt>
<m:mi>n</m:mi>
</m:msqrt>
</m:mrow>
</m:mfrac>
<m:mo>,</m:mo>
<m:apply>
  <m:conjugate/>
  <m:ci>x</m:ci>
</m:apply>
<m:mo>+</m:mo>
<m:mn>2</m:mn>
<m:mo>⋅</m:mo>
<m:mfrac>
<m:mi>σ</m:mi>
<m:mrow>
<m:msqrt>
<m:mi>n</m:mi>
</m:msqrt>
</m:mrow>
</m:mfrac>
<m:mo>)</m:mo>
</m:math>
</item>
</list><para id="element-927">We say we are approximately 95% confident that the true average number of meals that
students eat out in a week is between __________ and ___________.</para></section>   
  </content>
  <glossary>
<definition id="coninter">
    <term>Confidential Interval</term>
    <meaning>
  An interval estimate for unknown population parameter. This depends on: 
<list type="bulleted" id="confint1">
<item>The desired confidence level.</item> <item>What is known for the distribution information (for ex., known variance).</item><item>Gathering from the sampling information.</item></list>
    </meaning>
  </definition>

</glossary>
</document>
