<?xml version="1.0" encoding="utf-8" standalone="no"?>
<!DOCTYPE document PUBLIC "-//CNX//DTD CNXML 0.5 plus MathML//EN" "http://cnx.rice.edu/technology/cnxml/schema/dtd/0.5/cnxml_mathml.dtd">
<document xmlns="http://cnx.rice.edu/cnxml" xmlns:md="http://cnx.rice.edu/mdml/0.4" xmlns:bib="http://bibtexml.sf.net/" xmlns:m="http://www.w3.org/1998/Math/MathML" id="new">
  <name>Confidence Intervals: Confidence Interval for a Population Proportion</name>
  <metadata>
  <md:version>1.5</md:version>
  <md:created>2008/06/06 16:38:08 GMT-5</md:created>
  <md:revised>2008/07/18 15:20:39.925 GMT-5</md:revised>
  <md:authorlist>
      <md:author id="billowsky">
      <md:firstname>Barbara</md:firstname>
      
      <md:surname>Illowsky</md:surname>
      <md:email>illowskybarbara@deanza.edu</md:email>
    </md:author>
      <md:author id="sdean">
      <md:firstname>Susan</md:firstname>
      
      <md:surname>Dean</md:surname>
      <md:email>deansusan@deanza.edu</md:email>
    </md:author>
  </md:authorlist>

  <md:maintainerlist>
    <md:maintainer id="cnxorg">
      <md:firstname/>
      
      <md:surname>Connexions</md:surname>
      <md:email>cnx@cnx.org</md:email>
    </md:maintainer>
  </md:maintainerlist>
  
  <md:keywordlist>
    <md:keyword>elementary</md:keyword>
    <md:keyword>statistics</md:keyword>
  </md:keywordlist>

  <md:abstract/>
</metadata>
  <content>
    <para id="delete_me">During an election year, we see articles in the newspaper that state <term src="#coninter">confidence intervals</term>
in terms of proportions or percentages. For example, a poll for a particular
candidate running for president might show that the candidate has 40% of the vote
within 3 percentage points. Often, election polls are calculated with 95% confidence.
So, the pollsters would be 95% confident that the true proportion of voters who
favored the candidate would be between 0.37 and 0.43 
<m:math>
<m:mo>(</m:mo>
<m:mn>0.40</m:mn>
<m:mo>-</m:mo>
<m:mn>0.03</m:mn>
<m:mo>,</m:mo>
<m:mn>0.40</m:mn>
<m:mo>+</m:mo>
<m:mn>0.03</m:mn>
<m:mo>)</m:mo>
</m:math>.</para><para id="element-292">Investors in the stock market are interested in the true proportion of stocks that go up
and down each week. Businesses that sell personal computers are interested in the
proportion of households in the United States that own personal computers.
Confidence intervals can be calculated for the true proportion of stocks that go up or
down each week and for the true proportion of households in the United States that
own personal computers.</para><para id="element-918">The procedure to find the confidence interval, the sample size, the <term src="ebpbound">error bound,</term> and
the <term src="#conflevel">confidence level</term> for a proportion is similar to that for the population mean. The
formulas are different.</para><para id="element-574"><emphasis>How do you know you are dealing with a proportion problem?</emphasis> First, the
underlying <term src="#bidist">distribution is binomial</term>. (There is no mention of a mean or average.) If
<m:math><m:mi>X</m:mi></m:math> is a binomial random variable, then <m:math><m:mi>X</m:mi><m:mo>~</m:mo><m:mi>B</m:mi><m:mo>(</m:mo><m:mi>n</m:mi><m:mo>,</m:mo><m:mi>p</m:mi><m:mo>) </m:mo></m:math> where <m:math><m:mi>n</m:mi></m:math> = the number of trials
and
<m:math><m:mi>p</m:mi></m:math> = the probability of a success. To form a proportion, take <m:math><m:mi>X</m:mi></m:math>, the random
variable for the number of successes and divide it by <m:math><m:mi>n</m:mi></m:math>, the number of trials (or the
sample size). The random variable <m:math><m:mi>P</m:mi><m:mo>'</m:mo></m:math> (read "P prime") is that proportion,
</para><para id="element-771"><m:math><m:mi>P</m:mi><m:mo>'</m:mo><m:mo>=</m:mo><m:mfrac><m:mrow><m:mi>X</m:mi></m:mrow><m:mrow><m:mi>n</m:mi></m:mrow></m:mfrac><m:mspace width="20pt"/></m:math> </para><para id="element-596">(Sometimes the random variable is <m:math><m:mover><m:mi>P</m:mi><m:mo>̂</m:mo></m:mover></m:math>, read "P hat".)</para><para id="element-959">When <m:math><m:mi>n</m:mi></m:math> is large, we can use the <term src="#normdist">normal distribution</term> to approximate the binomial.</para><para id="element-813"><m:math>
<m:mi>X</m:mi></m:math> ~
<m:math>
<m:mi>N</m:mi>
<m:mo>(</m:mo>
<m:mi>n</m:mi>
<m:mo>⋅</m:mo>
<m:mi>p</m:mi>
<m:mo>,</m:mo>
<m:msqrt>
<m:mi>n</m:mi>
<m:mo>⋅</m:mo>
<m:mi>p</m:mi>
<m:mo>⋅</m:mo>
<m:mi>q</m:mi>
</m:msqrt>
<m:mo>)</m:mo>
</m:math></para><para id="element-76">If we divide all values of the random variable by <m:math><m:mi>n</m:mi></m:math>, the mean by <m:math><m:mi>n</m:mi></m:math>, and the standard
deviation by <m:math><m:mi>n</m:mi></m:math>, we get a normal distribution of proportions with <m:math><m:mi>P</m:mi><m:mo>'</m:mo></m:math>, called the
estimated proportion, as the random variable. (Recall that a proportion = the
number of successes divided by <m:math><m:mi>n</m:mi></m:math>.)</para><para id="element-461"><m:math>
<m:mfrac>
<m:mrow>
<m:mi>X</m:mi>
</m:mrow>
<m:mrow>
<m:mi>n</m:mi>
</m:mrow>
</m:mfrac>
<m:mo>=</m:mo>
<m:mi>P</m:mi>
<m:mo>'</m:mo></m:math> ~
<m:math>
<m:mi>N</m:mi>
<m:mo>(</m:mo>
<m:mfrac>
<m:mrow>
<m:mi>n</m:mi>
<m:mo>⋅</m:mo>
<m:mi>p</m:mi>
</m:mrow>
<m:mrow>
<m:mi>n</m:mi>
</m:mrow>
</m:mfrac>
<m:mo>,</m:mo>
<m:mfrac>
<m:mrow>
<m:msqrt>
<m:mi>n</m:mi>
<m:mo>⋅</m:mo>
<m:mi>p</m:mi>
<m:mo>⋅</m:mo>
<m:mi>q</m:mi>
</m:msqrt>
</m:mrow>
<m:mrow>
<m:mi>n</m:mi>
</m:mrow>
</m:mfrac>
<m:mo>)</m:mo>
</m:math>
</para><para id="element-894">By algebra,
<m:math>
<m:mfrac>
<m:mrow>
<m:msqrt>
<m:mi>n</m:mi>
<m:mo>⋅</m:mo>
<m:mi>p</m:mi>
<m:mo>⋅</m:mo>
<m:mi>q</m:mi>
</m:msqrt>
</m:mrow>
<m:mrow>
<m:mi>n</m:mi>
</m:mrow>
</m:mfrac>
<m:mo>=</m:mo>
<m:msqrt>
<m:mfrac>
<m:mrow>
<m:mi>p</m:mi>
<m:mo>⋅</m:mo>
<m:mi>q</m:mi>
</m:mrow>
<m:mrow>
<m:mi>n</m:mi>
</m:mrow>
</m:mfrac>
</m:msqrt>
</m:math></para><para id="element-683"><emphasis><m:math><m:mi>P</m:mi><m:mo>'</m:mo></m:math> follows a normal distribution for proportions</emphasis>:
<m:math>
<m:mi>P</m:mi>
<m:mo>'</m:mo></m:math> ~
<m:math>
<m:mi>N</m:mi>
<m:mo>(</m:mo>
<m:mi>p</m:mi>
<m:mo>,</m:mo>
<m:msqrt>
<m:mfrac>
<m:mrow>
<m:mi>p</m:mi>
<m:mo>⋅</m:mo>
<m:mi>q</m:mi>
</m:mrow>
<m:mrow>
<m:mi>n</m:mi>
</m:mrow>
</m:mfrac>
</m:msqrt>
<m:mo>)</m:mo>
</m:math></para><para id="element-716">The confidence interval has the form <emphasis><m:math><m:mo>(</m:mo><m:mi>p</m:mi><m:mo>'</m:mo><m:mo>-</m:mo><m:mtext>EBP</m:mtext><m:mo>,</m:mo><m:mi>p</m:mi><m:mo>'</m:mo><m:mo>+</m:mo><m:mtext>EBP</m:mtext><m:mo>)</m:mo></m:math></emphasis>.</para><para id="element-35"><m:math>
<m:mi>p</m:mi>
<m:mo>'</m:mo>
<m:mo>=</m:mo>
<m:mfrac>
<m:mrow>
<m:mi>x</m:mi>
</m:mrow>
<m:mrow>
<m:mi>n</m:mi>
</m:mrow>
</m:mfrac>
</m:math>
</para><para id="element-142"><m:math>
<m:mi>p</m:mi>
<m:mo>'</m:mo>
</m:math> = the <emphasis>estimated proportion</emphasis> of successes (<m:math><m:mi>p</m:mi><m:mo>'</m:mo></m:math> is a <emphasis>point estimate</emphasis> for <m:math><m:mi>p</m:mi></m:math>, the true proportion) </para><para id="element-818"><m:math><m:mi>x</m:mi></m:math> = the <emphasis>number</emphasis> of successes. </para><para id="element-434"><m:math><m:mi>n</m:mi></m:math> = the size of the sample</para><para id="element-498"><emphasis>The error bound for a proportion is</emphasis>
</para><para id="element-249"><m:math>
<m:mtext>EBP</m:mtext>
<m:mo>=</m:mo>
<m:msub>
<m:mi>z</m:mi>
<m:mfrac>
<m:mrow>
<m:mi>α</m:mi>
</m:mrow>
<m:mrow>
<m:mn>2</m:mn>
</m:mrow>
</m:mfrac>
</m:msub>
<m:mo>⋅</m:mo>
<m:msqrt>
<m:mfrac>
<m:mrow>
<m:mi>p</m:mi>
<m:mo>'</m:mo>
<m:mo>⋅</m:mo>
<m:mi>q</m:mi>
<m:mo>'</m:mo>
</m:mrow>
<m:mrow>
<m:mi>n</m:mi>
</m:mrow>
</m:mfrac>
</m:msqrt>
<m:mspace width="20pt"/>
<m:mi>q</m:mi>
<m:mo>'</m:mo>
<m:mo>=</m:mo>
<m:mn>1</m:mn>
<m:mo>-</m:mo>
<m:mi>p</m:mi>
<m:mo>'</m:mo>
</m:math>
</para><para id="element-354">This formula is actually very similar to the error bound formula for a
mean. The difference is the standard deviation. For a mean where
the population standard deviation is known, the standard deviation is
<m:math>
<m:mfrac>
<m:mrow>
<m:mi>σ</m:mi>
</m:mrow>
<m:mrow>
<m:msqrt>
<m:mi>n</m:mi>
</m:msqrt>
</m:mrow>
</m:mfrac>
</m:math>.</para><para id="element-886">For a proportion, the standard deviation is
<m:math>
<m:msqrt>
<m:mfrac>
<m:mrow>
<m:mi>p</m:mi>
<m:mo>⋅</m:mo>
<m:mi>q</m:mi>
</m:mrow>
<m:mrow>
<m:mi>n</m:mi>
</m:mrow>
</m:mfrac>
</m:msqrt>
</m:math>.</para><para id="element-556">However, in the error bound formula, the standard deviation is
<m:math>
<m:msqrt>
<m:mfrac>
<m:mrow>
<m:mi>p</m:mi>
<m:mo>'</m:mo>
<m:mo>⋅</m:mo>
<m:mi>q</m:mi>
<m:mo>'</m:mo>
</m:mrow>
<m:mrow>
<m:mi>n</m:mi>
</m:mrow>
</m:mfrac>
</m:msqrt>
</m:math>.</para><para id="element-685">In the error bound formula, <emphasis><m:math><m:mi>p</m:mi><m:mo>'</m:mo></m:math> and <m:math><m:mi>q</m:mi><m:mo>'</m:mo></m:math> are estimates of <m:math><m:mi>p</m:mi></m:math> and <m:math><m:mi>q</m:mi></m:math></emphasis>. The estimated
proportions <m:math><m:mi>p</m:mi><m:mo>'</m:mo></m:math> and <m:math><m:mi>q</m:mi><m:mo>'</m:mo></m:math> are used because <m:math><m:mi>p</m:mi></m:math> and <m:math><m:mi>q</m:mi></m:math> are not known. <m:math><m:mi>p</m:mi><m:mo>'</m:mo></m:math> and <m:math><m:mi>q</m:mi><m:mo>'</m:mo></m:math> are
calculated from the data. <m:math><m:mi>p</m:mi><m:mo>'</m:mo></m:math> is the estimated proportion of successes. <m:math><m:mi>q</m:mi><m:mo>'</m:mo></m:math> is the
estimated proportion of failures.</para><para id="element-291">When a study gives a margin of error of "+ or - 3 percentage points", this is determined
before the survey is done. Since <m:math><m:mi>p</m:mi><m:mo>'</m:mo></m:math> and <m:math><m:mi>q</m:mi><m:mo>'</m:mo></m:math> are unknown,<emphasis> the most conservative
choice is <m:math><m:mi>p</m:mi><m:mo>'</m:mo><m:mo>=</m:mo><m:mn>0.5</m:mn></m:math> and
<m:math><m:mi>q</m:mi><m:mo>'</m:mo><m:mo>=</m:mo><m:mn>0.5</m:mn></m:math></emphasis>, because these values give the largest standard
deviation, error bound, and confidence interval.</para><note>For the normal distribution of proportions, the z-score formula is as follows.</note><para id="element-272">If
<m:math>
<m:mi>P</m:mi>
<m:mo>'</m:mo></m:math> ~
<m:math>
<m:mi>N</m:mi>
<m:mo>(</m:mo>
<m:mi>p</m:mi>
<m:mo>,</m:mo>
<m:msqrt>
<m:mfrac>
<m:mrow>
<m:mi>p</m:mi>
<m:mo>⋅</m:mo>
<m:mi>q</m:mi>
</m:mrow>
<m:mrow>
<m:mi>n</m:mi>
</m:mrow>
</m:mfrac>
</m:msqrt>
<m:mo>)</m:mo>
</m:math> then the z-score formula is
<m:math>
<m:mi>z</m:mi>
<m:mo>=</m:mo>
<m:mfrac>
<m:mrow>
<m:mi>p</m:mi>
<m:mo>'</m:mo>
<m:mo>-</m:mo>
<m:mi>p</m:mi>
</m:mrow>
<m:mrow>
<m:msqrt>
<m:mfrac>
<m:mrow>
<m:mi>p</m:mi>
<m:mo>⋅</m:mo>
<m:mi>q</m:mi>
</m:mrow>
<m:mrow>
<m:mi>n</m:mi>
</m:mrow>
</m:mfrac>
</m:msqrt>
</m:mrow>
</m:mfrac>
</m:math></para><example id="element-32"><exercise id="element-814"><problem>
  <para id="element-687">Suppose that a sample of 500 households in Phoenix was taken last May
to determine whether the oldest child had given his/her mother a Mother's Day card. Of
the 500 households, 421 responded yes. Compute a 95% confidence interval for the true
proportion of all Phoenix households whose oldest child gave his/her mother a Mother's
Day card.</para><list id="element-757" type="bulleted"><name>Note:</name><item>The first solution is step-by-step.</item>
<item>The second solution uses the TI-83+ and TI-84 calculators.</item>
</list>
</problem>

<solution>
  <para id="element-386">Let <m:math><m:mi>X</m:mi></m:math> = the number of oldest children who gave their mothers Mother's Day card last
May. <m:math><m:mi>X</m:mi></m:math> is binomial. <m:math><m:mi>X</m:mi></m:math> ~ <m:math><m:mi>B</m:mi><m:mo>(</m:mo><m:mn>500</m:mn><m:mo>,</m:mo> <m:mfrac><m:mrow><m:mn>421</m:mn></m:mrow><m:mrow><m:mn>500</m:mn></m:mrow></m:mfrac><m:mo>)</m:mo></m:math>.</para><para id="element-985">To calculate the confidence interval, you must find <m:math><m:mi>p</m:mi><m:mo>'</m:mo></m:math>, <m:math><m:mi>q</m:mi><m:mo>'</m:mo></m:math>, and <m:math><m:mtext>EBP</m:mtext></m:math>.</para><para id="element-227"><m:math>
<m:mi>n</m:mi>
<m:mo>=</m:mo>
<m:mn>500</m:mn>
<m:mspace width="20pt"/>
<m:mi>x</m:mi>
</m:math>
= the number of successes
<m:math>
<m:mo>=</m:mo>
<m:mn>421</m:mn>
</m:math>
</para><para id="element-513"><m:math>
<m:mi>p</m:mi>
<m:mo>'</m:mo>
<m:mo>=</m:mo>
<m:mfrac>
<m:mrow>
<m:mi>x</m:mi>
</m:mrow>
<m:mrow>
<m:mi>n</m:mi>
</m:mrow>
</m:mfrac>
<m:mo>=</m:mo>
<m:mfrac>
<m:mrow>
<m:mn>421</m:mn>
</m:mrow>
<m:mrow>
<m:mn>500</m:mn>
</m:mrow>
</m:mfrac>
<m:mo>=</m:mo>
<m:mn>0.842</m:mn>
</m:math></para><para id="element-339"><m:math>
<m:mi>q</m:mi>
<m:mo>'</m:mo>
<m:mo>=</m:mo>
<m:mn>1</m:mn>
<m:mo>-</m:mo>
<m:mi>p</m:mi>
<m:mo>'</m:mo>
<m:mo>=</m:mo>
<m:mn>1</m:mn>
<m:mo>-</m:mo>
<m:mn>0.842</m:mn>
<m:mo>=</m:mo>
<m:mn>0.158</m:mn>
</m:math></para><para id="element-239">Since
<m:math>
<m:mtext>CL</m:mtext>
<m:mo>=</m:mo>
<m:mn>0.95</m:mn>
</m:math>, then
<m:math>
<m:mi>α</m:mi>
<m:mo>=</m:mo>
<m:mn>1</m:mn>
<m:mo>-</m:mo>
<m:mtext>CL</m:mtext>
<m:mo>=</m:mo>
<m:mn>1</m:mn>
<m:mo>-</m:mo>
<m:mn>0.95</m:mn>
<m:mo>=</m:mo>
<m:mn>0.05</m:mn>
<m:mspace width="20pt"/>
<m:mfrac>
<m:mrow>
<m:mi>α</m:mi>
</m:mrow>
<m:mrow>
<m:mn>2</m:mn>
</m:mrow>
</m:mfrac>
<m:mo>=</m:mo>
<m:mn>0.025</m:mn>
</m:math>.</para><para id="element-838">Then
<m:math>
<m:msub>
<m:mi>z</m:mi>
<m:mfrac>
<m:mrow>
<m:mi>α</m:mi>
</m:mrow>
<m:mrow>
<m:mn>2</m:mn>
</m:mrow>
</m:mfrac>
</m:msub>
<m:mo>=</m:mo>
<m:msub>
<m:mi>z</m:mi>
<m:mn>.025</m:mn>
</m:msub>
<m:mo>=</m:mo>
<m:mn>1.96</m:mn>
</m:math> using a calculator, computer, or standard normal table.</para><para id="element-88">Remember that the area to the right = 0.025 and therefore, area to the
left is 0.975.</para><para id="element-42">The z-score that corresponds to 0.975 is 1.96.</para><para id="element-255"><m:math>
<m:mtext>EBP</m:mtext>
<m:mo>=</m:mo>
<m:msub>
<m:mi>z</m:mi>
<m:mfrac>
<m:mrow>
<m:mi>α</m:mi>
</m:mrow>
<m:mrow>
<m:mn>2</m:mn>
</m:mrow>
</m:mfrac>
</m:msub>
<m:mo>⋅</m:mo>
<m:msqrt>
<m:mfrac>
<m:mrow>
<m:mi>p</m:mi>
<m:mo>'</m:mo>
<m:mo>⋅</m:mo>
<m:mi>q</m:mi>
<m:mo>'</m:mo>
</m:mrow>
<m:mrow>
<m:mi>n</m:mi>
</m:mrow>
</m:mfrac>
</m:msqrt>
<m:mo>=</m:mo>
<m:mn>1.96</m:mn>
<m:mo>⋅</m:mo>
<m:msqrt>
<m:mo>[</m:mo>
<m:mfrac>
<m:mrow>
<m:mo>(</m:mo>
<m:mn>.842</m:mn>
<m:mo>)</m:mo>
<m:mo>⋅</m:mo>
<m:mo>(</m:mo>
<m:mn>.158</m:mn>
<m:mo>)</m:mo>
</m:mrow>
<m:mrow>
<m:mn>500</m:mn>
</m:mrow>
</m:mfrac>
<m:mo>]</m:mo>
</m:msqrt>
<m:mo>=</m:mo>
<m:mn>0.032</m:mn>
</m:math></para><para id="element-237"><m:math>
<m:mi>p</m:mi>
<m:mo>'</m:mo>
<m:mo>-</m:mo>
<m:mtext>EBP</m:mtext>
<m:mo>=</m:mo>
<m:mn>0.842</m:mn>
<m:mo>-</m:mo>
<m:mn>0.032</m:mn>
<m:mo>=</m:mo>
<m:mn>0.81</m:mn>
</m:math></para><para id="element-744"><m:math>
<m:mi>p</m:mi>
<m:mo>'</m:mo>
<m:mo>+</m:mo>
<m:mtext>EBP</m:mtext>
<m:mo>=</m:mo>
<m:mn>0.842</m:mn>
<m:mo>+</m:mo>
<m:mn>0.032</m:mn>
<m:mo>=</m:mo>
<m:mn>0.874</m:mn>
</m:math></para><para id="element-144">The confidence interval for the true binomial population proportion is <m:math><m:mo>(</m:mo><m:mi>p</m:mi><m:mo>'</m:mo><m:mo>-</m:mo><m:mtext>EBP</m:mtext><m:mo>,</m:mo><m:mi>p</m:mi><m:mo>'</m:mo><m:mo>+</m:mo><m:mtext>EBP</m:mtext><m:mo>)</m:mo> <m:mo>=</m:mo></m:math><emphasis><m:math><m:mo>(</m:mo><m:mn>0.810</m:mn><m:mo>,</m:mo><m:mn>0.874</m:mn><m:mo>)</m:mo></m:math></emphasis>.</para><para id="element-822">We are 95% confident that between 81% and 87.4% of the
oldest children in households in Phoenix gave their mothers a
Mother's Day card last May.</para><para id="element-415">We can also say that 95% of the confidence intervals constructed in this
way contain the true proportion of oldest children in Phoenix who gave
their mothers a Mother's Day card last May.</para>
</solution>

<solution>
  <para id="element-387">TI-83+ and TI-84: Press <code>STAT</code> and arrow over to <code>TESTS</code>. Arrow down
to <code>A:PropZint</code>. Press <code>ENTER</code>. Enter 421 for <m:math><m:mi>x</m:mi></m:math>, 500 for <m:math><m:mi>n</m:mi></m:math>, and .95 for
<code>C-Level</code>. Arrow down to <code>Calculate</code> and press <code>ENTER</code>. The confidence
interval is (0.81003, 0.87397).</para>
</solution>
</exercise>
</example><example id="element-735"><exercise id="element-2"><problem>
  <para id="element-182">For a class project, a political science student at a large university
wants to determine the percent of students that are registered voters. He surveys 500
students and finds that 300 are registered voters. Compute a 90% confidence interval
for the true percent of students that are registered voters and interpret the confidence
interval.</para>
</problem>

<solution>
  <para id="element-829"><m:math><m:mi>x</m:mi><m:mo>=</m:mo><m:mn>300</m:mn></m:math> and <m:math><m:mi>n</m:mi><m:mo>=</m:mo><m:mn>500</m:mn></m:math>. Using a TI-83+ or 84 calculator, the 90% confidence
interval for the true percent of students that are registered voters is (0.564, 0.636).</para><list id="element-523" type="bulleted"><name>Interpretation:</name><item>We are 90% confident that the true percent of students that are registered voters
is between 56.4% and 63.6%.</item>
<item>Ninety percent (90 %) of all confidence intervals constructed in this way contain
the true percent of students that are registered voters.</item>
</list>
</solution>
</exercise>
</example>   
  </content>

<glossary>
 <definition id="bidist">
    <term>Binomial Distribution</term>
    <meaning>
      A discrete random variable (RV) which arises from the Bernoulli trials with the next additional requirements. There are fixed number, n, of independent trials. “Independent” means that the result to any trial (for example, trial 1) in no way affects the answer to all the following trials, and all trials are conducted under the same conditions. Under these circumstances the binomial RV 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mi>X</m:mi></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{X} {}</m:annotation></m:semantics></m:math> is defined as the number of success in n trials. The notation is: 

<emphasis><m:math><m:mi>X</m:mi></m:math>~<m:math> <m:mi>B</m:mi>
  <m:mo>(</m:mo>
  <m:mi>n</m:mi>
  <m:mo>,</m:mo>
  <m:mi>p</m:mi>
  <m:mo>)</m:mo></m:math></emphasis>; the domain is
 the mean is <m:math><m:apply>
  <m:eq/>
  <m:ci>μ</m:ci>
  <m:ci>np</m:ci>
</m:apply>
</m:math>, and the variance is <m:math>

   <m:msup>
    <m:mi>σ</m:mi>
    <m:mn>2</m:mn>
  </m:msup>
  <m:mo>=</m:mo>
  <m:mi>df</m:mi></m:math>. The probability to have exactly <m:math><m:mi>x</m:mi></m:math> successes in <m:math><m:mi>n</m:mi></m:math> trials is <m:math>
  <m:mi>P</m:mi>
  <m:mo>(</m:mo>
  <m:mi>X</m:mi>
  <m:mo>=</m:mo>
  <m:mi>x</m:mi>
  <m:mo>)</m:mo>
  <m:mo>=</m:mo>
  <m:mfenced>
    <m:mfrac linethickness="0">
      <m:mi>n</m:mi>
      <m:mi>x</m:mi>
    </m:mfrac>
  </m:mfenced>
  <m:msup>
    <m:mi>p</m:mi>
    <m:mi>x</m:mi>
  </m:msup>
  <m:msup>
    <m:mi>q</m:mi>
    <m:mrow>
      <m:mi>n</m:mi>
      <m:mo>−</m:mo>
      <m:mi>x</m:mi>
    </m:mrow>
  </m:msup>
</m:math>.
    </meaning>
  </definition>

  <definition id="coninter">
    <term>Confidential Interval</term>
    <meaning>
  An interval estimate for unknown population parameter. This depends on: 
<list type="bulleted" id="confint1">
<item>The desired confidence level.</item> <item>What is known for the distribution information (for ex., known variance).</item><item>Gathering from the sampling information.</item></list>
    </meaning>
  </definition>

  <definition id="conflevel">
    <term>Confidence Level</term>
    <meaning>
The percent expression for the probability that the confidence interval contains the true population parameter. That is, for ex., if CL=90%, then in 90 out of 100 samples the interval estimate will enclose the true population parameter.
    </meaning>
  </definition>


  <definition id="ebmbound">
    <term>Error Bound for a Population Mean (EBM)</term>
    <meaning>
      The margin of error. Depends on the confidence level, sample size, and known or estimated population standard deviation.
    </meaning>
  </definition>

<definition id="normdist">
    <term>Normal Distribution</term>
    <meaning>
   A continuous random variable (RV) with 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mrow><m:mstyle fontstyle="italic"><m:mrow><m:mtext>pdf</m:mtext></m:mrow></m:mstyle><m:mo stretchy="false">=</m:mo><m:mfrac><m:mn>1</m:mn><m:mrow><m:mi>σ</m:mi><m:msqrt><m:mn>2π</m:mn></m:msqrt></m:mrow></m:mfrac></m:mrow><m:msup><m:mi>e</m:mi><m:mstyle fontsize="8pt"><m:mrow><m:mrow><m:mrow><m:mo stretchy="false">−</m:mo><m:mo stretchy="false">(</m:mo></m:mrow><m:mrow><m:mi>x</m:mi><m:mo stretchy="false">−</m:mo><m:mi>μ</m:mi></m:mrow><m:mrow><m:msup><m:mo stretchy="false">)</m:mo><m:mstyle fontsize="6pt"><m:mrow><m:mn>2</m:mn></m:mrow></m:mstyle></m:msup><m:mo stretchy="false">/</m:mo><m:msup><m:mn>2σ</m:mn><m:mstyle fontsize="6pt"><m:mrow><m:mn>2</m:mn></m:mrow></m:mstyle></m:msup></m:mrow></m:mrow></m:mrow></m:mstyle></m:msup></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{ ital "pdf"= {  {1}  over  {σ sqrt {2π} } } e rSup { size 8{ -  \( x - μ \)  rSup { size 6{2} } /2σ rSup { size 6{2} } } } } {}</m:annotation></m:semantics></m:math>, where <m:math><m:mi>μ</m:mi></m:math>  is the mean of the distribution and <m:math><m:mi>σ</m:mi></m:math>  is its standard deviation. Notation: <m:math><m:mi>X</m:mi></m:math>  ~  <m:math> <m:mi>N</m:mi>
  <m:mfenced>
    <m:mi>μ</m:mi>
    <m:msup>
      <m:mi>σ</m:mi>
      <m:mn>2</m:mn>
    </m:msup>
  </m:mfenced></m:math>. If <m:math><m:mi>μ</m:mi><m:mo>=</m:mo><m:mn>0</m:mn></m:math> and <m:math><m:mi>σ</m:mi><m:mo>=</m:mo><m:mn>1</m:mn></m:math>, the RV is called <emphasis>standard normal distribution</emphasis>, or <emphasis>z-score</emphasis>.
    </meaning>
  </definition>

 
</glossary>  
</document>
