<?xml version="1.0" encoding="utf-8"?>
<document xmlns="http://cnx.rice.edu/cnxml" xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:md="http://cnx.rice.edu/mdml/0.4" xmlns:bib="http://bibtexml.sf.net/" xmlns:q="http://cnx.rice.edu/qml/1.0" id="id7033113" module-id="" cnxml-version="0.6">
  <title>Collaborative Statistics: Glossary</title>
<metadata xmlns:md="http://cnx.rice.edu/mdml/0.4">
  <!-- WARNING! The 'metadata' section is read only. Do not edit below.
       Changes to the metadata section in the source will not be saved. -->
  <md:content-id>m16129</md:content-id>
  <md:title>Collaborative Statistics: Glossary</md:title>
  <md:version>1.11</md:version>
  <md:created>2008/04/25 09:21:28 GMT-5</md:created>
  <md:revised>2009/02/24 12:53:59.167 US/Central</md:revised>
  <md:authorlist>
    <md:author id="sdean">
        <md:firstname>Susan</md:firstname>
        <md:surname>Dean</md:surname>
        <md:fullname>Susan Dean</md:fullname>
        <md:email>deansusan@deanza.edu</md:email>
    </md:author>
    <md:author id="billowsky">
        <md:firstname>Barbara</md:firstname>
        <md:surname>Illowsky</md:surname>
        <md:fullname>Barbara Illowsky, Ph.D.</md:fullname>
        <md:email>illowskybarbara@deanza.edu</md:email>
    </md:author>
  </md:authorlist>
  <md:maintainerlist>
    <md:maintainer id="sdean">
        <md:firstname>Susan</md:firstname>
        <md:surname>Dean</md:surname>
        <md:fullname>Susan Dean</md:fullname>
        <md:email>deansusan@deanza.edu</md:email>
    </md:maintainer>
    <md:maintainer id="billowsky">
        <md:firstname>Barbara</md:firstname>
        <md:surname>Illowsky</md:surname>
        <md:fullname>Barbara Illowsky, Ph.D.</md:fullname>
        <md:email>illowskybarbara@deanza.edu</md:email>
    </md:maintainer>
    <md:maintainer id="cnxorg">
        <md:firstname/>
        <md:surname>Connexions</md:surname>
        <md:fullname>Connexions</md:fullname>
        <md:email>cnx@cnx.org</md:email>
    </md:maintainer>
  </md:maintainerlist>
  <md:license href="http://creativecommons.org/licenses/by/2.0/"/>
  <md:licensorlist>
    <md:licensor id="MaxfieldFoundation">
        <md:firstname/>
        <md:surname>Maxfield Foundation</md:surname>
        <md:fullname>Maxfield Foundation</md:fullname>
        <md:email>cnx@cnx.org</md:email>
    </md:licensor>
  </md:licensorlist>
  <md:keywordlist>
    <md:keyword>definitions</md:keyword>
    <md:keyword>glossary</md:keyword>
    <md:keyword>statistics</md:keyword>
    <md:keyword>terms</md:keyword>
  </md:keywordlist>
  <md:subjectlist>
    <md:subject>Mathematics and Statistics</md:subject>
  </md:subjectlist>
  <md:abstract>This module contains a number of glossary terms related to elementary statistics.  This module represents the combined glossary information for the Collaborative Statistics textbook/module (col10522).</md:abstract>
  <md:language>en</md:language>
  <!-- WARNING! The 'metadata' section is read only. Do not edit above.
       Changes to the metadata section in the source will not be saved. -->
</metadata>

<content>
    <para id="nodata"/>
    </content>


<glossary>
  <definition id="additionrule">
    <term>Addition Rule</term>
    <meaning id="id43978334">
     For any events 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mi>A </m:mi></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{A} {}</m:annotation></m:semantics><m:mspace/></m:math> and
 <m:math><m:mspace/><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mi>B </m:mi></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{B} {}</m:annotation></m:semantics></m:math> in the sample space <m:math>
        <m:semantics>
          <m:mrow>
            <m:mstyle fontsize="12pt">
              <m:mrow>
                <m:mrow>
                  <m:mi>P</m:mi>
                  <m:mo stretchy="false">(</m:mo>
                  <m:mi>A</m:mi>
                  <m:mstyle fontweight="bold">
                    <m:mrow>
                      <m:mspace/><m:mtext> or </m:mtext><m:mspace/>
                    </m:mrow>
                  </m:mstyle>
                  <m:mi>B</m:mi>
                  <m:mrow>
                    <m:mo stretchy="false">)</m:mo>
                    <m:mo stretchy="false">=</m:mo>
                    <m:mi>P</m:mi>
                  </m:mrow>
                  <m:mo stretchy="false">(</m:mo>
                  <m:mi>A</m:mi>
                  <m:mrow>
                    <m:mo stretchy="false">)</m:mo>
                    <m:mo stretchy="false">+</m:mo>
                    <m:mi>P</m:mi>
                  </m:mrow>
                  <m:mo stretchy="false">(</m:mo>
                  <m:mi>B</m:mi>
                  <m:mrow>
                    <m:mo stretchy="false">)</m:mo>
                    <m:mo stretchy="false">−</m:mo>
                    <m:mi>P</m:mi>
                  </m:mrow>
                  <m:mo stretchy="false">(</m:mo>
                  <m:mi>A</m:mi>
                  <m:mstyle fontweight="bold">
                    <m:mrow>
                      <m:mspace/><m:mtext>and</m:mtext><m:mspace/>
                    </m:mrow>
                  </m:mstyle>
                  <m:mi>B</m:mi>
                  <m:mo stretchy="false">)</m:mo>
                </m:mrow>
              </m:mrow>
            </m:mstyle>
            <m:mrow/>
          </m:mrow>
          <m:annotation encoding="StarMath 5.0"> size 12{P \( A bold "or"B \) =P \( A \) +P \( B \) -P \( A bold "and"B \) } {}</m:annotation>
        </m:semantics>
      </m:math>.
    </meaning>
  </definition>

  <definition id="anova">
    <term>Analysis of Variance</term>
    <meaning id="id18185394">
      Also referred to as ANOVA.  A method of testing whether or not the means of three or more populations are equal. The method is applicable if: 
<list id="gllist1" list-type="bulleted">
<item>All populations of interest are normally distributed.</item>
<item>The populations have equal standard deviations.</item>
<item>Samples (not necessarily of the same size) are randomly and independently selected from each population.</item>
</list>The test statistic for analysis of variance is the F-ratio.
    </meaning>
  </definition>

  <definition id="and">
    <term>AND</term>
    <meaning id="id15817023">
     Logical operation over the subsets of a set. In statistics, if 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mi>A </m:mi></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{A} {}</m:annotation></m:semantics></m:math> and 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mi>B</m:mi></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{B} {}</m:annotation></m:semantics></m:math><m:math><m:semantics><m:mrow/><m:annotation encoding="StarMath 5.0">{}</m:annotation></m:semantics></m:math> are any two events (subsets in the sample space), then the event “
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mi>A</m:mi></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{A} {}</m:annotation></m:semantics></m:math> and
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mi>B</m:mi></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{B} {}</m:annotation></m:semantics></m:math>” consists of all possible outcomes that are common to both  
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mi>A</m:mi></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{A} {}</m:annotation></m:semantics></m:math> and
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mi>B</m:mi></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{B} {}</m:annotation></m:semantics></m:math>.
    </meaning>
  </definition>

  <definition id="arithmean">
    <term>Arithmetic Mean</term>
    <meaning id="id14691138">
      The sum of the values divided by the number of values. The notation for the mean of a sample is <m:math><m:apply>
  <m:conjugate/>
  <m:ci>x</m:ci>
</m:apply></m:math>. The notation for the mean of a population is <m:math><m:ci>μ</m:ci></m:math>. 
    </meaning>
  </definition>

  <definition id="average">
    <term>Average</term>
    <meaning id="id16316921">
      A number that describes the central tendency of the data. There are a number of specialized averages, including the arithmetic mean, weighted mean, median, mode, and geometric mean.
    </meaning>
  </definition>


  <definition id="bayestheorem">
    <term>Bayes' Theorem</term>
    <meaning id="id3186812">
     Developed by Reverend Bayes in the 1700s. A rule designed to find the probability of one event, 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mi>A</m:mi></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{A} {}</m:annotation></m:semantics></m:math>, occurring, given that a finite set of other events, 
<m:math><m:mo>{</m:mo><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:msub><m:mi>B</m:mi><m:mstyle fontsize="8pt"><m:mrow><m:mi>i</m:mi></m:mrow></m:mstyle></m:msub><m:mi>,</m:mi><m:mrow><m:mi>i</m:mi><m:mo stretchy="false">=</m:mo><m:mn>1,2,</m:mn></m:mrow><m:mtext>.</m:mtext><m:mtext>.</m:mtext><m:mtext>.</m:mtext><m:mi>,</m:mi><m:mi>l</m:mi></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{B rSub { size 8{i} } ,i=1,2, "."  "."  "." ,l} {}</m:annotation></m:semantics><m:mo>}</m:mo></m:math>, has occurred. 
    </meaning>
  </definition>


  <definition id="bernoullitr">
    <term>Bernoulli Trials</term>
    <meaning id="id16564907">
      An experiment with the following characteristics: <list id="gloslst1" list-type="bulleted">
<item>There are only 2 possible outcomes called “success” and “failure” for each trial.</item>
<item>
The probabilities <emphasis><m:math><m:mi>p</m:mi></m:math></emphasis> of success and <emphasis><m:math><m:mi>q</m:mi> <m:mo>=</m:mo> <m:mn>1</m:mn><m:mo>-</m:mo><m:mi>p</m:mi></m:math> </emphasis>of failure are the same for any trial.
</item></list></meaning>
  </definition>


  <definition id="bias">
    <term>Bias</term>
    <meaning id="id43978397">
      A possible consequence if certain members of the population are denied the chance to be selected for the sample.
    </meaning>
  </definition>


  <definition id="bidist">
    <term>Binomial Distribution</term>
    <meaning id="id43978423">
      A discrete random variable (RV) which arises from the Bernoulli trials. There are a fixed number, <m:math><m:mi>n</m:mi></m:math>, of independent trials. “Independent” means that the result of any trial (for example, trial 1) does not affect the results of  all the following trials, and all trials are conducted under the same conditions. Under these circumstances the binomial RV 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mi>X</m:mi></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{X} {}</m:annotation></m:semantics></m:math> is defined as the number of successes in <m:math><m:mi>n</m:mi></m:math> trials. The notation is: 

<emphasis><m:math><m:mi>X</m:mi></m:math>~<m:math> <m:mi>B</m:mi>
  <m:mo>(</m:mo>
  <m:mi>n</m:mi>
  <m:mo>,</m:mo>
  <m:mi>p</m:mi>
  <m:mo>)</m:mo></m:math></emphasis>.  
 The mean is <m:math><m:apply>
  <m:eq/>
  <m:ci>μ</m:ci>
  <m:ci>np</m:ci>
</m:apply>
</m:math> and the standard deviation is 

<m:math>   
  <m:mi>σ</m:mi>
  <m:mo>=</m:mo>
  <m:msqrt><m:mi>npq</m:mi></m:msqrt></m:math>. The probability of having  exactly <m:math><m:mi>x</m:mi></m:math> successes in <m:math><m:mi>n</m:mi></m:math> trials is <m:math>
  <m:mi>P</m:mi>
  <m:mo>(</m:mo>
  <m:mi>X</m:mi>
  <m:mo>=</m:mo>
  <m:mi>x</m:mi>
  <m:mo>)</m:mo>
  <m:mo>=</m:mo>
  <m:mfenced>
    <m:mfrac linethickness="0">
      <m:mi>n</m:mi>
      <m:mi>x</m:mi>
    </m:mfrac>
  </m:mfenced>
  <m:msup>
    <m:mi>p</m:mi>
    <m:mi>x</m:mi>
  </m:msup>
  <m:msup>
    <m:mi>q</m:mi>
    <m:mrow>
      <m:mi>n</m:mi>
      <m:mo>−</m:mo>
      <m:mi>x</m:mi>
    </m:mrow>
  </m:msup>
</m:math>.
    </meaning>
  </definition>


  <definition id="centlimit">
    <term>Central Limit Theorem</term>
    <meaning id="id43867050">
     Given a random variable (RV) with known mean <m:math><m:mi>μ</m:mi></m:math> and known standard deviation <m:math><m:mi>σ</m:mi></m:math>. We are sampling with size n and we are interested in two new RVs - the sample mean, 
<m:math><m:mrow><m:mover accent="true"><m:mi>X</m:mi><m:mo stretchy="false">ˉ</m:mo></m:mover></m:mrow></m:math>,

and the sample sum, <m:math><m:mrow><m:mi>Σ</m:mi><m:mi>X</m:mi></m:mrow></m:math>. 

If the size <m:math><m:mi>n</m:mi></m:math> of the sample is sufficiently large, then 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mover accent="true"><m:mi>X</m:mi><m:mo stretchy="false">ˉ</m:mo></m:mover></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{ { bar  {X}}} {}</m:annotation></m:semantics></m:math>∼ 

<m:math>
 <m:mi>N</m:mi>
  <m:mfenced>
    <m:mi>μ</m:mi>
    <m:mfrac>
        <m:mi>σ</m:mi>
      <m:msqrt><m:mi>n</m:mi></m:msqrt>
    </m:mfrac>
  </m:mfenced>
</m:math>

<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:msup><m:mrow/><m:mstyle fontsize="8pt"><m:mrow><m:mn/></m:mrow></m:mstyle></m:msup></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> </m:annotation></m:semantics></m:math> and 

<m:math> <m:mi>Σ</m:mi><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mi>X</m:mi></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{X} {}</m:annotation></m:semantics></m:math> ∼  
<m:math><m:mi>N</m:mi>
  <m:mo>(</m:mo>
    <m:mi>nμ</m:mi>
    <m:mo>,</m:mo>
    <m:msqrt><m:mi>n</m:mi></m:msqrt>
    <m:mi>σ</m:mi>
  <m:mo>)</m:mo></m:math>. 

If the size n of the sample is sufficiently large, then the distribution of the sample means and the distribution of the sample sums will approximate a normal distribution regardless of the shape of the population. The mean of the sample  means will equal the population mean and the mean of the sample sums will equal n times the population mean. The standard deviation of the distribution of the sample means, 
<m:math> <m:mfrac>
    <m:mi>σ</m:mi>
    <m:msqrt>
      <m:mi>n</m:mi>
    </m:msqrt>
  </m:mfrac></m:math>, is called the standard error of the mean.
    </meaning>
  </definition>


  <definition id="charts">
    <term>Charts</term>
    <meaning id="id43964579">
     Special graphical formats used to visualize a frequency distribution. They include, but are not limited to: <emphasis>histograms, frequency polygons, cumulative frequency polygons, box plots, stemplots, bar charts, Venn and tree diagrams, and pie charts</emphasis>. 
    </meaning>
  </definition>


  <definition id="classmark">
    <term>Class Mark</term>
    <meaning id="id43964620">
     Midpoint of the class.
    </meaning>
  </definition>



  <definition id="chisqdist">
    <term>Chi-square Distribution</term>
    <meaning id="id43964647">
     A continuous distribution with the following characteristics: 
<list id="chisqlst" list-type="bulleted"> 

<item>The random variable (RV) is continuous and takes on only nonnegative values (in fact, it is the sum of squares of <m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mi>k</m:mi></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{k} {}</m:annotation></m:semantics></m:math> independent normal distributions).</item>

<item>There is a "family" of Chi-square distributions. Each representative of the family is completely defined by the number of degrees of freedom, 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mi>k</m:mi><m:mo stretchy="false">−</m:mo><m:mn>1</m:mn></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{k - 1} {}</m:annotation></m:semantics></m:math>, where <m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mi>k</m:mi></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{k} {}</m:annotation></m:semantics></m:math> is the number of categories (not the size of sample). </item>

<item>The pdf is positively skewed (skewed right). However, as 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mi>k</m:mi></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{k} {}</m:annotation></m:semantics></m:math> increases (<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mi>k</m:mi></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{k} {}</m:annotation></m:semantics></m:math>&gt;90), the distribution approximates the normal distribution.</item></list>

The notation is: 

<m:math>
<m:msup>
<m:mi>χ</m:mi>
<m:mn>2</m:mn>
</m:msup></m:math> ~
<m:math>
<m:msubsup>
<m:mi>χ</m:mi>
<m:mtext>df</m:mtext>
<m:mn>2</m:mn>
</m:msubsup>
</m:math>.  

For the 
<m:math>
<m:msup>
<m:mi>χ</m:mi>
<m:mn>2</m:mn>
</m:msup>
</m:math> distribution, the population mean is 
<m:math>
<m:mi>μ</m:mi>
<m:mo>=</m:mo>
<m:mi>df</m:mi>
</m:math> and the population
standard deviation is 
<m:math>
<m:mi>σ</m:mi>
<m:mo>=</m:mo>
<m:msqrt>
<m:mn>2</m:mn>
<m:mo>⋅</m:mo>
<m:mi>df</m:mi>
</m:msqrt>
</m:math>.


The Chi-square distribution is used to calculate the test statistic for the  <emphasis>Goodness-of-fit Test</emphasis> (to determine if a population follows a specified distribution), for the <emphasis>Test of Independence</emphasis> (to determine if two factors are related or not), and for the <emphasis>Test of a Single Variance.</emphasis>
    </meaning>
  </definition>



  <definition id="class">
    <term>Classes</term>
    <meaning id="id44035025">
     Intervals in which the data are grouped. It is convenient to group outcomes into classes when working with large amounts of data. For example, every bar in a histogram corresponds to one class (one interval) and the midpoint of the interval can be chosen as a representative of all outcomes in the class. The Midpoint of the class is often called the <emphasis>class mark</emphasis>.
    </meaning>
  </definition>



  <definition id="clustersamp">
    <term>Cluster Sampling</term>
    <meaning id="id44035066">
      A procedure that is used if the population is dispersed over a wide geographic area. The population is divided into units or groups (counties, precincts, blocks, etc.) called primary units. Then some of the primary units are randomly chosen, and all members of those primary units are the sample. 
    </meaning>
  </definition>



  <definition id="coeffcorr">
    <term>Coefficient of Correlation</term>
    <meaning id="id44035097">
A measure developed by Karl Pearson (early 1900s) that gives the strength of association between the independent variable and the dependent variable. The formula is:
    <equation id="id5499555">
      <m:math>
        <m:semantics>
          <m:mrow>
            <m:mstyle fontsize="12pt">
              <m:mrow>
                <m:mrow>
                  <m:mrow>
                    <m:mi>r</m:mi>
                    <m:mo stretchy="false">=</m:mo>
                    <m:mfrac>
                      <m:mrow>
                        <m:mi>n</m:mi>
                        <m:mrow>
                          <m:mrow>
                            <m:mo stretchy="false">∑</m:mo>
                            <m:mstyle fontstyle="italic">
                              <m:mrow>
                                <m:mtext>XY</m:mtext>
                              </m:mrow>
                            </m:mstyle>
                          </m:mrow>
                          <m:mo stretchy="false">−</m:mo>
                          <m:mo stretchy="false">(</m:mo>
                        </m:mrow>
                        <m:mrow>
                          <m:mo stretchy="false">∑</m:mo>
                          <m:mrow>
                            <m:mi>X</m:mi>
                            <m:mo stretchy="false">)</m:mo>
                            <m:mo stretchy="false">(</m:mo>
                            <m:mrow>
                              <m:mo stretchy="false">∑</m:mo>
                              <m:mrow>
                                <m:mi>Y</m:mi>
                                <m:mo stretchy="false">)</m:mo>
                              </m:mrow>
                            </m:mrow>
                          </m:mrow>
                        </m:mrow>
                      </m:mrow>
                      <m:msqrt>
                        <m:mrow>
                          <m:mo stretchy="false">[</m:mo>
                          <m:mi>n</m:mi>
                          <m:mrow>
                            <m:mo stretchy="false">∑</m:mo>
                            <m:mrow>
                              <m:mrow>
                                <m:msup>
                                  <m:mi>X</m:mi>
                                  <m:mstyle fontsize="8pt">
                                    <m:mrow>
                                      <m:mn>2</m:mn>
                                    </m:mrow>
                                  </m:mstyle>
                                </m:msup>
                                <m:mo stretchy="false">−</m:mo>
                                <m:mo stretchy="false">(</m:mo>
                              </m:mrow>
                              <m:mrow>
                                <m:mo stretchy="false">∑</m:mo>
                                <m:mrow>
                                  <m:mi>X</m:mi>
                                  <m:msup>
                                    <m:mo stretchy="false">)</m:mo>
                                    <m:mstyle fontsize="8pt">
                                      <m:mrow>
                                        <m:mn>2</m:mn>
                                      </m:mrow>
                                    </m:mstyle>
                                  </m:msup>
                                  <m:mo stretchy="false">]</m:mo>
                                  <m:mo stretchy="false">[</m:mo>
                                  <m:mi>n</m:mi>
                                  <m:mrow>
                                    <m:mo stretchy="false">∑</m:mo>
                                    <m:mrow>
                                      <m:mrow>
                                        <m:msup>
                                          <m:mi>Y</m:mi>
                                          <m:mstyle fontsize="8pt">
                                            <m:mrow>
                                              <m:mn>2</m:mn>
                                            </m:mrow>
                                          </m:mstyle>
                                        </m:msup>
                                        <m:mo stretchy="false">−</m:mo>
                                        <m:mo stretchy="false">(</m:mo>
                                      </m:mrow>
                                      <m:mrow>
                                        <m:mo stretchy="false">∑</m:mo>
                                        <m:mrow>
                                          <m:mi>Y</m:mi>
                                          <m:msup>
                                            <m:mo stretchy="false">)</m:mo>
                                            <m:mstyle fontsize="8pt">
                                              <m:mrow>
                                                <m:mn>2</m:mn>
                                              </m:mrow>
                                            </m:mstyle>
                                          </m:msup>
                                          <m:mo stretchy="false">]</m:mo>
                                        </m:mrow>
                                      </m:mrow>
                                    </m:mrow>
                                  </m:mrow>
                                </m:mrow>
                              </m:mrow>
                            </m:mrow>
                          </m:mrow>
                        </m:mrow>
                      </m:msqrt>
                    </m:mfrac>
                  </m:mrow>
                  <m:mi>,</m:mi>
                </m:mrow>
              </m:mrow>
            </m:mstyle>
            <m:mrow/>
          </m:mrow>
          <m:annotation encoding="StarMath 5.0"> size 12{r= {  {n Sum { ital "XY"}  -  \(  Sum {X \)  \(  Sum {Y \) } } }  over  { sqrt { \[ n Sum {X rSup { size 8{2} }  -  \(  Sum {X \)  rSup { size 8{2} }  \]  \[ n Sum {Y rSup { size 8{2} }  -  \(  Sum {Y \)  rSup { size 8{2} }  \] } } } } } } } ,} {}</m:annotation>
        </m:semantics>
      </m:math>
    </equation>
    where <m:math><m:mi>n</m:mi></m:math> is the number of data points. 
    The coefficient <m:math><m:mi>r</m:mi></m:math> is not more then 1 nor less then -1. The closer the coefficient is to 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mo stretchy="false">±</m:mo><m:mn>1</m:mn></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{ +- 1} {}</m:annotation></m:semantics></m:math>, the stronger the evidence of a significant linear relationship between 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mi>X</m:mi></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{X} {}</m:annotation></m:semantics></m:math> and 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mi>Y</m:mi></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{Y} {}</m:annotation></m:semantics></m:math>.
    </meaning>
  </definition>



  <definition id="the_cdf">
    <term>Cumulative Distribution Function (CDF)</term>
    <meaning id="id6895046">
      Given a quantitative random variable (RV) <m:math><m:mi>X</m:mi></m:math>, the function 
<m:math> <m:mi>P</m:mi>
  <m:mo>(</m:mo>
  <m:mrow>
    <m:mi>X</m:mi>
    <m:mo>≤</m:mo>
    <m:mrow>
      <m:mi>x</m:mi>
      <m:mo>)</m:mo>
    </m:mrow>
  </m:mrow></m:math> is called the Cumulative Distribution Function (CDF).  The CDF is the sum of the probabilities of all values of <m:math><m:mi>X</m:mi></m:math> that are less than or equal to a particular <m:math><m:mi>x</m:mi></m:math>.   
    </meaning>
  </definition>



  <definition id="cumrelfreq">
    <term>Cumulative Relative Frequency</term>
    <meaning id="id19883082">
      The term applies to an ordered set of observations from smallest to largest. The Cumulative Relative Frequency is the sum of the relative frequencies for all values that are less than or equal to the given value.
    </meaning>
  </definition>




  <definition id="compevent">
    <term>Complement Event</term>
    <meaning id="id18406673">
     The event consisting of all outcomes that are in the sample space but are not in the given event. 
    </meaning>
  </definition>




  <definition id="condprob">
    <term>Conditional Probability</term>
    <meaning id="id20331383">
    The likelihood that an event will occur given that another event has already occurred.
    </meaning>
  </definition>




  <definition id="coninter">
    <term>Confidence Interval (CI)</term>
    <meaning id="id20495395">
  An interval estimate for an unknown population parameter. This depends on: 
<list id="confint1" list-type="bulleted">
<item>The desired confidence level.</item> <item>Information that is known about the  distribution (for example, known standard deviation).</item><item>The sample and its size.</item></list>
    </meaning>
  </definition>




  <definition id="conflevel">
    <term>Confidence Level (CL)</term>
    <meaning id="id17288738">
The percent expression for the probability that the confidence interval contains the true population parameter. For example, if the <m:math><m:mi>CL</m:mi><m:mo>=</m:mo><m:mn>90%</m:mn></m:math>, then in <m:math><m:mn>90</m:mn></m:math> out of <m:math><m:mn>100</m:mn></m:math> samples the interval estimate will enclose the true population parameter.
    </meaning>
  </definition>




  <definition id="contintable">
    <term>Contingency Table</term>
    <meaning id="id17487593">
The method of displaying a frequency distribution as a table with rows and columns to show how two variables may be dependent (contingent) upon each other.  The table provides an easy way to calculate conditional probabilities.
    </meaning>
  </definition>


<definition id="continrv">
    <term>Continuous Random Variable (RV)</term>
    <meaning id="id20531650">
     A random variable (RV) whose outcomes are measured.
    </meaning>
<example id="contrvex"> <para id="contrvpara">The height of trees in the forest is a continuous RV.</para></example>
  </definition>


  <definition id="Corranal">
    <term>Correlation Analysis</term>
    <meaning id="id16341927">
      A group of statistical procedures used to measure the strength of the relationship between two variables.
    </meaning>
  </definition>


  <definition id="countprinc">
    <term>Counting Principal</term>
    <meaning id="id20350267">
      If there are 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mi>m</m:mi></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{m} {}</m:annotation></m:semantics></m:math> ways of doing one thing and 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mi>n</m:mi></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{n} {}</m:annotation></m:semantics></m:math> ways of doing another, then there are 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mi>m</m:mi><m:mo stretchy="false">×</m:mo><m:mi>n</m:mi></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{m times n} {}</m:annotation></m:semantics></m:math> ways of doing both. 
  </meaning>
<example id="cntprn1"><para id="cntprn2">A cafe offers 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mi>m</m:mi><m:mo stretchy="false">=</m:mo><m:mn>5</m:mn></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{m=5} {}</m:annotation></m:semantics></m:math> kinds of coffee and 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mi>n</m:mi><m:mo stretchy="false">=</m:mo><m:mn>7</m:mn></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{n=7} {}</m:annotation></m:semantics></m:math> kinds of cake. There are 35 ways to serve coffee with cake.</para></example>

  
  </definition>




  <definition id="critval">
    <term>Critical Value</term>
    <meaning id="id20092187">
      The dividing point between the region where the null hypothesis is not rejected and the region where it is rejected. For a one-tailed hypothesis test, there is only one critical value. For a two-tailed hypothesis test, there are two critical values—one in each tail— with the same absolute value and opposite signs.
    </meaning>
  </definition>



  <definition id="data">
    <term>Data</term>
    <meaning id="id15539900">
      A set of observations (a set of possible outcomes). Most data can be put into two groups: <emphasis>qualitative</emphasis> (hair color, ethnic groups and other <emphasis>attributes</emphasis> of the population) and <emphasis>quantitative</emphasis> (distance traveled to college, number of children in a family, etc.). Quantitative data can be separated into two subgroups: <emphasis>discrete</emphasis> and <emphasis>continuous</emphasis>. Data is discrete if it is the result of counting (the number of students of a given ethnic group in a class, the number of books on a shelf, etc.).  Data is continuous if it is the result of measuring (distance traveled, weight of luggage, etc.)
    </meaning>
  </definition>




  <definition id="degrefree">
    <term>Degrees of Freedom (df)</term>
    <meaning id="id14645554">
The number of sample values that are free to vary.
    </meaning>
  </definition>




  <definition id="depsample">
    <term>Dependant Samples</term>
    <meaning id="id20142902">
   Samples chosen in such a way that they are not independent of each other. Paired samples are dependent because two measurements are taken from the same individual or item. 
   </meaning>

<example id="depsample1"><para id="depsamp2">If the test scores of 13 individuals were recorded before a new teaching method was introduced, and then after using the new method, the paired samples are dependent.
 </para></example>

  </definition>




  <definition id="descstats">
    <term>Descriptive Statistics</term>
    <meaning id="id18219568">
     The numerical and graphical ways used to describe and display the important characteristics of data; for example, charts, frequency distributions, measures of central tendency and measures of spread and skewness.
    </meaning>
  </definition>



  <definition id="discrrv">
    <term>Discrete Random Variable</term>
    <meaning id="id18678019">
 A random variable (RV) whose outcomes are counted. 
  </meaning>
  </definition>




  <definition id="domain">
    <term>Domain</term>
    <meaning id="id12685536">
     The set of possible values for the independent variable.  </meaning>
<example id="dom1"><para id="dom2">

<list id="dom3" list-type="bulleted"><item>We are interested in the longevity of human life in years. The domain is 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mo stretchy="false">{</m:mo><m:mrow><m:mn>0,1,2,3</m:mn><m:mtext>.</m:mtext><m:mtext>.</m:mtext><m:mtext>.</m:mtext><m:mi>,</m:mi><m:mtext>120</m:mtext></m:mrow><m:mo stretchy="false">}</m:mo></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{ lbrace 0,1,2,3 "."  "."  "." ,"120" rbrace } {}</m:annotation></m:semantics></m:math>. </item>
<item> We are interested in the suit of a regular 52-card deck.  The domain is <m:math> <m:mrow>
    <m:mo>{</m:mo>
<m:mo>♥</m:mo>
    <m:mo>;</m:mo>
    <m:mo>♦</m:mo>
    <m:mo>;</m:mo>
    <m:mo>♣</m:mo>
    <m:mo>;</m:mo>
    <m:mo>♠</m:mo>
    <m:mo>}</m:mo>
  </m:mrow></m:math>. </item></list>
</para>
</example>
   
  </definition>




  <definition id="eqlikly">
    <term>Equally Likely</term>
    <meaning id="id19865926">
    Each outcome of an experiment has the same probability of occurring.
    </meaning>
  </definition>




  <definition id="ebmbound">
    <term>Error Bound for a Population Mean (EBM)</term>
    <meaning id="id18766198">
      The margin of error. It depends on the confidence level, sample size, and the known or estimated population standard deviation.
    </meaning>
  </definition>

 <definition id="ebpbound">
    <term>Error Bound for a Proportion (EBP)</term>
    <meaning id="id14691823">
      The margin of error. It depends on the confidence level, sample size, and the estimated (from the sample) proportion of successes.
    </meaning>
  </definition>

 <definition id="event">
    <term>Event</term>
    <meaning id="id19906185">
     A subset in the set of all outcomes of an experiment. The set of all outcomes of an experiment is called a <emphasis>sample space</emphasis> and denoted usually by S. An event is any arbitrary subset in <emphasis>S</emphasis>. It can contain one outcome, two outcomes, no outcomes (empty subset), the entire sample space, etc. Standard notations for events are capital letters such as A, B, C, etc. 
    </meaning>
  </definition>



 <definition id="expectedv">
    <term>Expected Value</term>
    <meaning id="id17662938">
     The arithmetic average when an experiment is repeated many times. The Expected Value is called the long-term mean or average. Notation: 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mi>E</m:mi><m:mo stretchy="false">(</m:mo><m:mi>x</m:mi><m:mo stretchy="false">)</m:mo><m:mi>,</m:mi><m:mi>μ</m:mi></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{E \( x \) ,μ} {}</m:annotation></m:semantics></m:math>.  
For a discrete random variable (RV) with probability distribution function
 
<m:math>
<m:mi>P</m:mi><m:mo>(</m:mo><m:mi>X</m:mi><m:mo>=</m:mo><m:mi>x</m:mi><m:mo>)</m:mo>
</m:math>,  

the definition also can be written in the form 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mi>E</m:mi><m:mo stretchy="false">(</m:mo><m:mi>X</m:mi><m:mrow><m:mrow><m:mo stretchy="false">)</m:mo><m:mo stretchy="false">=</m:mo><m:mi>μ</m:mi></m:mrow><m:mo stretchy="false">=</m:mo><m:mrow><m:mo stretchy="false">∑</m:mo><m:mrow><m:mstyle fontstyle="italic"><m:mrow><m:mtext>xP</m:mtext></m:mrow></m:mstyle><m:mo stretchy="false">(</m:mo><m:mi>x</m:mi><m:mo stretchy="false">)</m:mo></m:mrow></m:mrow></m:mrow></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{E \( X \) =μ= Sum { ital "xP" \( x \) } } {}</m:annotation></m:semantics></m:math>.
    </meaning>
  </definition>


 <definition id="expdist">
    <term>Exponential Distribution</term>
    <meaning id="id13301408">
     A continuous random variable (RV) that appears when we are interested in the intervals of time between some random events, for example, the length of time between emergency arrivals at a hospital. Notation: 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mi>X</m:mi><m:mtext>~</m:mtext><m:mstyle fontstyle="italic"><m:mrow><m:mtext>Exp</m:mtext></m:mrow></m:mstyle><m:mo stretchy="false">(</m:mo><m:mi>m</m:mi><m:mo stretchy="false">)</m:mo></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{X "~"  ital "Exp" \( m \) } {}</m:annotation></m:semantics></m:math>. The mean is 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mi>μ</m:mi><m:mo stretchy="false">=</m:mo><m:mfrac><m:mn>1</m:mn><m:mi>m</m:mi></m:mfrac></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{μ= {  {1}  over  {m} } } {}</m:annotation></m:semantics></m:math> and the standard deviation is  
<m:math> 
    <m:mi>σ</m:mi>
  <m:mo>=</m:mo>
  <m:mfrac>
    <m:mn>1</m:mn>
      <m:mi>m</m:mi>    
  </m:mfrac></m:math>. The probability density function is 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mi>f</m:mi><m:mo stretchy="false">(</m:mo><m:mi>x</m:mi><m:mrow><m:mo stretchy="false">)</m:mo><m:mo stretchy="false">=</m:mo><m:mstyle fontstyle="italic"><m:mrow><m:msup><m:mtext>me</m:mtext><m:mstyle fontsize="8pt"><m:mrow><m:mrow><m:mo stretchy="false">−</m:mo><m:mstyle fontstyle="italic"><m:mrow><m:mtext>mx</m:mtext></m:mrow></m:mstyle></m:mrow></m:mrow></m:mstyle></m:msup></m:mrow></m:mstyle></m:mrow><m:mi>,</m:mi><m:mtext/></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{f \( x \) = ital "me" rSup { size 8{- ital "mx"} } ,"  "} {}</m:annotation></m:semantics></m:math>  <m:math><m:mrow>
    <m:mi>x</m:mi>
    <m:mo>≥</m:mo>
    <m:mn>0</m:mn>
  </m:mrow></m:math> and the cumulative distribution function is 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mi>P</m:mi><m:mo stretchy="false">(</m:mo><m:mrow><m:mi>X</m:mi><m:mo stretchy="false">≤</m:mo><m:mi>x</m:mi></m:mrow><m:mrow><m:mo stretchy="false">)</m:mo><m:mo stretchy="false">=</m:mo><m:mrow><m:mn>1</m:mn><m:mo stretchy="false">−</m:mo><m:msup><m:mi>e</m:mi><m:mstyle fontsize="8pt"><m:mrow><m:mrow><m:mo stretchy="false">−</m:mo><m:mstyle fontstyle="italic"><m:mrow><m:mtext>mx</m:mtext></m:mrow></m:mstyle></m:mrow></m:mrow></m:mstyle></m:msup></m:mrow></m:mrow></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{P \( X &lt;= x \) =1-e rSup { size 8{- ital "mx"} } } {}</m:annotation></m:semantics></m:math>.
    </meaning>
  </definition>


 <definition id="experiment">
    <term>Experiment</term>
    <meaning id="id18789021">
  A planned activity carried out under controlled conditions.
    </meaning>
  </definition>


 <definition id="fDistribution">
    <term>F Distribution</term>
    <meaning id="id14676234">
      Developed by Sir Ronald Fisher. The F Distribution has the following characteristics:

<list id="fdistlist" list-type="bulleted">
<item>The random variable (RV) is a ratio (called the F-ratio) of two sums of weighted squares. It is continuous and takes on only nonnegative value. </item><item>The pdf is positively skewed (skewed to the right).</item>
<item>There is a "family" of F distributions.</item></list> 
Every representative of the family is defined by 2 parameters: the number of degrees of freedom for the numerator in the F-ratio and the number of degrees of freedom in the denominator in the F-ratio. The F Distribution is used to  test of 2 population variances and in ANOVA hypothesis tests.
    </meaning>
  </definition>


 <definition id="freqdist">
    <term>Frequency Distribution</term>
    <meaning id="id15225997">
    A grouping of data into mutually exclusive classes showing the number of outcomes in each class.
    </meaning>
  </definition>


 <definition id="freq">
    <term>Frequency</term>
    <meaning id="id19849552">
   The number of times a value of the data occurs.
    </meaning>
  </definition>

 <definition id="geodist">
    <term>Geometric Distribution</term>
    <meaning id="id14905098">
    A discrete random variable (RV) which arises from the Bernoulli trials. The trials are repeated until the first success. The geometric variable <m:math><m:mi>X</m:mi></m:math> is defined as the number of trials until the first success. Notation: <emphasis><m:math><m:mi>X</m:mi></m:math>∼

<m:math><m:mi>G</m:mi>
  <m:mo>(</m:mo>
  <m:mi>p</m:mi>
  <m:mo>)</m:mo></m:math></emphasis>.  

The mean is 
<m:math><m:mi>μ</m:mi>
  <m:mo>=</m:mo>
  <m:mfrac>
    <m:mn>1</m:mn>
    <m:mi>p</m:mi>
  </m:mfrac></m:math> and 

the standard deviation is 
<m:math><m:mi>σ</m:mi><m:mo>=</m:mo><m:msqrt><m:mrow><m:mfrac><m:mn>1</m:mn><m:mi>p</m:mi></m:mfrac><m:mo>⋅</m:mo><m:mo>(</m:mo></m:mrow><m:mrow><m:mfrac><m:mn>1</m:mn><m:mi>p</m:mi></m:mfrac><m:mo>−</m:mo><m:mn>1</m:mn></m:mrow><m:mo>)</m:mo></m:msqrt></m:math> 


The probability of exactly x failures before the first success is given by the formula: 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mi>P</m:mi><m:mo stretchy="false">(</m:mo><m:mrow><m:mi>X</m:mi><m:mo stretchy="false">=</m:mo><m:mi>x</m:mi></m:mrow><m:mrow><m:mo stretchy="false">)</m:mo><m:mo stretchy="false">=</m:mo><m:mi>p</m:mi></m:mrow><m:mo stretchy="false">(</m:mo><m:mrow><m:mn>1</m:mn><m:mo stretchy="false">−</m:mo><m:mi>p</m:mi></m:mrow><m:msup><m:mo stretchy="false">)</m:mo><m:mstyle fontsize="8pt"><m:mrow><m:mrow><m:mi>x</m:mi><m:mo stretchy="false">−</m:mo><m:mn>1</m:mn></m:mrow></m:mrow></m:mstyle></m:msup></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{P \( X=x \) =p \( 1 - p \)  rSup { size 8{x - 1} } } {}</m:annotation></m:semantics></m:math>.
    </meaning>
  </definition>



 <definition id="hpygeoprob">
    <term>Hypergeometric Distribution</term>
    <meaning id="id19874231">
   A discrete random variable (RV) that is characterized by 
<list id="hyp1" list-type="bulleted">
<item>A fixed number of trials. </item>
<item> The probability of success is not the same from trial to trial.</item>
</list>
We sample from two groups of items when we are interested in only one group. 
<m:math><m:mi>X</m:mi></m:math> is defined as the number of successes out of the total number chosen. Notation: 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mi>X</m:mi><m:mtext>~</m:mtext><m:mi>H</m:mi><m:mo stretchy="false">(</m:mo><m:mi>r</m:mi><m:mi>,</m:mi><m:mi>b</m:mi><m:mi>,</m:mi><m:mi>n</m:mi><m:mo stretchy="false">)</m:mo><m:mtext>.</m:mtext></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{X "~" H \( r,b,n \)} {}</m:annotation></m:semantics></m:math>, where <m:math><m:mi>r</m:mi></m:math> = the number of items in the group of interest, <m:math><m:mi>b</m:mi></m:math> = the number of items in the group not of interest, and <m:math><m:mi>n</m:mi></m:math> = the number of items chosen. 
    </meaning>
  </definition>


 <definition id="hypotest">
    <term>Hypothesis Testing</term>
    <meaning id="id19136681">
   Based on sample evidence, hypothesis testing is a procedure that determines  whether the null hypothesis is a reasonable statement and cannot be rejected, or is unreasonable and should be rejected.
    </meaning>
  </definition>


 <definition id="hypothesis">
    <term>Hypothesis</term>
    <meaning id="id15317266">
   A statement about the value of a population parameter. In case of two hypotheses, the statement assumed to be true is called the null hypothesis (notation 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:msub><m:mi>H</m:mi><m:mstyle fontsize="8pt"><m:mrow><m:mn>0</m:mn></m:mrow></m:mstyle></m:msub></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{H rSub { size 8{0} } } {}</m:annotation></m:semantics></m:math>) and the contradictory statement is called the alternate hypothesis (notation 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:msub><m:mi>H</m:mi><m:mstyle fontsize="8pt"><m:mrow><m:mi>a</m:mi></m:mrow></m:mstyle></m:msub></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{H rSub { size 8{a} } } {}</m:annotation></m:semantics></m:math>).
    </meaning>
  </definition>


 <definition id="indevents">
    <term>Independent Events</term>
    <meaning id="id19874035">
   The occurrence of one event has no effect on the probability of the occurrence of any other event. Events A and B are independent if any of the following is true: 
<list id="fs-id19374549">
<item><m:math><m:mi>P</m:mi>
  <m:mo>(</m:mo>
    <m:mi>A</m:mi>
    <m:mo>|</m:mo>
  <m:mi>B</m:mi>
  <m:mo>)</m:mo>
  <m:mo>=</m:mo>
  <m:mi>P</m:mi>
  <m:mo>(</m:mo>
  <m:mi>A</m:mi>
  <m:mo>)</m:mo>
  </m:math></item> 
<item><m:math><m:mi>P</m:mi>
  <m:mo>(</m:mo>
    <m:mi>B</m:mi>
    <m:mo>|</m:mo>
  <m:mi>A</m:mi>
  <m:mo>)</m:mo>
  <m:mo>=</m:mo>
  <m:mi>P</m:mi>
  <m:mo>(</m:mo>
  <m:mi>B</m:mi>
  <m:mo>)</m:mo>
  </m:math></item>
<item><m:math><m:mi>P</m:mi>
  <m:mo>(</m:mo>
  <m:mi>A</m:mi>
  <m:mi>and</m:mi>
  <m:mi>B</m:mi>
  <m:mo>)</m:mo>
  <m:mo>=</m:mo>
  <m:mi>P</m:mi>
  <m:mo>(</m:mo>
  <m:mi>A</m:mi>
  <m:mo>)</m:mo>
  <m:mi>P</m:mi>
  <m:mo>(</m:mo>
  <m:mi>B</m:mi>
  <m:mo>)</m:mo></m:math></item></list>.
    </meaning>
  </definition>


 <definition id="indsamp">
    <term>Independent Samples</term>
    <meaning id="id20017281">
   Samples that are not related in any way.
    </meaning>
  </definition>


 <definition id="infrstats">
    <term>Inferential Statistics </term>
    <meaning id="id20359958">
   Also called statistical inference or inductive statistics. This facet of statistics deals with estimating a population parameter based on a sample statistic. For example, if 4 out of the 100 calculators sampled are defective we might infer that 4 percent of the production is defective.
    </meaning>
  </definition>


 <definition id="iqr">
    <term>Interquartile Range (IRQ)</term>
    <meaning id="id15896860">
   The distance between the third quartile (Q3) and the first quartile (Q1). IQR = Q3 - Q1.
    </meaning>
  </definition>


 <definition id="intest">
    <term>Interval Estimate</term>
    <meaning id="id15325501">
   Based on sample information, an Interval Estimate is an interval of numbers that may contain a population parameter.
    </meaning>
  </definition>


 <definition id="signtest">
    <term>Level of Significance of the Test </term>
    <meaning id="id18705874">
   Probability of a Type I error (reject the null hypothesis when it is true).  Notation: <m:math><m:mi>α</m:mi></m:math>.  In hypothesis testing, the Level of Significance is called the preconceived <m:math><m:mi>α</m:mi></m:math> or the preset <m:math><m:mi>α</m:mi></m:math>.
    </meaning>
  </definition>


 <definition id="linregress">
    <term>Linear Regression Equation </term>
    <meaning id="id14982809">
   A linear equation in the form <m:math>  <m:mover>
<m:mi>y</m:mi>
<m:mo>^</m:mo>
</m:mover>
  <m:mo>=</m:mo>
  <m:mi>a</m:mi>
  <m:mo>+</m:mo>
  <m:mi>bx</m:mi></m:math>, that defines the relationship between two variables. It is used to predict the dependent variable <m:math><m:mi>y</m:mi></m:math> based on a selected value of independent variable <m:math><m:mi>x</m:mi></m:math>.
    </meaning>
  </definition>


 <definition id="mean">
    <term>Mean</term>
    <meaning id="id10578364">
   A number that measures the central tendency.  A common name for mean is  'average.'  The term 'mean' is a shortened form of 'arithmetic mean.' By definition, the mean for a sample (denoted by 
<m:math>
<m:apply>
  <m:conjugate/>
  <m:ci>x</m:ci>
</m:apply></m:math>) is 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow>

<m:apply>
  <m:conjugate/>
  <m:ci>x</m:ci>
</m:apply>


<m:mo>=</m:mo>

<m:mfrac><m:mtext>Sum of all values in the sample</m:mtext><m:mtext>Number of values in the sample</m:mtext></m:mfrac></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{ { bar  {X}}= {  {"Sum of all values in the sample"}  over  {"Number of values in the sample"} } } {}</m:annotation></m:semantics></m:math>, 

and the mean for a population (denoted by 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mi>μ</m:mi></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{m} {}</m:annotation></m:semantics></m:math>) is 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mi>μ</m:mi><m:mo stretchy="false">=</m:mo><m:mfrac><m:mtext>Sum of all values in the population</m:mtext><m:mtext>Number of values in the population</m:mtext></m:mfrac></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{m= {  {"Sum of all values in the population"}  over  {"Number of values in the population"} } } {}</m:annotation></m:semantics></m:math>.
    </meaning>
  </definition>







 <definition id="median">
    <term>Median</term>
    <meaning id="id19912305">
   A number that separates ordered data into halves.  Half the values are the same number or smaller than the median and half the values are the same number or larger than the median. The median may or may not be part of the data.
    </meaning>
  </definition>


 <definition id="mode">
    <term>Mode</term>
    <meaning id="id20331319">
   The value that appears most frequently in a set of data.
    </meaning>
  </definition>

 <definition id="multirule">
    <term>Multiplication Rule</term>
    <meaning id="id20494460">
   For any events A and B in the sample space, <m:math>
        <m:semantics>
          <m:mrow/>
          <m:annotation encoding="StarMath 5.0">{}</m:annotation>
        </m:semantics>
      </m:math>
      <m:math>
        <m:semantics>
          <m:mrow>
            <m:mstyle fontsize="12pt">
              <m:mrow>
                <m:mrow>
                  <m:mi>P</m:mi>
                  <m:mo stretchy="false">(</m:mo>
                  <m:mi>A</m:mi>
                  <m:mstyle fontweight="bold">
                    <m:mrow>
                      <m:mtext>and</m:mtext>
                    </m:mrow>
                  </m:mstyle>
                  <m:mi>B</m:mi>
                  <m:mrow>
                    <m:mo stretchy="false">)</m:mo>
                    <m:mo stretchy="false">=</m:mo>
                    <m:mi>P</m:mi>
                  </m:mrow>
                  <m:mo stretchy="false">(</m:mo>
                  <m:mi>A</m:mi>
                  <m:mo stretchy="false">∣</m:mo>
                  <m:mi>B</m:mi>
                  <m:mrow>
                    <m:mo stretchy="false">)</m:mo>
                    <m:mo stretchy="false">⋅</m:mo>
                    <m:mi>P</m:mi>
                  </m:mrow>
                  <m:mo stretchy="false">(</m:mo>
                  <m:mi>B</m:mi>
                  <m:mrow>
                    <m:mo stretchy="false">)</m:mo>
                    <m:mo stretchy="false">=</m:mo>
                    <m:mi>P</m:mi>
                  </m:mrow>
                  <m:mo stretchy="false">(</m:mo>
                  <m:mi>B</m:mi>
                  <m:mo stretchy="false">∣</m:mo>
                  <m:mi>A</m:mi>
                  <m:mrow>
                    <m:mo stretchy="false">)</m:mo>
                    <m:mo stretchy="false">⋅</m:mo>
                    <m:mi>P</m:mi>
                  </m:mrow>
                  <m:mo stretchy="false">(</m:mo>
                  <m:mi>A</m:mi>
                  <m:mo stretchy="false">)</m:mo>
                  <m:mtext>.</m:mtext>
                </m:mrow>
              </m:mrow>
            </m:mstyle>
            <m:mrow/>
          </m:mrow>
          <m:annotation encoding="StarMath 5.0"> size 12{P \( A bold "and"B \) =P \( A \lline B \)  cdot P \( B \) =P \( B \lline A \)  cdot P \( A \)  "." } {}</m:annotation>
        </m:semantics>
      </m:math>
    </meaning>
  </definition>


<definition id="mutex">
    <term>Mutually Exclusive</term>
    <meaning id="id17428187">
   An observation cannot fall into more than one class (category). Being in one category prevents being in a mutually exclusive category.
    </meaning>
  </definition>

 

<definition id="normdist">
    <term>Normal Distribution</term>
    <meaning id="id8261470">
   A continuous random variable (RV) with pdf    
<m:math>
<m:mi>f(x)</m:mi><m:mo>=</m:mo><m:mfrac>
   <m:mn>1</m:mn>
   <m:mrow>
     <m:mi>σ</m:mi>
     <m:msqrt>
       <m:mn>2</m:mn><m:mi>π</m:mi>
     </m:msqrt>
   </m:mrow>
</m:mfrac>


<m:msup>
  <m:mi>e</m:mi>
  <m:mrow>
   <m:mfrac>
     <m:mrow>
        <m:msup>
           <m:mrow>
              <m:mo>-</m:mo><m:mo>(</m:mo><m:mi>x</m:mi><m:mo>-</m:mo><m:mi>μ</m:mi><m:mo>)</m:mo>
           </m:mrow>
           <m:mn>2</m:mn>
        </m:msup>     
     </m:mrow>
     <m:mrow>
        <m:mn>2</m:mn>
        <m:msup>
           <m:mi>σ</m:mi>
           <m:mn>2</m:mn>
        </m:msup>
     </m:mrow>
   </m:mfrac>
  </m:mrow>
</m:msup>

</m:math> where <m:math><m:mi>μ</m:mi></m:math>  is the mean of the distribution and <m:math><m:mi>σ</m:mi></m:math>  is the standard deviation. Notation: <m:math><m:mi>X</m:mi></m:math>  ~  <m:math> <m:mi>N</m:mi>
  <m:mfenced>
    <m:mi>μ</m:mi>
    
    <m:mi>σ</m:mi>
      
  </m:mfenced></m:math>. If <m:math><m:mi>μ</m:mi><m:mo>=</m:mo><m:mn>0</m:mn></m:math> and <m:math><m:mi>σ</m:mi><m:mo>=</m:mo><m:mn>1</m:mn></m:math>, the RV is called <emphasis> the standard normal distribution</emphasis>.
    </meaning>
  </definition>

 

<definition id="onetailtest">
    <term>One-Tailed Test</term>
    <meaning id="id17334142">
   Used when the alternate hypothesis states a direction. The rejection region is in one tail.  Example:   

<m:math><m:mrow><m:msub><m:mi>H</m:mi><m:mrow><m:mi>a</m:mi></m:mrow></m:msub></m:mrow></m:math>:<m:math><m:mi>μ</m:mi></m:math> &gt; <m:math><m:mi>40</m:mi></m:math> with the rejection region in the right tail.
    </meaning>
  </definition>


<definition id="or">
    <term>OR</term>
    <meaning id="id17747276">
   Logical operation over the subsets of a set. In statistics, if <m:math><m:mi>A</m:mi></m:math> and <m:math><m:mi>B</m:mi></m:math> are any two events (subsets in the sample space), then the event “<m:math><m:mi>A</m:mi></m:math> <emphasis>or</emphasis> <m:math><m:mi>B</m:mi></m:math>” consists of all outcomes that are in <m:math><m:mi>A</m:mi></m:math>, or in <m:math><m:mi>B</m:mi></m:math>, or in both <m:math><m:mi>A</m:mi></m:math> and <m:math><m:mi>B</m:mi></m:math>.
    </meaning>
  </definition>

 

<definition id="outcome">
    <term>Outcome (observation)</term>
    <meaning id="id19619068">
   A particular result of an experiment.
    </meaning>
  </definition>

 

<definition id="outlier">
    <term>Outlier</term>
    <meaning id="id18587788">
   An observation that does not fit the rest of the data.
    </meaning>
  </definition>

 

<definition id="parameter">
    <term>Parameter</term>
    <meaning id="id18035004">
   A numerical characteristic of the population. 
    </meaning>
<example id="param1"><para id="param2">The mean price to rent a 1-bedroom apartment in California.</para></example>

  </definition>

 
<definition id="pdf">
    <term>pdf</term>
    <meaning id="id10630548">
   see <term target-id="pdffn"> Probability Density Function</term>
    </meaning>
  </definition>

 
<definition id="pdf2">
    <term>PDF</term>
    <meaning id="id14315812">
   see <term target-id="pdfelab"> Probability Distribution Function</term>
    </meaning>
  </definition>

<definition id="percentile">
    <term>Percentile</term>
    <meaning id="id19436015">
 A number that divides ordered data into hundredths.</meaning>

<example id="prtil1"><para id="prtil2">
Let a data set contain 200 ordered observations starting with 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mo stretchy="false">{</m:mo><m:mrow><m:mn>2</m:mn><m:mtext>.</m:mtext><m:mn>3,2</m:mn><m:mtext>.</m:mtext><m:mn>7,2</m:mn><m:mtext>.</m:mtext><m:mn>8,2</m:mn><m:mtext>.</m:mtext><m:mn>9,2</m:mn><m:mtext>.</m:mtext><m:mn>9,3</m:mn><m:mtext>.</m:mtext><m:mn>0</m:mn><m:mtext>.</m:mtext><m:mtext>.</m:mtext><m:mtext>.</m:mtext></m:mrow><m:mo stretchy="false">}</m:mo></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{ lbrace 2 "." 3,2 "." 7,2 "." 8,2 "." 9,2 "." 9,3 "." 0 "."  "."  "."  rbrace } {}</m:annotation></m:semantics></m:math>. Then the first percentile is 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mrow><m:mfrac><m:mrow><m:mo stretchy="false">(</m:mo><m:mn>2</m:mn><m:mtext>.</m:mtext><m:mrow><m:mn>7</m:mn><m:mo stretchy="false">+</m:mo><m:mn>2</m:mn></m:mrow><m:mtext>.</m:mtext><m:mn>8</m:mn><m:mo stretchy="false">)</m:mo></m:mrow><m:mn>2</m:mn></m:mfrac><m:mo stretchy="false">=</m:mo><m:mn>2</m:mn></m:mrow><m:mtext>.</m:mtext><m:mtext>75</m:mtext></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{ {  { \( 2 "." 7+2 "." 8 \) }  over  {2} } =2 "." "75"} {}</m:annotation></m:semantics></m:math>, because 1% of the data is to the left of this point on the number line and 99% of the data is on its right. The second percentile is 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mrow><m:mfrac><m:mrow><m:mo stretchy="false">(</m:mo><m:mn>2</m:mn><m:mtext>.</m:mtext><m:mrow><m:mn>9</m:mn><m:mo stretchy="false">+</m:mo><m:mn>2</m:mn></m:mrow><m:mtext>.</m:mtext><m:mn>9</m:mn><m:mo stretchy="false">)</m:mo></m:mrow><m:mn>2</m:mn></m:mfrac><m:mo stretchy="false">=</m:mo><m:mn>2</m:mn></m:mrow><m:mtext>.</m:mtext><m:mn>9</m:mn></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{ {  { \( 2 "." 9+2 "." 9 \) }  over  {2} } =2 "." 9} {}</m:annotation></m:semantics></m:math>. Percentiles may or may not be part of the data. In this example, the first percentile is not in the data, but the second percentile is. The median of the data is the second quartile and the 50th percentile. The first and third quartiles are the 25th and the 75th percentiles, respectively.
    </para></example>


  </definition>

 
<definition id="pointest">
    <term>Point Estimate</term>
    <meaning id="id20057179">
 A single number computed from a sample and used to estimate a population parameter. 
    </meaning>
  </definition>

 
<definition id="poisson">
    <term>Poisson Distribution</term>
    <meaning id="id14162739">
 A discrete random variable (RV) is the number of times a certain event will occur in a specific interval. Characteristics of the variable: 

<list id="fs-id44682301">
<item>The probability that the event occurs in a given interval is the same for all intervals.</item>
<item>The events occur with a known mean and independently of the time since the last event.</item>
</list>

The distribution is defined by the mean <m:math><m:mi>μ</m:mi></m:math> of the event in the interval. Notation:   
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mi>X</m:mi><m:mtext>~</m:mtext><m:mi>P</m:mi><m:mo stretchy="false">(</m:mo><m:mi>μ</m:mi><m:mo stretchy="false">)</m:mo></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{X "~" P \( μ \) } {}</m:annotation></m:semantics></m:math>. 
The mean is 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mi>μ</m:mi><m:mo stretchy="false">=</m:mo><m:mstyle fontstyle="italic"><m:mrow><m:mtext>np</m:mtext></m:mrow></m:mstyle></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{μ= ital "np"} {}</m:annotation></m:semantics></m:math>.  
The standard deviation is 
<m:math>
  <m:mi>σ</m:mi>
  <m:mo>=</m:mo>
    <m:mi>μ</m:mi>
</m:math>.  
The probability of having exactly <m:math><m:mi>x</m:mi></m:math>  successes in <m:math><m:mi>r</m:mi></m:math>  trials is 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mi>P</m:mi><m:mo stretchy="false">(</m:mo><m:mrow><m:mi>X</m:mi><m:mo stretchy="false">=</m:mo><m:mi>x</m:mi></m:mrow><m:mrow><m:mo stretchy="false">)</m:mo><m:mo stretchy="false">=</m:mo><m:msup><m:mi>e</m:mi><m:mstyle fontsize="8pt"><m:mrow><m:mrow><m:mo stretchy="false">−</m:mo><m:mi>μ</m:mi></m:mrow></m:mrow></m:mstyle></m:msup></m:mrow><m:mfrac><m:msup><m:mi>μ</m:mi><m:mstyle fontsize="8pt"><m:mrow><m:mi>x</m:mi></m:mrow></m:mstyle></m:msup><m:mrow><m:mi>x</m:mi><m:mi>!</m:mi></m:mrow></m:mfrac></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{P \( X=x \) =e rSup { size 8{ - μ} }  {  {μ rSup { size 8{x} } }  over  {x!} } } {}</m:annotation></m:semantics></m:math>. 

The Poisson distribution is often used to approximate the binomial distribution when <m:math><m:mi>n</m:mi></m:math> is “large” and <m:math><m:mi>p</m:mi></m:math> is “small” (a general rule is that <m:math><m:mi>n</m:mi></m:math> should be greater than or equal to 20 and <m:math><m:mi>p</m:mi></m:math>    should be less than or equal to .05).
    </meaning>
  </definition>

 
<definition id="population">
    <term>Population</term>
    <meaning id="id16187942">
 The collection, or set, of all individuals, objects, or measurements whose properties are being studied.
    </meaning>
  </definition>

 
<definition id="prealpha">
    <term>Preconceived <m:math><m:ci>α</m:ci></m:math></term>
    <meaning id="id15924482">
The probability of rejecting the null hypothesis when the null hypothesis is true (<m:math><m:ci>α</m:ci></m:math> is equal to the probability of a Type I error). <m:math><m:ci>α</m:ci></m:math> is called the <emphasis>level of significance of the test</emphasis>.  Also called the preset <m:math><m:ci>α</m:ci></m:math>.
    </meaning>
  </definition>

 
<definition id="pdffn">
    <term>Probability Density Function (pdf)</term>
    <meaning id="id17850687">
 A mathematical description of a continuous random variable (RV). For any specific value <m:math><m:mi>x</m:mi></m:math>,  
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mi>P</m:mi><m:mo stretchy="false">(</m:mo><m:mrow><m:mi>X</m:mi><m:mo stretchy="false">=</m:mo><m:mi>x</m:mi></m:mrow><m:mrow><m:mo stretchy="false">)</m:mo><m:mo stretchy="false">=</m:mo><m:mn>0</m:mn></m:mrow></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{P \( X=x \) =0} {}</m:annotation></m:semantics></m:math>.  

By definition, the <m:math><m:mi>pdf</m:mi></m:math> is any positive function <m:math><m:mi>f(x)</m:mi></m:math> over the real numbers such that the area bounded above by <m:math><m:mi>f(x)</m:mi></m:math>, below by the <m:math><m:mi>x-axis</m:mi></m:math> and from the right by a vertical line 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mi>X</m:mi><m:mo stretchy="false">=</m:mo><m:mi>x</m:mi></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{X=x} {}</m:annotation></m:semantics></m:math>  is equal to the probability  
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mi>P</m:mi><m:mo stretchy="false">(</m:mo><m:mrow><m:mi>X</m:mi><m:mo stretchy="false">≤</m:mo><m:mi>x</m:mi></m:mrow><m:mo stretchy="false">)</m:mo></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{P \( X &lt;= x \) } {}</m:annotation></m:semantics></m:math>.
    </meaning>
  </definition>



 <definition id="pdfelab">
    <term>Probability Distribution Function (PDF)</term>
    <meaning id="id20054675">
   A mathematical description of a discrete random variable (RV), given either in the form of an equation (by formula) or in the form of a table listing all the possible outcomes of an experiment and the probability associated with each outcome. 
</meaning>

<example id="pdfer1"><para id="pdfer2">
A biased coin with probability 0.7 of heads is tossed 5 times. We are interested in the number of heads (<m:math><m:mi>X</m:mi></m:math>  = the number of heads). <m:math><m:mi>X</m:mi></m:math>  is Binomial: <m:math><m:mi>X</m:mi></m:math>∼<m:math><m:mi>B</m:mi>
  <m:mfenced>
    <m:mn>5</m:mn>
    <m:mrow>
      <m:mo>.</m:mo>
      <m:mn>7</m:mn>
    </m:mrow>
  </m:mfenced></m:math>.   
<m:math>   <m:mi>P</m:mi>
  <m:mo>(</m:mo>
  <m:mi>X</m:mi>
  <m:mo>=</m:mo>
  <m:mi>x</m:mi>
  <m:mo>)</m:mo>
  <m:mo>=</m:mo></m:math> <m:math>  <m:mfenced>
    <m:mtable>
      <m:mtr>
        <m:mtd>
          <m:mn>5</m:mn>
        </m:mtd>
      </m:mtr>
      <m:mtr>
        <m:mtd>
          <m:mi>x</m:mi>
        </m:mtd>
      </m:mtr>
    </m:mtable>
  </m:mfenced>
  <m:msup>
    <m:mrow>
      <m:mo>.</m:mo>
      <m:mn>7</m:mn>
    </m:mrow>
    <m:mi>x</m:mi>
  </m:msup>
  <m:msup>
    <m:mrow>
      <m:mo>.</m:mo>
      <m:mn>3</m:mn>
    </m:mrow>
    <m:mrow>
      <m:mn>5</m:mn>
      <m:mo>−</m:mo>
      <m:mi>x</m:mi>
    </m:mrow>
  </m:msup></m:math> or in the form of the table.
<table id="id4500004" frame="none" summary="">
      <tgroup cols="2" colsep="1" rowsep="0">
        <colspec colnum="1" colname="c1"/>
        <colspec colnum="2" colname="c2"/>
<thead>        
  <row rowsep="1">
            <entry><m:math><m:mi>x</m:mi></m:math> </entry>
            <entry><m:math>   <m:mi>P</m:mi>
  <m:mo>(</m:mo>
  <m:mi>X</m:mi>
  <m:mo>=</m:mo>
  <m:mi>x</m:mi>
  <m:mo>)</m:mo>
</m:math></entry>
          </row>
</thead>        
<tbody>

          <row>
            <entry>0</entry>
            <entry>0.0024</entry>
          </row>
          <row>
            <entry>1</entry>
            <entry>0.0284</entry>
          </row>
          <row>
            <entry>2</entry>
            <entry>0.1323</entry>
          </row>
          <row>
            <entry>3</entry>
            <entry>0.3087</entry>
          </row>
          <row>
            <entry>4</entry>
            <entry>0.3602</entry>
          </row>
          <row>
            <entry>5</entry>
            <entry>0.1681</entry>
          </row>
        </tbody>
      </tgroup>
    </table> 
    </para></example>
  </definition>

<definition id="probdistr">
    <term>Probability Distribution</term>
    <meaning id="id19003516">
The common name for <emphasis>Probability Density Function (pdf)</emphasis> and <emphasis>Probability Distribution Function (PDF)</emphasis>.
    </meaning>
  </definition>

<definition id="prob">
    <term>Probability</term>
    <meaning id="id17934331">
A number between 0 and 1, inclusive, that gives the likelihood that a specific event will occur. The foundation of statistics is given by the following 3 axioms (by A. N. Kolmogorov, 1930’s): Let <m:math><m:mi>S</m:mi></m:math>  denote the sample space and <m:math><m:mi>A</m:mi></m:math>  and <m:math><m:mi>B</m:mi></m:math>  are two events in <m:math><m:mi>S</m:mi></m:math> . Then:

<list id="fs-id6987848"> 
<item><m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mrow><m:mn>0</m:mn><m:mo stretchy="false">≤</m:mo><m:mi>P</m:mi></m:mrow><m:mo stretchy="false">(</m:mo><m:mi>A</m:mi><m:mrow><m:mo stretchy="false">)</m:mo><m:mo stretchy="false">≤</m:mo><m:mn>1</m:mn></m:mrow></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{0 &lt;= P \( A \)  &lt;= 1;} {}</m:annotation></m:semantics></m:math>.</item> 
<item>If <m:math><m:mi>A</m:mi></m:math>  and <m:math><m:mi>B</m:mi></m:math>  are any two mutually exclusive events, then <m:math>  <m:mi>P</m:mi>
  <m:mo>(</m:mo>
  <m:mi>A</m:mi>
  <m:mi>or</m:mi>
  <m:mi>B</m:mi>
  <m:mo>)</m:mo>
  <m:mo>=</m:mo>
  <m:mi>P</m:mi>
  <m:mo>(</m:mo>
  <m:mi>A</m:mi>
  <m:mo>)</m:mo>
  <m:mo>+</m:mo>
  <m:mi>P</m:mi>
  <m:mo>(</m:mo>
  <m:mi>B</m:mi>
  <m:mo>)</m:mo>
  </m:math>.</item>
<item><m:math><m:mi>P</m:mi>
  <m:mo>(</m:mo>
  <m:mi>S</m:mi>
  <m:mo>)</m:mo>
  <m:mo>=</m:mo>
  <m:mn>1</m:mn></m:math>.</item>
</list>
    </meaning>
  </definition>

<definition id="proportion">
    <term>Proportion</term>
    <meaning id="id15701010">

<list id="fs-id14984286">
<item>As a number: A proportion is the number of successes divided by the total number in the sample.</item>
<item>As a probability distribution:  Given a binomial random variable (RV), <m:math><m:mi>X</m:mi></m:math> ∼<m:math>  <m:mi>B</m:mi>
  <m:mfenced>
    <m:mi>n</m:mi>
    <m:mi>p</m:mi>
  </m:mfenced></m:math>, consider the ratio of the number <m:math><m:mi>X</m:mi></m:math> of successes in <m:math><m:mi>n</m:mi></m:math> Bernouli trials to the number <m:math><m:mi>n</m:mi></m:math> of trials.  <m:math><m:mi>P</m:mi>
  <m:mo>'</m:mo>
  <m:mo>=</m:mo>
  <m:mfrac>
    <m:mi>X</m:mi>
    <m:mi>n</m:mi>
  </m:mfrac></m:math>. This new RV is called a proportion, and if the number of trials, <m:math><m:mi>n</m:mi></m:math>, is large enough, <m:math><m:mi>P'</m:mi></m:math> ∼<m:math> <m:mi>N</m:mi>
  <m:mfenced>
    <m:mi>p</m:mi>
    <m:mfrac>
      <m:mi>pq</m:mi>
      <m:mi>n</m:mi>
    </m:mfrac>
  </m:mfenced></m:math>.</item>
</list>
    </meaning>
  </definition>

<definition id="pvalue">
    <term>p-value</term>
    <meaning id="id17395082">
The probability that an event will happen purely by chance assuming the null hypothesis is true. The smaller the p-value, the stronger the evidence is against the null hypothesis.
    </meaning>
  </definition>

<definition id="qual">
    <term>Qualitative Data</term>
    <meaning id="id17543854">
see <term target-id="data">Data</term>.
    </meaning>
  </definition>

<definition id="quant">
   <term>Quantitative Data</term>
    <meaning id="id14440784">
 see <term target-id="data">Data</term>.
    </meaning>
  </definition>

<definition id="quartiles">
    <term>Quartiles</term>
    <meaning id="id14362350">
The numbers that separate the data into quarters. Quartiles may or may not be part of the data. The second quartile is the median of the data.
    </meaning>
  </definition>

<definition id="range">
    <term>Range</term>
    <meaning id="id15595848">
Difference between the highest and lowest values: Range = Highest value – Lowest value.
    </meaning>
  </definition>

<definition id="relfreq">
    <term>Relative Frequency</term>
    <meaning id="id5747717">
The ratio of the number of times a value of the data occurs in the set of all outcomes to the number of all outcomes.
    </meaning>
  </definition>

<definition id="randvar">
    <term>Random Variable (RV)</term>
    <meaning id="id16226227">
see <term target-id="variable">Variable</term>
    </meaning>
  </definition>


<definition id="samplesp">
    <term>Sample Space</term>
    <meaning id="id19986750">
The set of all possible outcomes of an experiment.
    </meaning>
  </definition>


<definition id="sample">
    <term>Sample</term>
    <meaning id="id16291948">
A portion of the population under study. A sample is representative if it characterizes the population being studied.
    </meaning>
  </definition>


<definition id="samperr">
    <term>Sample Error</term>
    <meaning id="id16105513">
The difference between a sample statistic and the corresponding population parameter that can be attributed to sampling (to chance).
    </meaning>
  </definition>

<definition id="sampling">
    <term>Sampling</term>
    <meaning id="id16575724">
A procedure for gathering information about the entire population by selecting only a portion of the population. The more popular random procedures are systematic sampling, simple random sampling, stratified sampling, and cluster sampling.
    </meaning>
  </definition>

<definition id="scattdia">
    <term>Scatter Diagram</term>
    <meaning id="id15642793">
A chart that visually depicts the relationship between two variables.
    </meaning>
  </definition>

<definition id="simrandsamp">
    <term>Simple Random Sampling</term>
    <meaning id="id17212595">
A sampling scheme in which every member of the population has the same chance of being selected.
    </meaning>
  </definition>

<definition id="spaddrule">
    <term>Special Rule for Addition</term>
    <meaning id="id16540492">
For this rule to apply the events must be mutually exclusive: 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mi>P</m:mi><m:mo stretchy="false">(</m:mo><m:mstyle fontstyle="italic"><m:mrow><m:mtext>AorB</m:mtext></m:mrow></m:mstyle><m:mrow><m:mo stretchy="false">)</m:mo><m:mo stretchy="false">=</m:mo><m:mi>P</m:mi></m:mrow><m:mo stretchy="false">(</m:mo><m:mi>A</m:mi><m:mrow><m:mo stretchy="false">)</m:mo><m:mo stretchy="false">+</m:mo><m:mi>P</m:mi></m:mrow><m:mo stretchy="false">(</m:mo><m:mi>B</m:mi><m:mo stretchy="false">)</m:mo></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{P \(  ital "AorB" \) =P \( A \) +P \( B \) } {}</m:annotation></m:semantics></m:math>.
    </meaning>
  </definition>

<definition id="spmultrule">
    <term>Special Rule for Multiplication</term>
    <meaning id="id19505681">
For this rule to apply the events must be independent:
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mi>P</m:mi><m:mo stretchy="false">(</m:mo><m:mstyle fontstyle="italic"><m:mrow><m:mtext>AandB</m:mtext></m:mrow></m:mstyle><m:mrow><m:mo stretchy="false">)</m:mo><m:mo stretchy="false">=</m:mo><m:mi>P</m:mi></m:mrow><m:mo stretchy="false">(</m:mo><m:mi>A</m:mi><m:mo stretchy="false">)</m:mo><m:mi>P</m:mi><m:mo stretchy="false">(</m:mo><m:mi>B</m:mi><m:mo stretchy="false">)</m:mo></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{P \(  ital "AandB" \) =P \( A \) P \( B \) } {}</m:annotation></m:semantics></m:math>.
    </meaning>
  </definition>

<definition id="stddev">
    <term>Standard Deviation</term>
    <meaning id="id20302532">
A number that is equal to the square root of the variance and measures how far data values are from their mean. Notation: s for sample standard deviation and   <m:math><m:ci>σ</m:ci></m:math>for population standard deviation.
    </meaning>
  </definition>

<definition id="stdmean">
    <term>Standard Error of the Mean</term>
    <meaning id="id18741589">
The standard deviation of the distribution of the sample means, 
<m:math>  <m:mfrac>
    <m:mi>σ</m:mi>
    <m:msqrt>
      <m:mi>n</m:mi>
    </m:msqrt>
  </m:mfrac></m:math>.
    </meaning>
  </definition>

<definition id="nrmdist">
    <term>Standard Normal Distribution</term>
    <meaning id="id11541500">
A continuous random variable (RV) 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mi>X</m:mi><m:mtext>~</m:mtext><m:mi>N</m:mi><m:mo stretchy="false">(</m:mo><m:mn>0,1</m:mn><m:mo stretchy="false">)</m:mo></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{X "~" N \( 0,1 \) } {}</m:annotation></m:semantics></m:math>. When X follows the standard normal distribution, it is often noted as 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mi>Z</m:mi><m:mtext>~</m:mtext><m:mi>N</m:mi><m:mo stretchy="false">(</m:mo><m:mn>0,1</m:mn><m:mo stretchy="false">)</m:mo></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{Z "~" N \( 0,1 \) } {}</m:annotation></m:semantics></m:math>.
    </meaning>
  </definition>

<definition id="stat">
    <term>Statistic</term>
    <meaning id="id17366233">
A numerical characteristic of the sample. A statistic estimates the corresponding population parameter. For example, the average number of full-time students in a 7:30 a.m. class for this term (statistic) is an estimate for the average number of full-time students in any class this term (parameter).
    </meaning>
  </definition>

<definition id="stats">
    <term>Statistics</term>
    <meaning id="id16128666">
The science of collecting, organizing, analyzing, and interpreting numerical data.
    </meaning>
  </definition>

<definition id="stratrandsamp">
    <term>Stratified Random Sampling</term>
    <meaning id="id15796834">
A population is divided into groups (called strata) and then a random sample is selected from each stratum.
    </meaning>
  </definition>

<definition id="studenttdist">
    <term>Student-<emphasis>t</emphasis> Distribution</term>
    <meaning id="id18834394">
Investigated and reported by William S. Gossett in 1908 and published under the pseudonym Student. The major characteristics of the random variable (RV) are: 

<list id="tdist1" list-type="bulleted"><item>The Student-t is continuous and assumes any real values. 
</item><item>The <m:math><m:mi>pdf</m:mi></m:math> is symmetrical about its mean of zero. However, it is more spread out and flatter at the apex than the normal distribution. </item>
<item>The Student-t approaches the standard normal distribution as <m:math><m:mi>n</m:mi></m:math> gets larger. </item>
<item>There is a "family" of <m:math><m:mi>t</m:mi></m:math> distributions: every representative of the family is completely defined by the number of degrees of freedom (one less than the number, <m:math><m:mi>n</m:mi></m:math>,  of data).</item></list>

Notation:  <m:math><m:msub><m:mi>t</m:mi><m:mi>df</m:mi></m:msub></m:math> where <m:math><m:mi>df</m:mi></m:math> is the degrees of freedom.  <m:math><m:mi>df</m:mi><m:mo>=</m:mo><m:mi>n - 1</m:mi></m:math>.

    </meaning>
  </definition>

<definition id="systsamp">
    <term>Systematic Sampling</term>
    <meaning id="id19492372">
A population is arranged in some standard list (for example, alphabetically) and then every m-th (for example, every fifth) representative of the list is taken in the sample starting from a random initial representative.
    </meaning>
  </definition>

<definition id="tstat">
    <term><emphasis>t</emphasis> statistic</term>
    <meaning id="id18932573">
Calculated from the data according to the Student-t distribution statistic  that is used to conduct a hypothesis test and to make the statistical inference about the whole population. If data contains <m:math><m:mi>n</m:mi></m:math> observations, then the number of degrees of freedom for the Student-t distribution is <m:math><m:mi>n - 1</m:mi></m:math>. The <m:math><m:mi>t</m:mi></m:math> statistic is used, for example, when the population standard deviation is unknown, when <m:math><m:mi>n</m:mi></m:math> is small, and when samples are dependent (matched pairs hypothesis test).  

The <m:math><m:mi>t</m:mi></m:math> statistic formula is 
<m:math><m:mi>t</m:mi><m:mo>=</m:mo>
<m:mfrac><m:mrow>
<m:apply>
  <m:conjugate/>
  <m:ci>x</m:ci>
</m:apply>
<m:mo>-</m:mo>
<m:mi>μ</m:mi></m:mrow>
<m:mrow><m:mfrac><m:mi>s</m:mi><m:msqrt><m:mi>n</m:mi></m:msqrt></m:mfrac></m:mrow>
</m:mfrac>
</m:math>.

    </meaning>
  </definition>

<definition id="teststat">
    <term>Test Statistic</term>
    <meaning id="id17441634">
Calculated from the sample value that is used to conduct the hypothesis test and that makes the statistical inference about the whole population. The calculation depends on the choice of the appropriate distribution, which often is reflected in the name of statistic: z-score, <emphasis>t</emphasis> statistic, <emphasis>F</emphasis>  statistic (<emphasis>F</emphasis> Ratio), etc.
    </meaning>
  </definition>

<definition id="treediagram">
    <term>Tree Diagram</term>
    <meaning id="id18749941">
The useful visual representation of a sample space and events in the form of a “tree” with branches marked by possible outcomes simultaneously with associated probabilities (frequencies, relative frequencies).
    </meaning>
  </definition>


<definition id="type1err">
    <term>Type 1 Error</term>
    <meaning id="id18319022">
The decision is to reject the Null hypothesis, when, in fact, the Null hypothesis is true.
    </meaning>
  </definition>


<definition id="type2err">
    <term>Type 2 Error</term>
    <meaning id="id10577941">
The decision is not to reject the Null hypothesis when, in fact, the Null hypothesis is false.
    </meaning>
  </definition>


<definition id="unidist">
    <term>Uniform Distribution</term>
    <meaning id="id14303008">
A continuous random variable (RV) that has equally likely outcomes over the domain, 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mrow><m:mi>a</m:mi><m:mo stretchy="false">&lt;</m:mo><m:mi>x</m:mi></m:mrow><m:mo stretchy="false">&lt;</m:mo><m:mi>b</m:mi></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{a&lt;x&lt;b} {}</m:annotation></m:semantics></m:math>. Often referred as the  <emphasis>Rectangular distribution</emphasis> because the graph of the pdf has the form of a rectangle. Notation: 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mi>X</m:mi><m:mtext>~</m:mtext><m:mi>U</m:mi><m:mo stretchy="false">(</m:mo><m:mi>a</m:mi><m:mi>,</m:mi><m:mi>b</m:mi><m:mo stretchy="false">)</m:mo></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{X "~" U \( a,b \) } {}</m:annotation></m:semantics></m:math>. 

The mean is 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mi>μ</m:mi><m:mo stretchy="false">=</m:mo><m:mfrac><m:mrow><m:mi>a</m:mi><m:mo stretchy="false">+</m:mo><m:mi>b</m:mi></m:mrow><m:mn>2</m:mn></m:mfrac></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{μ= {  {a+b}  over  {2} } } {}</m:annotation></m:semantics></m:math>  

and the standard deviation is 
<m:math><m:reln><m:eq/>
<m:mrow><m:mi>σ</m:mi></m:mrow>
<m:mrow><m:msqrt><m:mfrac>
<m:mrow><m:mo>(</m:mo><m:mi>b</m:mi><m:mo>-</m:mo><m:mi>a</m:mi><m:msup><m:mo>)</m:mo><m:mn>2</m:mn></m:msup></m:mrow>
<m:mrow><m:mn>12</m:mn></m:mrow>
</m:mfrac></m:msqrt></m:mrow>
</m:reln></m:math>

The probability density function is  
<m:math>
<m:apply>
<m:ci type="fn">f</m:ci><m:ci>X</m:ci></m:apply>
<m:mo>=</m:mo>
<m:mfrac><m:mrow><m:mn>1</m:mn></m:mrow><m:mrow><m:mi>b</m:mi><m:mo>-</m:mo><m:mi>a</m:mi></m:mrow></m:mfrac>
</m:math>
for
<m:math><m:reln><m:lt/><m:reln><m:lt/>
<m:mrow><m:mi>a</m:mi></m:mrow>
<m:mrow><m:mi>X</m:mi></m:mrow></m:reln>
<m:mrow><m:mi>b</m:mi></m:mrow></m:reln></m:math>
or 
<m:math><m:reln><m:leq/><m:reln><m:leq/>
<m:mrow><m:mi>a</m:mi></m:mrow>
<m:mrow><m:mi>X</m:mi></m:mrow></m:reln>
<m:mrow><m:mi>b</m:mi></m:mrow></m:reln></m:math>.

The  cumulative distribution is 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mi>P</m:mi><m:mo stretchy="false">(</m:mo><m:mrow><m:mi>X</m:mi><m:mo stretchy="false">≤</m:mo><m:mi>x</m:mi></m:mrow><m:mrow><m:mo stretchy="false">)</m:mo><m:mo stretchy="false">=</m:mo><m:mfrac><m:mrow><m:mi>x</m:mi><m:mo stretchy="false">−</m:mo><m:mi>a</m:mi></m:mrow><m:mrow><m:mi>b</m:mi><m:mo stretchy="false">−</m:mo><m:mi>a</m:mi></m:mrow></m:mfrac></m:mrow></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{P \( X &lt;= x \) = {  {x-a}  over  {b-a} } } {}</m:annotation></m:semantics></m:math>.
    </meaning>
  </definition>


<definition id="variable">
    <term>Variable (Random Variable)</term>
    <meaning id="id18716579">
A characteristic of interest in a population being studied. The common notation for variables are upper case Latin letters 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mi>X</m:mi></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{X} {}</m:annotation></m:semantics></m:math>, 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mi>Y</m:mi></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{Y} {}</m:annotation></m:semantics></m:math>, 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mi>Z</m:mi></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{Z} {}</m:annotation></m:semantics></m:math>,....

The common notation for a specific value of a variable) are lower case Latin letters 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mi>x</m:mi></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{x} {}</m:annotation></m:semantics></m:math>, 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mi>y</m:mi></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{y} {}</m:annotation></m:semantics></m:math>, 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mi>z</m:mi></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{z} {}</m:annotation></m:semantics></m:math>,.... 

The variable in statistics differs from the variable in intermediate algebra in two following ways: 

<list id="arrvee" list-type="bulleted">
<item> The domain of a random variable (RV) is not necessarily a numerical set but it may be words.  For example, if 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mi>X</m:mi></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{X} {}</m:annotation></m:semantics></m:math> = hair color then the domain is {black, blond, gray, red, brown}. </item>

<item> We can tell a specific value of 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mi>x</m:mi></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{x} {}</m:annotation></m:semantics></m:math> that the variable 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mi>X</m:mi></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{X} {}</m:annotation></m:semantics></m:math> takes on only after performing the experiment. </item></list>


    </meaning>
  </definition>



<definition id="variance">
    <term>Variance</term>
    <meaning id="id3154337">
Mean of the squared deviations from the mean. Square of the standard deviation.  For a set of data, a deviation can be represented as <m:math><m:mi>x</m:mi><m:mo>-</m:mo><m:apply>
  <m:conjugate/>
  <m:ci>x</m:ci>
</m:apply></m:math>  where <m:math><m:mi>x</m:mi></m:math> is a value of the data and <m:math><m:apply>
  <m:conjugate/>
  <m:ci>x</m:ci>
</m:apply></m:math> is the sample mean. The sample variance is equal to the sum of the squares of the deviations divided by the difference of the sample size and 1.

    </meaning>
  </definition>



<definition id="vendiagram">
    <term>Venn Diagram</term>
    <meaning id="id3154364">
The useful visual representation of a sample space and events in the form of circles or ovals showing their intersections.
    </meaning>
  </definition>



<definition id="zscore">
    <term>z-score</term>
    <meaning id="id3154393">
The linear transformation of the form 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mi>z</m:mi><m:mo stretchy="false">=</m:mo><m:mfrac><m:mrow><m:mi>x</m:mi><m:mo stretchy="false">−</m:mo><m:mi>μ</m:mi></m:mrow><m:mi>σ</m:mi></m:mfrac></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{z= {  {x-μ}  over  {σ} } } {}</m:annotation></m:semantics></m:math>. 

If this transformation is applied to any normal distribution
<m:math><m:mrow><m:mrow><m:mi>X</m:mi><m:mtext>~</m:mtext><m:mi>N</m:mi><m:mo stretchy="false">(</m:mo>
<m:mi>μ</m:mi>
<m:mo>,</m:mo>
<m:mi>σ</m:mi><m:mo stretchy="false">)</m:mo></m:mrow></m:mrow></m:math>  ,  
 
the result is the standard normal distribution 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mrow><m:mi>Z</m:mi><m:mtext>~</m:mtext><m:mi>N</m:mi><m:mo stretchy="false">(</m:mo><m:mn>0,1</m:mn><m:mo stretchy="false">)</m:mo></m:mrow></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{Z "~" N \( 0,1 \) } {}</m:annotation></m:semantics></m:math>. If this transformation is applied to any specific value 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mi>x</m:mi></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{x} {}</m:annotation></m:semantics></m:math> of the RV with mean 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mi>μ</m:mi></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{μ} {}</m:annotation></m:semantics></m:math> and standard deviation 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mi>σ</m:mi></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{σ} {}</m:annotation></m:semantics></m:math> , the result is called the  z-score of 
<m:math><m:semantics><m:mrow><m:mstyle fontsize="12pt"><m:mrow><m:mi>x</m:mi></m:mrow></m:mstyle><m:mrow/></m:mrow><m:annotation encoding="StarMath 5.0"> size 12{x} {}</m:annotation></m:semantics></m:math>. Z-scores allow us to compare data that are normally distributed but scaled differently.
    </meaning>
  </definition>



</glossary>
</document>
