<?xml version="1.0" encoding="utf-8" standalone="no"?>
<!DOCTYPE document PUBLIC "-//CNX//DTD CNXML 0.5 plus MathML//EN" "http://cnx.rice.edu/cnxml/0.5/DTD/cnxml_mathml.dtd">
<document xmlns="http://cnx.rice.edu/cnxml" xmlns:md="http://cnx.rice.edu/mdml/0.4" xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/" id="m10953">

  <name xmlns:md="http://cnx.rice.edu/mdml/0.4" xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/">Remarks on the concept of "Probability"</name>
  
<metadata xmlns:md="http://cnx.rice.edu/mdml/0.4" xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/">
  <md:version xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/">2.3</md:version>
  <md:created xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/">2002/12/02</md:created>
  <md:revised xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/">2003/07/11 10:15:21.714 GMT-5</md:revised>
  <md:authorlist xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/">
    <md:author xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/" id="dmlane">
      <md:firstname xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/">David</md:firstname>
      
      <md:surname xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/">Lane</md:surname>
      <md:email xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/">lane@rice.edu</md:email>
    </md:author>
  </md:authorlist>

  <md:maintainerlist xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/">
    <md:maintainer xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/" id="dmlane">
      <md:firstname xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/">David</md:firstname>
      
      <md:surname xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/">Lane</md:surname>
      <md:email xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/">lane@rice.edu</md:email>
    </md:maintainer>
    <md:maintainer xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/" id="emaloney">
      <md:firstname xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/">Erin</md:firstname>
      
      <md:surname xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/">Maloney</md:surname>
      <md:email xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/">emaloney@rice.edu</md:email>
    </md:maintainer>
    <md:maintainer xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/" id="meyer">
      <md:firstname xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/">Eileen</md:firstname>
      
      <md:surname xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/">Meyer</md:surname>
      <md:email xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/">meyer@rice.edu</md:email>
    </md:maintainer>
  </md:maintainerlist>
  
  

  <md:abstract xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/"/>
</metadata>

  <content xmlns:md="http://cnx.rice.edu/mdml/0.4" xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/">
    <para xmlns:md="http://cnx.rice.edu/mdml/0.4" xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/" id="para1">
<term xmlns:md="http://cnx.rice.edu/mdml/0.4" xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/" src="#Inferential_Statistics">Inferential statistics</term> is built on the foundation of probability theory, 
and has been remarkably successful in guiding opinion about the 
conclusions to be drawn from data. Yet (paradoxically) the very idea 
of probability has been plagued by controversy from the beginning of the 
subject to the present day. In this section we provide a glimpse of the 
debate about the interpretation of the probability concept.
</para>

<para xmlns:md="http://cnx.rice.edu/mdml/0.4" xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/" id="para2">
One conception of probability is drawn from the idea of <term xmlns:md="http://cnx.rice.edu/mdml/0.4" xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/">symmetrical 
outcomes</term>. For example, the two possible outcomes of tossing a fair 
coin seem not to be distinguishable in any way that affects which side 
will land up or down. Therefore the probability of heads is taken to be 
<m:math><m:apply><m:divide/><m:cn>1</m:cn><m:cn>2</m:cn></m:apply></m:math>, 
as is the probability of tails. In general, if there 
are <m:math><m:ci>N</m:ci></m:math> symmetrical outcomes, the probability 
of any given one of them occurring is taken to be <m:math><m:apply>
<m:divide/><m:cn>1</m:cn><m:ci>N</m:ci></m:apply></m:math>. Thus, if a 
six-sided die is rolled, the probability of any one of the six sides coming 
up is <m:math><m:apply><m:divide/><m:cn>1</m:cn><m:cn>6</m:cn></m:apply></m:math>.
</para>

<para xmlns:md="http://cnx.rice.edu/mdml/0.4" xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/" id="para3">
Probabilities can also be thought of in terms of <term xmlns:md="http://cnx.rice.edu/mdml/0.4" xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/">relative frequencies</term>. 
If we tossed a coin millions of times, we would expect the proportion of tosses 
that came up heads to be pretty close to <m:math><m:apply><m:divide/><m:cn>1</m:cn>
<m:cn>2</m:cn></m:apply></m:math>. As the number of tosses increases, the proportion 
of heads approaches <m:math><m:apply><m:divide/><m:cn>1</m:cn><m:cn>2</m:cn>
</m:apply></m:math>. Therefore, we can say that the probability of a head is 
<m:math><m:apply><m:divide/><m:cn>1</m:cn><m:cn>2</m:cn></m:apply></m:math>.
</para>

<para xmlns:md="http://cnx.rice.edu/mdml/0.4" xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/" id="para4">
If it has rained in Seattle on <m:math><m:cn>62</m:cn></m:math>% of the 
last <m:math><m:cn>100,000</m:cn></m:math> days, then the probability of 
it raining tomorrow might be taken to be <m:math><m:cn>0.62</m:cn>
</m:math>. This is a natural idea but nonetheless unreasonable if we 
have further information relevant to whether will rain tomorrow. For 
example, if tomorrow is August 1, a day of the year on which it seldom 
rains in Seattle, we should only consider the percentage of the time it 
rained on August 1. But even this is not enough since the probability of 
rain on the next August 1 depends on the humidity. (The chances are 
higher in the presence of high humidity.) So, we should consult only 
the prior occurrences of August 1 that had the same humidity as the 
next occurrence of August 1. Of course, wind direction also affects 
probability ... You can see that our sample of prior cases will soon be 
reduced to the empty set. Anyway, past meteorological history is 
misleading if the climate is changing? 
</para>

<para xmlns:md="http://cnx.rice.edu/mdml/0.4" xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/" id="para5">
For some purposes, probability is best thought of as 
<term xmlns:md="http://cnx.rice.edu/mdml/0.4" xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/">subjective</term>. Questions such as "What is the probability 
that Ms. Jones will defeat Mr. Smith in an upcoming congressional 
election?" do not conveniently fit into either the symmetry or frequency 
approaches to probability. Rather, assigning probability <m:math><m:cn>0.7
</m:cn></m:math> (say) to this event seems to reflect the speaker's personal 
opinion --- perhaps his willingness to bet according to certain odds. Such 
an approach to probability, however, seems to lose the objective content of 
the idea of chance; probability becomes mere opinion.
</para>

<para xmlns:md="http://cnx.rice.edu/mdml/0.4" xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/" id="para6">
Two people might attach different probabilities to the election outcome, yet 
there would be no criterion for calling one "right" and the other "wrong." 
We cannot call one of the two people right simply because she assigned higher 
probability to the outcome that actually transpires. After all, you would be 
right to attribute probability <m:math><m:apply><m:divide/><m:cn>1</m:cn>
<m:cn>6</m:cn></m:apply></m:math> to throwing a six with a fair die, and your 
friend who attributes <m:math><m:apply><m:divide/><m:cn>2</m:cn><m:cn>3</m:cn>
</m:apply></m:math> to this event would be wrong. And you are still right (and 
your friend is still wrong) even if the die ends up showing a six! The lack 
of objective criteria for adjudicating claims about probabilities in the subjective 
perspective is an unattractive feature of it for many scholars.
</para>

<para xmlns:md="http://cnx.rice.edu/mdml/0.4" xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/" id="para7">
Like most work in the field, the present text adopts the frequentist 
approach to probability. Moreover, almost all the probabilities we shall 
encounter will be <term xmlns:md="http://cnx.rice.edu/mdml/0.4" xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/">nondogmatic</term>, that is, neither zero nor 
one. An event with probability <m:math><m:cn>0</m:cn></m:math> has no 
chance of occurring; an event of probability <m:math><m:cn>1</m:cn>
</m:math> is certain to occur. It is hard to think of any examples of 
interest to statistics in which the probability is either <m:math>
<m:cn>0</m:cn></m:math> or <m:math><m:cn>1</m:cn></m:math>. (Even the 
probability that the Sun will come up tomorrow is less than <m:math>
<m:cn>1</m:cn></m:math>.)
</para>

<para xmlns:md="http://cnx.rice.edu/mdml/0.4" xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/" id="para8">
The following example illustrates our attitude about probabilities. 
Suppose you wish to know what the weather will be like next Saturday 
because you are planning a picnic. You turn on your radio, and the 
weather person says, "There is a <m:math><m:cn>10</m:cn></m:math>% 
chance of rain." You decide to have the picnic outdoors and, lo and 
behold, it rains. You are furious with the weather person. But was 
she wrong? No, she did not say it would not rain, only that rain was 
unlikely. She would have been flatly wrong only if she said that the 
probability is <m:math><m:cn>0</m:cn></m:math> and it subsequently 
rained. However, if you kept track of her weather predictions over a 
long periods of time and found that it rained on <m:math><m:cn>50</m:cn>
</m:math>% of the days that the weather person said the probability 
was <m:math><m:cn>0.10</m:cn></m:math>, you could say her probability 
assessments are wrong.
</para>

<para xmlns:md="http://cnx.rice.edu/mdml/0.4" xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/" id="para9">
So when is it accurate to say that the probability of rain is <m:math>
<m:cn>0.10</m:cn></m:math>? According to our frequency interpretation, 
it means that it will rain <m:math><m:cn>10</m:cn></m:math>% of the 
days on which rain is forecast with this probability.
    </para>   

  </content>
  
  <glossary xmlns:md="http://cnx.rice.edu/mdml/0.4" xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/">
    <definition xmlns:md="http://cnx.rice.edu/mdml/0.4" xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/" id="Inferential_Statistics">
      <term xmlns:md="http://cnx.rice.edu/mdml/0.4" xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/">Inferential Statistics</term>
      <meaning xmlns:md="http://cnx.rice.edu/mdml/0.4" xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:bib="http://bibtexml.sf.net/">
	The branch of staistics concerned with drawing conclusions about a population from a smaller sample. This is generally done through random sampling, followed by inferences made about central tendency, or any of a number of other aspects of a distribution.
      </meaning>
    </definition>

</glossary>


</document>
