A contingency table provides a different way of calculating probabilities. The table helps in determining conditional probabilities quite easily. The table displays sample values in relation to two different variables that may be dependent or contingent on one another. Later on, we will use contingency tables again, but in another manner.
Suppose a study of speeding violations and drivers who use car phones produced the following fictional data:
Table 1
| |
Speeding violation
in the last year |
No speeding violation
in the last year |
Total |
| Car phone user |
25 |
280 |
305 |
| Not a car phone user |
45 |
405 |
450 |
| Total |
70 |
685 |
755 |
The total number of people in the sample is 755. The row totals are 305 and 450. The column totals are 70 and 685. Notice that 305 + 450 = 755305 + 450 = 755 and 70 + 685 = 75570 + 685 = 755.
Calculate the following probabilities using the table
P(person is a car phone user) = P(person is a car phone user) =
number of car phone users
total number in study
=
305
755
number of car phone users
total number in study
=
305
755
P(person had no violation in the last year) = P(person had no violation in the last year) =
number that had no violation
total number in study
=
685
755
number that had no violation
total number in study
=
685
755
P(person had no violation in the last year AND was a car phone user) =P(person had no violation in the last year AND was a car phone user) =
P(person is a car phone user OR person had no violation in the last year) =P(person is a car phone user OR person had no violation in the last year) =
(
305
755
+
685
755
) -
280
755
=
710
755
(
305
755
+
685
755
) -
280
755
=
710
755
P(person is a car phone user GIVEN person had a violation in the last year) = P(person is a car phone user GIVEN person had a violation in the last year) =
25
70
25
70
(The sample space is reduced to the number of persons who had a violation.)
P(person had no violation last year GIVEN person was not a car phone user) = P(person had no violation last year GIVEN person was not a car phone user) =
405
450
405
450
(The sample space is reduced to the number of persons who were not car phone users.)
The following table shows a random sample of 100 hikers and the areas of hiking preferred:
Table 2: Hiking Area Preference
| Sex |
The Coastline |
Near Lakes and Streams |
On Mountain Peaks |
Total |
| Female |
18 |
16 |
___ |
45 |
| Male |
___ |
___ |
14 |
55 |
| Total |
___ |
41 |
___ |
___ |
Table 3: Hiking Area Preference
| Sex |
The Coastline |
Near Lakes and Streams |
On Mountain Peaks |
Total |
| Female |
18 |
16 |
11 |
45 |
| Male |
16 |
25 |
14 |
55 |
| Total |
34 |
41 |
25 |
100 |
Are the events "being female" and "preferring the coastline" independent events?
Let FF = being female and let CC = preferring the coastline.
- a. P(F AND C)P(F AND C) =
- b. P(F) ⋅ P(C)P(F) ⋅ P(C) =
Are these two numbers the same? If they are, then FF and CC are independent. If they are not, then FF and CC are not independent.
- a.
P(F AND C)
=
18
100
=
0.18
P(F AND C) =
18
100
= 0.18
- b.
P(F) ⋅ P(C)
=
45
100
⋅
45
100
=
0.45
⋅
0.45
=
0.153
P(F) ⋅ P(C) =
45
100
⋅
45
100
= 0.45 ⋅ 0.45 = 0.153
P(F AND C)
≠
P(F) ⋅ P(C)P(F AND C) ≠ P(F) ⋅ P(C), so the events FF and CC are not independent.
Find the probability that a person is male given that the person prefers hiking near lakes and streams. Let MM = being male and let LL = prefers hiking near lakes and streams.
- a. What word tells you this is a conditional?
- b. Fill in the blanks and calculate the probability: P(___|___) = ___P(___|___) = ___.
- c. Is the sample space for this problem all 100 hikers? If not, what is it?
- a. The word 'given' tells you that this is a conditional.
- b. P(M|L) = 2541P(M|L) = 2541
- c. No, the sample space for this problem is 41.
Find the probability that a person is female or prefers hiking on mountain peaks.
Let FF = being female and let PP = prefers mountain peaks.
- a. P(F) = P(F) =
- b. P(P) = P(P) =
- c. P(F AND P) = P(F AND P) =
- d. Therefore, P(F OR P) = P(F OR P) =
- a. P(F) = 45100P(F) = 45100
- b. P(P) = 25100P(P) = 25100
- c. P(F AND P) = 11100P(F AND P) = 11100
- d. P(F OR P) = 45100 + 25100 - 11100 = 59100P(F OR P) = 45100 + 25100 - 11100 = 59100
- Contingency Table:
The method of displaying a frequency distribution in case of dependable (contingent) variables; the table provides the easy way to calculate conditional probabilities.