Summary: This module provides an overview of Comparing Two Independent Population Means with Unknown Population Standard Deviations as a part of Collaborative Statistics collection (col10522) by Barbara Illowsky and Susan Dean.
Note: You are viewing an old version of this document. The latest version is available here.
The comparison of two population means is very common. A difference between
the two samples depends on both the means and the standard deviations. Very
different means can occur by chance if there is great variation among the individual
samples. In order to account for the variation, we take the difference of the sample
means,
Because we do not know the population standard deviations, we estimate them using
the two sample standard deviations from our independent samples. For the
hypothesis test, we calculate the estimated standard deviation, or standard error, of
the difference in sample means,
The test statistic (t-score) is calculated as follows:
The degrees of freedom (df) is a somewhat complicated calculation. However, a computer or calculator calculates it easily. The dfs are not always a whole number. The test statistic calculated above is approximated by the student's-t distribution with dfs as follows:
When both sample sizes
The average amount of time boys and girls ages 7 through 11 spend playing sports each day is believed to be the same. An experiment is done, data is collected, resulting in the table below:
| Sample Size | Average Number of Hours Playing Sports Per Day | Sample Standard Deviation | |
|---|---|---|---|
| Girls | 9 | 2 hours | |
| Boys | 16 | 3.2 hours | 1.00 |
Is there a difference in the average amount of time boys and girls ages 7 through 11 play sports each day? Test at the 5% level of significance.
The population standard deviations are not known.
Let
Random variable:
The words "the same" tell you
Distribution for the test:
Use
Calculate the p-value using a student's-t distribution: p-value = 0.0054
Graph:
![]() |
So,
Half the p-value is below -1.2 and half is above 1.2.
Make a decision: Since
This means you reject
Conclusion: At the 5% level of significance, the sample data show there is sufficient evidence to conclude that the average number of hours that girls and boys aged 7 through 11 play sports per day is different.
STAT. Arrow over to TESTS and press
4:2-SampTTest. Arrow over to Stats and press ENTER. Arrow down
and enter 2 for the first sample mean,
0.75
0.75
for Sx1, 9 for n1, 3.2 for the
second sample mean, 1 for Sx2, and 16 for n2. Arrow down to μ1: and
arrow to does not equal μ2. Press ENTER. Arrow down to Pooled: and
No. Press ENTER. Arrow down to Calculate and press ENTER. The
p-value is p = 0.0054, the dfs are approximately 18.8462, and the test
statistic is -3.14. Do the procedure again but instead of Calculate do Draw.A study is done by a community group in two neighboring colleges to determine which one graduates students with more math classes. College A samples 11 graduates. Their average is 4 math classes with a standard deviation of 1.5 math classes. College B samples 9 graduates. Their average is 3.5 math classes with a standard deviation of 1 math class. The community group believes that a student who graduates from college A has taken more math classes, on the average. Test at a 1% significance level. Answer the following questions.
Is this a test of two means or two proportions?
two means
Are the populations standard deviations known or unknown?
unknown
Which distribution do you use to perform the test?
student's-t
What is the random variable?
What are the null and alternate hypothesis?
Is this test right, left, or two tailed?
right
What is the p-value?
0.1928
Do you reject or not reject the null hypothesis?
Do not reject.
At the 1% level of significance, from the sample data, there is not sufficient evidence to conclude that a student who graduates from college A has taken more math classes, on the average, than a student who graduates from college B.