The
FF size 12{F} {} distribution is named after Ronald Fisher. Fisher is one of the most respected statisticians of all time. He did a lot of statistical work in biology and genetics and became chair of genetics at Cambridge University in England in 1949. In 1952, he was awarded knighthood.
This section is a very brief overview of the
FF size 12{F} {} distribution and two of its applications - One Way Analysis of Variance (ANOVA) and test of two variances. There are college courses which deal exclusively with these topics. ANOVA, particularly, is used regularly in industry.
- kk size 12{k} {} = the number of different groups
- njnj size 12{n rSub { size 8{j} } } {} = the size of the
jthjth size 12{ ital "jth"} {} group
- sjsj size 12{s rSub { size 8{j} } } {}= the sum of the values in the
jthjth size 12{ ital "jth"} {} group
- NN size 12{N} {} = the total number of all the values combined
- Total sample size: ∑nj∑nj size 12{ Sum {n rSub { size 8{j} } } } {}
- xx = one value:
∑x=∑sj∑x=∑sj size 12{ Sum {x} = Sum {s rSub { size 8{j} } } } {}
- Sum of squares of all values from every group combined: ∑x2∑x2 size 12{ Sum {x rSup { size 8{2} } } ={}} {}
- Variability of between group:
SS
total
=
∑
x
2
−
∑
x
2
N
SS
total
=
∑
x
2
−
∑
x
2
N
size 12{ ital "SS" rSub { size 8{ ital "total"} } = Sum {x rSup { size 8{2} } } - { { left ( Sum {x} right ) rSup { size 8{2} } } over {N} } } {}
- Total sum of squares:
∑
x
2
−
(
∑
x
)
2
N
∑
x
2
−
(
∑
x
)
2
N
size 12{ Sum {x rSup { size 8{2} } } - { { \( Sum {x} \) rSup { size 8{2} } } over {N} } } {}
- Explained variation- sum of squares representing variation among the different samples
SS
between
=
∑
[
(
sj
)
2
n
j
]
−
(
∑
s
j
)
2
N
SS
between
=
∑
[
(
sj
)
2
n
j
]
−
(
∑
s
j
)
2
N
size 12{ ital "SS" rSub { size 8{ ital "between"} } = Sum { \[ { { \( ital "sj" \) rSup { size 8{2} } } over {n rSub { size 8{j} } } } \] } - { { \( Sum {s rSub { size 8{j} } \) rSup { size 8{2} } } } over {N} } } {}
- Unexplained variation- sum of squares representing variation within samples due to chance:
SS
within
=
SS
total
−
SS
between
SS
within
=
SS
total
−
SS
between
size 12{ ital "SS" rSub { size 8{ ital "within"} } = ital "SS" rSub { size 8{ ital "total"} } - ital "SS" rSub { size 8{ ital "between"} } } {}
- df's for different groups (df's for the numerator):
df
between
=
k
−
1
df
between
=
k
−
1
size 12{ ital "df" rSub { size 8{ ital "between"} } =k - 1} {}
-
Equation for errors within samples (df's for the denominator):
df
within
=
N
−
k
df
within
=
N
−
k
size 12{ ital "df" rSub { size 8{ ital "within"} } =N - k} {}
- Mean square (variance estimate) explained by the different groups:
MS
between
=
SS
between
df
between
MS
between
=
SS
between
df
between
size 12{ ital "MS" rSub { size 8{ ital "between"} } = { { ital "SS" rSub { size 8{ ital "between"} } } over { ital "df" rSub { size 8{ ital "between"} } } } } {}
-
Mean square (variance estimate) that is due to chance (unexplained):
MS
within
=
SS
within
df
within
MS
within
=
SS
within
df
within
size 12{ ital "MS" rSub { size 8{ ital "within"} } = { { ital "SS" rSub { size 8{ ital "within"} } } over { ital "df" rSub { size 8{ ital "within"} } } } } {}
-
F ratio or F statistic of two estimates of variance:
F
=
MS
between
MS
within
F
=
MS
between
MS
within
size 12{F= { { ital "MS" rSub { size 8{ ital "between"} } } over { ital "MS" rSub { size 8{ ital "within"} } } } } {}
The above calculations were done with groups of different sizes. If the groups are the same size, the calculations simplify somewhat and the F ratio can be written as:
F
=
n
⋅
(
s
−
x
)
2
(
s
pooled
)
2
F
=
n
⋅
(
s
−
x
)
2
(
s
pooled
)
2
size 12{F= { {n cdot \( s rSub { size 8{ - x} } \) rSup { size 8{2} } } over { \( s rSub { size 8{ ital "pooled"} } \) rSup { size 8{2} } } } } {}
(1)
- (s−x)2=(s−x)2= size 12{ \( s rSub { size 8{ - x} } \) rSup { size 8{2} } ={}} {}the variance of the sample means
- n=n= size 12{n={}} {}the sample size of each group
- (spooled)2=(spooled)2= size 12{ \( s rSub { size 8{ ital "pooled"} } \) rSup { size 8{2} } ={}} {}the mean of the sample variances (pooled variance)
-
df
numerator
=
k
−
1
df
numerator
=
k
−
1
size 12{ ital "df" rSub { size 8{ ital "numerator"} } =k - 1} {}
-
df
numerator
=
k
(
n
−
1
)
=
N
−
k
df
numerator
=
k
(
n
−
1
)
=
N
−
k
size 12{ ital "df" rSub { size 8{ ital "numerator"={}} } k \( n - 1 \) =N - k} {}
These calculations are easily done with a graphing calculator or a computer program. We present the information in the chapter assuming some kind of technology will be used.
For ANOVA, the samples must come from normally distributed populations with the same variance, and the samples must be independent. The ANOVA test is right-tailed.
In a test of two variances, the samples must come from normal populations and must be independent of each other.
Three different diet plans are to be tested for average weight loss. For each diet plan, 4 dieters are selected and their weight loss (in pounds) in one month's time is recorded.
| Plan 1 |
Plan 2 |
Plan 3 |
| 5 |
3.5 |
8 |
| 4.5 |
7 |
4 |
| 4 |
6 |
3.5 |
| 3 |
4 |
4.5 |
Is the average weight loss the same for each plan? Conduct an ANOVA test with a 1% level of significance.
Let
μ1μ1 size 12{μ rSub { size 8{1} } } {},
μ2μ2 size 12{μ rSub { size 8{2} } } {}, and
μ3μ3 size 12{μ rSub { size 8{3} } } {} be the population means for the three diet plans.
-
H
o
:
μ
1
=
μ
2
=
μ
3
H
o
:
μ
1
=
μ
2
=
μ
3
size 12{H rSub { size 8{o} } :μ rSub { size 8{1} } =μ rSub { size 8{2} } =μ rSub { size 8{3} } } {}
- Ha:Ha: size 12{H rSub { size 8{a} } :} {} Not all pairs of means are equal.
-
df
numerator
=
3
−
1
=
2
df
numerator
=
3
−
1
=
2
size 12{ ital "df" rSub { size 8{ ital "numerator"} } =3 - 1=2} {}
-
df
deno
min
ator
=
12
−
3
=
9
df
deno
min
ator
=
12
−
3
=
9
size 12{ ital "df" rSub { size 8{ ital "deno""min" ital "ator"} } ="12" - 3=9} {}
The distribution for the test is
F2,9F2,9 size 12{F rSub { size 8{2,9} } } {}
Using a calculator or computer, the test statistic is
F=0.47F=0.47 size 12{F=0 "." "47"} {}. The notation used for the
FF size 12{F} {} statistic may also be
F'F' size 12{F'} {} or
F2,9F2,9 size 12{F rSub { size 8{2,9} } } {} (like the distribution). The TI-83/84 series has the function ANOVA in STAT TESTS. Enter the lists of data separated by commas.
If you use the formulas for groups of the same size, the calculations are as follows:
Sample means are 4.13, 5.13, and 5, respectively. Sample standard deviations are 0.8539, 1.6250, and 2.0412, respectively.
|
(
s
−
x
)
2
=
0
.
2956
(
s
−
x
)
2
=
0
.
2956
size 12{ \( s rSub { size 8{ - x} } \) rSup { size 8{2} } =0 "." "2956"} {}
|
The variance of the sample means |
|
(
s
pooled
)
2
=
2
.
5416
(
s
pooled
)
2
=
2
.
5416
size 12{ \( s rSub { size 8{ ital "pooled"} } \) rSup { size 8{2} } =2 "." "5416"} {}
|
The mean of the sample variances |
|
n
=
4
n
=
4
size 12{n=4} {}
|
The sample size of each group |
F
=
4
⋅
0
.
2956
2
.
5416
F
=
4
⋅
0
.
2956
2
.
5416
size 12{F= { {4 cdot 0 "." "2956"} over {2 "." "5416"} } } {}
(2)
Probability Statement:
P
(
F
>
0
.
47
)
=
0
.
6395
P
(
F
>
0
.
47
)
=
0
.
6395
size 12{P \( F>0 "." "47" \) =0 "." "6395"} {}
Since
α<pα<p size 12{a<p} {}-value, do not reject
HoHo size 12{H rSub { size 8{o} } } {}.
There is not sufficient evidence to conclude that the three diet plans are different. It appears that the three diet plans work equally well. The average weight loss is the same for all three plans.
Machine A makes a box and machine B makes a lid. For the lid to fit the box correctly, the variances should be nearly the same. There is a suspicion that the variance of the box is greater than the variance of the lid. The following data was collected.
| |
Machine A (Box) |
Machine B (Lid) |
| Number of Parts |
9 |
11 |
| Variance |
150 |
45 |
Are the machines working properly? Test at a 5% level of significance.
Let
σA2σA2 size 12{σ rSub { size 8{A rSup { size 6{2} } } } } {} and
σB2σB2 size 12{σ rSub { size 8{B rSup { size 6{2} } } } } {}be the population variances for machine A and machine B, respectively
-
H
o
:
σ
A
2
=
σ
B
2
H
o
:
σ
A
2
=
σ
B
2
size 12{H rSub { size 8{o} } :σ rSub { size 8{A rSup { size 6{2} } } } =σ rSub {B rSup { size 6{2} } } } {}
-
H
a
:
σ
A
2
>
σ
B
2
H
a
:
σ
A
2