Skip to content Skip to navigation

OpenStax_CNX

You are here: Home » Content » F Distribution and ANOVA: Facts About the F Distribution

Navigation

Recently Viewed

This feature requires Javascript to be enabled.

Tags

(What is a tag?)

These tags come from the endorsement, affiliation, and other lenses that include this content.
 

F Distribution and ANOVA: Facts About the F Distribution

Module by: Susan Dean, Barbara Illowsky, Ph.D.. E-mail the authors

Summary: This module states the factors associated with F Distributions and provides students with some examples to help further understand the concept. Students will be given the opportunity to see F Distributions in action through participation in an optional classroom exercise.

Note: You are viewing an old version of this document. The latest version is available here.

  1. The curve is not symmetrical but skewed to the right.
  2. There is a different curve for each set of dfs dfs .
  3. The F statistic is greater than or equal to zero.
  4. As the degrees of freedom for the numerator and for the denominator get larger, the curve approximates the normal.
  5. Other uses for the F distribution include comparing two variances and Two-Way Analysis of Variance. Comparing two variances is discussed at the end of the chapter. Two-Way Analysis is mentioned for your information only.

Figure 1
(a) (b)
Nonsymmetrical F distribution curve skewed to the right, more values in the right tail and the peak is closer to the left. This curve is different from the graph on the right because of the different dfs.Nonsymmetrical F distribution curve skewed to the right, more values in the right tail and the peak is closer to the left. This curve is different from the graph on the left because of the different dfs. Because its dfs are larger, it is closer in resemblance to a normal distribution curve.

Example 1

One-Way ANOVA: Four sororities took a random sample of sisters regarding their grade averages for the past term. The results are shown below:

Table 1
GRADE AVERAGES FOR FOUR SORORITIES
Sorority 1 Sorority 2 Sorority 3 Sorority 4
2.17 2.63 2.63 3.79
1.85 1.77 3.78 3.45
2.83 3.25 4.00 3.08
1.69 1.86 2.55 2.26
3.33 2.21 2.45 3.18

Problem 1

Using a significance level of 1%, is there a difference in grade averages among the sororities?

Solution

Let μ 1 μ 1 , μ 2 μ 2 , μ 3 μ 3 , μ 4 μ 4 be the population means of the sororities. Remember that the null hypothesis claims that the sorority groups are from the same normal distribution. The alternate hypothesis says that at least two of the sorority groups come from populations with different normal distributions. Notice that the four sample sizes are each size 5.

H o : μ 1 = μ 2 = μ 3 = μ 4 H o : μ 1 = μ 2 = μ 3 = μ 4

H a H a : Not all of the means μ 1 , μ 2 , μ 3 , μ 4 μ 1 , μ 2 , μ 3 , μ 4 are equal.

Distribution for the test: F 3 , 16 F 3 , 16

where k = 4 groupsk=4 groups and N = 20 samples in total N=20 samples in total

df(num) = k - 1 = 4 - 1 = 3 df(num)=k-1=4-1=3

df(denom) = N - k = 20 - 4 = 16 df(denom)=N-k=20-4=16

Calculate the test statistic: F = 2.23 F=2.23

Graph:

Figure 2
Nonsymmetrical F distribution curve with values of 0 and 2.23 on the x-axis representing the test statistic of sorority grade averages. A vertical upward line extends from 2.23 to the curve and the area to the right of this is equal to the p-value.

Probability statement: p-value = P ( F > 2.23 ) = 0.1241 p-value=P(F>2.23)=0.1241

Compare α α and the p-value p-value: α = 0.01 p-value = 0.1242 α=0.01p-value=0.1242 α < p-value α<p-value

Make a decision: Since α < p-value α<p-value, you cannot reject H o H o .

This means that the population averages appear to be the same.

Conclusion: There is not sufficient evidence to conclude that there is a difference among the grade averages for the sororities.

TI-83+ or TI 84: Put the data into lists L1, L2, L3, and L4. Press STAT and arrow over to TESTS. Arrow down to F:ANOVA. Press ENTER and Enter (L1,L2,L3,L4). The F statistic is 2.2303 and the p-value p-value is 0.1241. df(numerator) = 3df(numerator) = 3 (under "Factor") and df(denominator) = 16df(denominator) = 16 (under Error).

Example 2

A fourth grade class is studying the environment. One of the assignments is to grow bean plants in different soils. Tommy chose to grow his bean plants in soil found outside his classroom mixed with dryer lint. Tara chose to grow her bean plants in potting soil bought at the local nursery. Nick chose to grow his bean plants in soil from his mother's garden. No chemicals were used on the plants, only water. They were grown inside the classroom next to a large window. Each child grew 5 plants. At the end of the growing period, each plant was measured, producing the following data (in inches):

Table 2
Tommy's Plants Tara's Plants Nick's Plants
24 25 23
21 31 27
23 23 22
30 20 30
23 28 20

Problem 1

Does it appear that the three media in which the bean plants were grown produce the same average height? Test at a 3% level of significance.

Solution

This time, we will perform the calculations that lead to the F' statistic. Notice that each group has the same number of plants so we will use the formula F' = n ( s x_ ) 2 ( s pooled ) 2 F' = n ( s x_ ) 2 ( s pooled ) 2 .

First, calculate the sample mean and sample variance of each group.

Table 3
  Tommy's Plants Tara's Plants Nick's Plants
Sample Mean 24.2 25.4 24.4
Sample Variance 11.7 18.3 16.3

Next, calculate the variance of the three group means (Calculate the variance of 24.2, 25.4, and 24.4). Variance of the group means = 0.413 = (s x¯ )2(s x )2

Then MS between = n (s x¯ )2 = ( 5 ) ( 0.413 ) MS between =n(s x )2=(5)(0.413) where n=5n=5 is the sample size (number of plants each child grew).

Calculate the average of the three sample variances (Calculate the average of 11.7, 18.3, and 16.3). Average of the sample variances = 15.433 = (spooled)2 =(spooled)2 size 12{ \( s rSub { size 8{ ital "pooled"} } \) rSup { size 8{2} } ={}} {}

Then MS within = (spooled)2 = 15.433 MS within =(spooled)2 size 12{ \( s rSub { size 8{ ital "pooled"} } \) rSup { size 8{2} } ={}} {}=15.433.

The F F statistic (or F F ratio) is F = MS between MS within = n ( s x_ ) 2 ( s pooled ) 2 = ( 5 ) ( 0.413 ) 15.433 = 0.134 F= MS between MS within = n ( s x_ ) 2 ( s pooled ) 2 = ( 5 ) ( 0.413 ) 15.433 =0.134

The dfs for the numerator = the number of groups - 1 = 3 - 1 = 2 the number of groups-1=3-1=2

The dfs for the denominator = the total number of samples - the number of groups = 15 - 3 = 12 the total number of samples-the number of groups=15-3=12

The distribution for the test is F 2 , 12 F 2 , 12 and the F statistic is F = 0.134 F=0.134

The p-value is P ( F > 0.134 ) = 0.8759 P(F>0.134)=0.8759.

Decision: Since α = 0.03 α=0.03 and the p-value = 0.8759 p-value=0.8759, do not reject H o H o . (Why?)

Conclusion: With a 3% the level of significance, from the sample data, the evidence is not sufficient to conclude that the average heights of the bean plants are not different. Of the three media tested, it appears that it does not matter which one the bean plants are grown in.

(This experiment was actually done by three classmates of the son of one of the authors.)

Another fourth grader also grew bean plants but this time in a jelly-like mass. The heights were (in inches) 24, 28, 25, 30, and 32.

Problem 2

Do an ANOVA test on the 4 groups. You may use your calculator or computer to perform the test. Are the heights of the bean plants different? Use a solution sheet.

Solution

  • FF = 0.9496
  • p-valuep-value = 0.4401

The heights of the bean plants are the same.

Optional Classroom Activity

Randomly divide the class into four groups of the same size. Have each member of each group record the number of states in the United States he or she has visited. Run an ANOVA test to determine if the average number of states visited in the four groups are the same. Test at a 1% level of significance. Use one of the solution sheets at the end of the chapter (after the homework).

Content actions

Download module as:

Add module to:

My Favorites (?)

'My Favorites' is a special kind of lens which you can use to bookmark modules and collections. 'My Favorites' can only be seen by you, and collections saved in 'My Favorites' can remember the last module you were on. You need an account to use 'My Favorites'.

| A lens I own (?)

Definition of a lens

Lenses

A lens is a custom view of the content in the repository. You can think of it as a fancy kind of list that will let you see content through the eyes of organizations and people you trust.

What is in a lens?

Lens makers point to materials (modules and collections), creating a guide that includes their own comments and descriptive tags about the content.

Who can create a lens?

Any individual member, a community, or a respected organization.

What are tags? tag icon

Tags are descriptors added by lens makers to help label content, attaching a vocabulary that is meaningful in the context of the lens.

| External bookmarks