Connexions

You are here: Home » Content » CONFIDENCE INTERVALS II

Recently Viewed

This feature requires Javascript to be enabled.

CONFIDENCE INTERVALS II

Module by: Ewa Paszek. E-mail the author

Summary: This course is a short series of lectures on Introductory Statistics. Topics covered are listed in the Table of Contents. The notes were prepared by Ewa Paszek and Marek Kimmel. The development of this course has been supported by NSF 0203396 grant.

CONFIDENCE INTERVALS II

Confidence Intervals for Means

In the preceding considerations (Confidence Intervals I), the confidence interval for the mean μ μ of a normal distribution was found, assuming that the value of the standard deviation σ σ is known. However, in most applications, the value of the standard deviation σ σ is rather unknown, although in some cases one might have a very good idea about its value.

Suppose that the underlying distribution is normal and that σ 2 σ 2 is unknown. It is shown that given random sample X 1 , X 2 ,..., X n X 1 , X 2 ,..., X n from a normal distribution, the statistic T= X ¯ μ S/ n T= X ¯ μ S/ n has a t distribution with r=n1 r=n1 degrees of freedom, where S 2 S 2 is the usual unbiased estimator of σ 2 σ 2 , (see, t distribution).

Select t α/2 ( n1 ) t α/2 ( n1 ) so that P[ T t α/2 ( n1 ) ]=α/2. P[ T t α/2 ( n1 ) ]=α/2. Then

1α=P[ t α/2 ( n1 ) X ¯ μ S/ n t α/2 ( n1 ) ] =P[ t α/2 ( n1 ) S n X ¯ μ t α/2 ( n1 ) S n ] =P[ X ¯ t α/2 ( n1 ) S n μ X ¯ + t α/2 ( n1 ) S n ] =P[ X ¯ t α/2 ( n1 ) S n μ X ¯ + t α/2 ( n1 ) S n ]. 1α=P[ t α/2 ( n1 ) X ¯ μ S/ n t α/2 ( n1 ) ] =P[ t α/2 ( n1 ) S n X ¯ μ t α/2 ( n1 ) S n ] =P[ X ¯ t α/2 ( n1 ) S n μ X ¯ + t α/2 ( n1 ) S n ] =P[ X ¯ t α/2 ( n1 ) S n μ X ¯ + t α/2 ( n1 ) S n ].

Thus the observations of a random sample provide a x ¯ x ¯ and s 2 s 2 and x ¯ t α/2 ( n1 ) s n , x ¯ + t α/2 ( n1 ) s n x ¯ t α/2 ( n1 ) s n , x ¯ + t α/2 ( n1 ) s n is a 100( 1α )% 100( 1α )% interval for μ μ .

Example 1

Let X equals the amount of butterfat in pound produced by a typical cow during a 305-day milk production period between her first and second claves. Assume the distribution of X is N( μ, σ 2 ) N( μ, σ 2 ) . To estimate μ μ a farmer measures the butterfat production for n-20 cows yielding the following data:

 481 537 513 583 453 510 570 500 487 555 618 327 350 643 499 421 505 637 599 392 -

For these data, x ¯ =507.50 x ¯ =507.50 and s=89.75 s=89.75 . Thus a point estimate of μ μ is x ¯ =507.50 x ¯ =507.50 . Since t 0.05 ( 19 )=1.729 t 0.05 ( 19 )=1.729 , a 90% confidence interval for μ μ is 507.50±1.729( 89.75 20 ) 507.50±1.729( 89.75 20 ) , or equivalently, [472.80, 542.20].

Let T have a t distribution with n-1 degrees of freedom. Then, t α/2 ( n1 )> z α/2 t α/2 ( n1 )> z α/2 . Consequently, the interval x ¯ ± z α/2 σ/ n x ¯ ± z α/2 σ/ n is expected to be shorter than the interval x ¯ ± t α/2 ( n1 )s/ n x ¯ ± t α/2 ( n1 )s/ n . After all, there gives more information, namely the value of σ σ , in construction the first interval. However, the length of the second interval is very much dependent on the value of s. If the observed s is smaller than σ σ , a shorter confidence interval could result by the second scheme. But on the average, x ¯ ± z α/2 σ/ n x ¯ ± z α/2 σ/ n is the shorter of the two confidence intervals.

If it is not possible to assume that the underlying distribution is normal but μ μ and σ σ are both unknown, approximate confidence intervals for μ μ can still be constructed using T= X ¯ μ S/ n , T= X ¯ μ S/ n , which now only has an approximate t distribution.

Generally, this approximation is quite good for many normal distributions, in particular, if the underlying distribution is symmetric, unimodal, and of the continuous type. However, if the distribution is highly skewed, there is a great danger using this approximation. In such a situation, it would be safer to use certain nonparametric method for finding a confidence interval for the median of the distribution.

Confidence Interval for Variances

The confidence interval for the variance σ 2 σ 2 is based on the sample variance S 2 = 1 n1 i=1 n ( X i X ¯ ) 2 . S 2 = 1 n1 i=1 n ( X i X ¯ ) 2 .

In order to find a confidence interval for σ 2 σ 2 , it is used that the distribution of ( n1 ) S 2 / σ 2 ( n1 ) S 2 / σ 2 is χ 2 ( n1 ) χ 2 ( n1 ) . The constants a and b should selected from tabularized Chi Squared Distribution with n-1 degrees of freedom such that P( a ( n1 ) S 2 σ 2 b )=1α. P( a ( n1 ) S 2 σ 2 b )=1α.

That is select a and b so that the probabilities in two tails are equal: a= χ 1α/2 2 ( n1 ) a= χ 1α/2 2 ( n1 ) and b= χ α/2 2 ( n1 ). b= χ α/2 2 ( n1 ). Then, solving the inequalities, we have 1α=P( a ( n1 ) S 2 1 σ 2 b ( n1 ) S 2 )=P( ( n1 ) S 2 b σ 2 ( n1 ) S 2 a ). 1α=P( a ( n1 ) S 2 1 σ 2 b ( n1 ) S 2 )=P( ( n1 ) S 2 b σ 2 ( n1 ) S 2 a ).

Thus the probability that the random interval  [(n-1)S 2 /b, (n-1)S 2 /a]  [(n-1)S 2 /b, (n-1)S 2 /a] contains the unknown σ 2 σ 2 is 1- α α . Once the values of X 1 , X 2 ,..., X n X 1 , X 2 ,..., X n are observed to be x 1 , x 2 ,..., x n x 1 , x 2 ,..., x n and s 2 s 2 computed, then the interval [(n-1)S 2 /b, (n-1)S 2 /a] [(n-1)S 2 /b, (n-1)S 2 /a] is a 100( 1α )% 100( 1α )% confidence interval for σ 2 σ 2 .

It follows that [ ( n1 )/bs , ( n1 )/as ] [ ( n1 )/bs , ( n1 )/as ] is a 100( 1α )% 100( 1α )% confidence interval for σ σ , the standard deviation.

Example 2

Assume that the time in days required for maturation of seeds of a species of a flowering plant found in Mexico is N( μ, σ 2 ) N( μ, σ 2 ) . A random sample of n=13 seeds, both parents having narrow leaves, yielded x ¯ x ¯ =18.97 days and 12 s 2 = i=1 13 ( x x ¯ ) 2 =128.41 12 s 2 = i=1 13 ( x x ¯ ) 2 =128.41 .

A confidence interval for σ 2 σ 2 is [ 128.41 21.03 , 128.41 5.226 ]=[ 6.11,24.57 ] [ 128.41 21.03 , 128.41 5.226 ]=[ 6.11,24.57 ] , because 5.226= χ 0.95 2 ( 12 ) 5.226= χ 0.95 2 ( 12 ) and 21.03= χ 0.055 2 ( 12 ) 21.03= χ 0.055 2 ( 12 ) , what can be read from the tabularized Chi Squared Distribution. The corresponding 90% confidence interval for σ σ is [ 6.11 , 24.57 ]=[ 2.47,4.96 ]. [ 6.11 , 24.57 ]=[ 2.47,4.96 ].

Although a and b are generally selected so that the probabilities in the two tails are equal, the resulting 100( 1α )% 100( 1α )% confidence interval is not the shortest that can be formed using the available data. The tables and appendixes gives solutions for a and b that yield confidence interval of minimum length for the standard deviation.

Content actions

Give feedback:

My Favorites (?)

'My Favorites' is a special kind of lens which you can use to bookmark modules and collections. 'My Favorites' can only be seen by you, and collections saved in 'My Favorites' can remember the last module you were on. You need an account to use 'My Favorites'.

| A lens I own (?)

Definition of a lens

Lenses

A lens is a custom view of the content in the repository. You can think of it as a fancy kind of list that will let you see content through the eyes of organizations and people you trust.

What is in a lens?

Lens makers point to materials (modules and collections), creating a guide that includes their own comments and descriptive tags about the content.

Who can create a lens?

Any individual member, a community, or a respected organization.

What are tags?

Tags are descriptors added by lens makers to help label content, attaching a vocabulary that is meaningful in the context of the lens.

| External bookmarks