# Connexions

You are here: Home » Content » Confidence Intervals: Introduction

### Recently Viewed

This feature requires Javascript to be enabled.

### Tags

(What is a tag?)

These tags come from the endorsement, affiliation, and other lenses that include this content.

# Confidence Intervals: Introduction

Note: You are viewing an old version of this document. The latest version is available here.

## Student Learning Objectives

By the end of this chapter, the student should be able to:

• Calculate and interpret confidence intervals for one population average and one population proportion.
• Interpret the student-t probability distribution as the sample size changes.
• Discriminate between problems applying the normal and the student-t distributions.

## Introduction

Suppose you are trying to determine the average rent of a two-bedroom apartment in your town. You might look in the classified section of the newspaper, write down several rents listed, and average them together. You would have obtained a point estimate of the true mean. If you are trying to determine the percent of times you make a basket when shooting a basketball, you might count the number of shots you make and divide that by the number of shots you attempted. In this case, you would have obtained a point estimate for the true proportion.

We use sample data to make generalizations about an unknown population. This part of statistics is called inferential statistics. The sample data help us to make an estimate of a population parameter. We realize that the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct confidence intervals in which we believe the parameter lies.

In this chapter, you will learn to construct and interpret confidence intervals. You will also learn a new distribution, the Student-t, and how it is used with these intervals.

If you worked in the marketing department of an entertainment company, you might be interested in the average number of compact discs (CD's) a consumer buys per month. If so, you could conduct a survey and calculate the sample average, x¯ x , and the sample standard deviation, ss. You would use x¯ x to estimate the population mean and ss to estimate the population standard deviation. The sample mean, x¯ x , is the point estimate for the population mean, μμ. The sample standard deviation, ss, is the point estimate for the population standard deviation, σσ.

Each of x¯ x and ss is also called a statistic.

A confidence interval is another type of estimate but, instead of being just one number, it is an interval of numbers. The interval of numbers is an estimated range of values calculated from a given set of sample data. The confidence interval is likely to include an unknown population parameter.

Suppose for the CD example we do not know the population mean μμ but we do know that the population standard deviation is σ=1σ=1 and our sample size is 100. Then by the Central Limit Theorem, the standard deviation for the sample mean is

σ n = 1 100 = 0.1 σ n = 1 100 =0.1.

The Empirical Rule, which applies to bell-shaped distributions, says that in approximately 95% of the samples, the sample mean, x¯ x , will be within two standard deviations of the population mean μμ. For our CD example, two standard deviations is (2)(0.1) = 0.2(2)(0.1) = 0.2. The sample mean x¯ x is within 0.2 units of μμ.

Because x¯ x is within 0.2 units of μμ, which is unknown, then μμ is within 0.2 units of x¯ x in 95% of the samples. The population mean μμ is contained in an interval whose lower number is calculated by taking the sample mean and subtracting two standard deviations ((2)(0.1)(2)(0.1)) and whose upper number is calculated by taking the sample mean and adding two standard deviations. In other words, μμ is between x¯ - 0.2 x -0.2 and x¯ + 0.2 x +0.2 in 95% of all the samples.

For the CD example, suppose that a sample produced a sample mean x¯ = 2 x =2. Then the unknown population mean μμ is between

x¯ - 0.2 = 2 - 0.2 = 1.8 x -0.2=2-0.2=1.8 and x¯ + 0.2 = 2 + 0.2 = 2.2 x +0.2=2+0.2=2.2

We say that we are 95% confident that the unknown population mean number of CDs is between 1.8 and 2.2. The 95% confidence interval is (1.8, 2.2).

The 95% confidence interval implies two possibilities. Either the interval (1.8, 2.2) contains the true mean μμ or our sample produced an x¯ x that is not within 0.2 units of the true mean μμ. The second possibility happens for only 5% of all the samples (100% - 95%).

Remember that a confidence interval is created for an unknown population parameter like the population mean, μμ. A confidence interval has the form

(point estimate - margin of error, point estimate + margin of error)

The margin of error depends on the confidence level or percentage of confidence.

## Optional Collaborative Classroom Activity

Have your instructor record the number of meals each student in your class eats out in a week. Assume that the standard deviation is known to be 3 meals. Construct an approximate 95% confidence interval for the true average number of meals students eat out each week.

1. Calculate the sample mean.
2. σ = 3 σ=3 and n = n= the number of students surveyed.
3. Construct the interval ( x¯ - 2 σ n , x¯ + 2 σ n ) ( x -2 σ n , x +2 σ n )

We say we are approximately 95% confident that the true average number of meals that students eat out in a week is between __________ and ___________.

## Glossary

Confidence Interval (CI):
An interval estimate for an unknown population parameter. This depends on:
• The desired confidence level.
• Information that is known about the distribution (for example, known standard deviation).
• The sample and its size.
Inferential Statistics :
Also called statistical inference or inductive statistics. This facet of statistics deals with estimating a population parameter based on a sample statistic. For example, if 4 out of the 100 calculators sampled are defective we might infer that 4 percent of the production is defective.
Parameter:
A numerical characteristic of the population.
Point Estimate:
A single number computed from a sample and used to estimate a population parameter.

## Content actions

### Add module to:

My Favorites (?)

'My Favorites' is a special kind of lens which you can use to bookmark modules and collections. 'My Favorites' can only be seen by you, and collections saved in 'My Favorites' can remember the last module you were on. You need an account to use 'My Favorites'.

| A lens I own (?)

#### Definition of a lens

##### Lenses

A lens is a custom view of the content in the repository. You can think of it as a fancy kind of list that will let you see content through the eyes of organizations and people you trust.

##### What is in a lens?

Lens makers point to materials (modules and collections), creating a guide that includes their own comments and descriptive tags about the content.

##### Who can create a lens?

Any individual member, a community, or a respected organization.

##### What are tags?

Tags are descriptors added by lens makers to help label content, attaching a vocabulary that is meaningful in the context of the lens.

| External bookmarks