Summary: Calculations for determining the required sample sized when calculation a confidence interval for the population mean or population proportion.
Prior to creating a confidence interval a sample must be taken. Often the number of data values needed in a sample to obtain a particular level of confidence within a given error needs to be determined prior to taking the sample. If the sample is too small the result may not be useful and if the sample is too big both time and money are wasted in the sampling.
From the formula for the error bound, the following formula can be derived:
Often the population standard deviation is unknown. Often the sample standard deviation from a previous sample of size greater than 30 may be used as an approximation to σ.
Often times the value found by using the formula for sample size is not a whole number. However the sample size must be a whole number, so always round up to the next larger whole number.
Suppose the scores on a statistics final are normally distributed with a standard deviation of 10 points. You have been asked to construct a 95% confidence interval with an error of no more than 2 points.
E = 2
σ = 10
Hence, a sample of size 68, must be taken to create a 95% confidence interval with an error of no more than two points.
To determine the sample size necessary to ensure a given error for a particular confidence level, the formula for the error bound can be rewritten as follows:
Generally the population proportion is unknown and p’ is determined using a previous sample. Hence
If there is no previous sample then p = 0.5 is used since it maximized the value of p(1 - p). Hence
Suppose Halmark wish to know what proportion of oldest children buy their mothers a Mother’s Day Card. (See example 8 -5) How many people must be sampled is they wish to be 95% certain that the proportion is within 2%?
a) Use the following sample data as an estimate for the population proportion.
Given that 421 of 500 responded in the affirmative, p’ =
E = 0.02
p’ = 0.842
Hence 900 people need to be surveyed to ensure a 95% confidence interval with an error of at most 2%.
b) Suppose there is no previous sample. How many people need to be surveyed?
E = 0.02
assume p = 0.5
Hence 1692 people need to be surveyed to ensure a 95% confidence interval with an error of at most 2%.
Note that not having a previous sample greatly increases the number of data values needed in a sample. Often a pilot study is done to generate an approximation for p.