Discrete Random Variables: Hypergeometric (optional)

Module by: Barbara Illowsky, Ph.D., Susan Dean.

Summary: This module describes the properties of a hypergeometric experiment and hypergeometric probability distribution. This module is included in the Elementary Statistics textbook/collection as an optional lesson. Note: This module is currently under revision, and its content is subject to change. This module is being prepared as part of a statistics textbook that will be available for the Fall 2008 semester.

Note: You are viewing an old version of this document. The latest version is available here.

The characteristics of a hypergeometric experiment are:

  1. You take samples from 2 groups.
  2. You are concerned with a group of interest, called the first group.
  3. You sample without replacement from the combined groups. For example, you want to choose a softball team from a combined group of 11 men and 13 women. The team consists of 10 players.
  4. Each pick is not independent, since sampling is without replacement. In the softball example, the probability of picking a woman first is $\frac{13}{24}$. The probability of picking a man second is $\frac{11}{23}$ if a woman was picked first. It is $\frac{10}{23}$ if a man was picked first. The probability of the second pick depends on what happened in the first pick.
  5. You are not dealing with Bernoulli Trials.
The outcomes of a hypergeometric experiment fit a hypergeometric probability distribution. The mean and variance are given in the summary in this chapter.
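The dependence between picks in the softball example can be checked with exact fractions. This is a minimal sketch; the variable names are illustrative and do not come from the module.

```python
from fractions import Fraction

MEN, WOMEN = 11, 13          # group sizes from the softball example
TOTAL = MEN + WOMEN          # 24 people to choose from

# First pick: 13 women among 24 people.
p_woman_first = Fraction(WOMEN, TOTAL)

# Second pick: only 23 people remain, and the number of men left
# depends on who was picked first.
p_man_second_given_woman_first = Fraction(MEN, TOTAL - 1)
p_man_second_given_man_first = Fraction(MEN - 1, TOTAL - 1)

print(p_woman_first)                    # 13/24
print(p_man_second_given_woman_first)   # 11/23
print(p_man_second_given_man_first)     # 10/23
```

Because the two conditional probabilities differ, the picks are not independent, which is why these are not Bernoulli trials.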

Example 1

A candy dish contains 100 jelly beans and 80 gumdrops. Fifty candies are picked at random. What is the probability that 35 of the 50 are gumdrops? The two groups are jelly beans and gumdrops. Since the probability question asks for the probability of gumdrops, the group of interest (first group) is gumdrops. The size of the group of interest (first group) is 80. The size of the second group is 100. The size of the sample is 50 (jelly beans or gumdrops). Let $X$ = the number of gumdrops in the sample of 50. $X$ takes on the values $x$ = 0, 1, 2, ..., 50. The probability question is $P(X = 35)$.
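The module sets up the probability question without computing it. One way to evaluate it is sketched below, using the standard counting formula for sampling without replacement: the number of ways to choose $x$ from the first group, times the number of ways to choose the rest from the second group, divided by the number of ways to choose the whole sample. The function name `hypergeom_pmf` and its parameter names are assumptions made for this illustration.

```python
from math import comb

def hypergeom_pmf(x, r, b, n):
    """P(X = x) when n items are drawn without replacement from r + b items.

    r = size of the group of interest, b = size of the second group.
    """
    # Counts that cannot occur get probability 0.
    if x < 0 or x > r or n - x < 0 or n - x > b:
        return 0.0
    return comb(r, x) * comb(b, n - x) / comb(r + b, n)

# Example 1: 80 gumdrops (group of interest), 100 jelly beans, sample of 50.
p_35 = hypergeom_pmf(35, r=80, b=100, n=50)
print(f"P(X = 35) = {p_35:.3e}")
```

Python integers do not overflow, so the large binomial coefficients are computed exactly before the final division.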

Example 2

Suppose a shipment of 100 VCRs is known to have 10 defective VCRs. An inspector chooses 12 for inspection. He is interested in determining the probability that, among the 12, at most 2 are defective. The two groups are the 90 non-defective VCRs and the 10 defective VCRs. The group of interest (first group) is the defective group because the probability question asks for the probability of at most 2 defective VCRs. The size of the sample is 12 VCRs. (They may be non-defective or defective.) Let $X$ = the number of defective VCRs in the sample of 12. $X$ takes on the values 0, 1, 2, ..., 10. $X$ may not take on the values 11 or 12. The sample size is 12, but there are only 10 defective VCRs. The inspector wants to know $P(X \le 2)$ ("At most" means "less than or equal to").
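An "at most" probability is a sum of individual probabilities. The sketch below adds the cases of 0, 1, and 2 defective VCRs for the inspector's sample, using the standard counting formula for sampling without replacement. The helper `hypergeom_pmf` is a hypothetical name used only for this illustration.

```python
from math import comb

def hypergeom_pmf(x, r, b, n):
    """P(X = x): x items from the group of r, n - x from the group of b."""
    if x < 0 or x > r or n - x < 0 or n - x > b:
        return 0.0  # impossible count, e.g. 11 defectives when only 10 exist
    return comb(r, x) * comb(b, n - x) / comb(r + b, n)

r, b, n = 10, 90, 12  # defective, non-defective, sample size

# "At most 2" means X = 0, 1, or 2, so add those three probabilities.
p_at_most_2 = sum(hypergeom_pmf(x, r, b, n) for x in range(3))
print(f"P(X <= 2) = {p_at_most_2:.4f}")

# X cannot be 11 or 12, so those values have probability 0.
print(hypergeom_pmf(11, r, b, n))  # 0.0
```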

Example 3

You are president of an on-campus special events organization. You need a committee of 7 to plan a special birthday party for the president of the college. Your organization consists of 18 women and 15 men. You are interested in the number of men on your committee. What is the probability that your committee has more than 4 men?

This is a hypergeometric problem because you are choosing your committee from two groups (men and women). Are you choosing with or without replacement? What is the group of interest? How many are in the group of interest? How many does the other group have in it? Let $X$ = ____________ on the committee. What values does $X$ take on?

The probability question is $P(\_\_\_\_\_\_\_)$.

Notation for the Hypergeometric: H = Hypergeometric Probability Distribution Function

$X \sim H(r, b, n)$

Read this as "$X$ is a random variable with a hypergeometric distribution." The parameters are $r$, $b$, and $n$. $r$ = the size of the group of interest (first group), $b$ = the size of the second group, $n$ = the size of the chosen sample.
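The module relies on a calculator or computer for the probabilities themselves. For reference, the standard hypergeometric probability function, written here in the $r$, $b$, $n$ notation above (the symbol $x$ for the observed count is the usual convention, not something this section defines formally), is:

```latex
P(X = x) = \frac{\binom{r}{x}\,\binom{b}{n-x}}{\binom{r+b}{n}},
\qquad \max(0,\; n-b) \le x \le \min(n,\; r)
```

The numerator counts the samples that contain exactly $x$ items from the group of interest and $n - x$ from the second group; the denominator counts all possible samples of size $n$.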

Example 4: Hypergeometric

A school site committee is to be chosen from 6 men and 5 women. If the committee consists of 4 members, what is the probability that 2 of them are men? How many men do you expect to be on the committee?

Let $X$ = the number of men on the committee of 4. The men are the group of interest (first group).

$X$ takes on the values 0, 1, 2, 3, 4, where $r = 6$, $b = 5$, and $n = 4$. $X \sim H(6, 5, 4)$

Find $P(X = 2)$. $P(X = 2) = 0.4545$ (calculator or computer)

Note:

Currently, the TI-83+ and TI-84 do not have hypergeometric probability functions. There are a number of computer packages, including Microsoft Excel, that do.

The probability that there are 2 men on the committee is about 0.45.
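The value 0.4545 can be reproduced by direct counting. The sketch below is one possible check, not the authors' method. It uses exact fractions and the standard counting formula, and `pmf` is an illustrative name.

```python
from fractions import Fraction
from math import comb

r, b, n = 6, 5, 4   # men, women, committee size

def pmf(x):
    """Exact P(X = x) for X ~ H(6, 5, 4)."""
    return Fraction(comb(r, x) * comb(b, n - x), comb(r + b, n))

# Print the whole distribution: value, exact probability, decimal.
for x in range(n + 1):
    print(x, pmf(x), round(float(pmf(x)), 4))

# P(X = 2) = (15 * 10) / 330 = 5/11, which rounds to 0.4545.
```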

The graph of $X \sim H(6, 5, 4)$ is:

[Figure: bar graph of the hypergeometric probability distribution function. It has five bars in a roughly bell-shaped pattern. The x-axis shows the number of men on the committee of 4, from 0 to 4. The y-axis runs from 0 to 0.5 in increments of 0.1.]

The $y$-axis contains the probability of $X$, where $X$ = the number of men on the committee.

You would expect $\mu = 2.18$ (about 2) men on the committee.

The formula for the mean is $\mu = \frac{nr}{r+b} = \frac{4 \cdot 6}{6+5} = 2.18$
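As a check on the mean, the expected value can also be computed from its definition, the sum of each value times its probability, and compared with the shortcut formula. This is a minimal sketch with illustrative names, using the standard counting formula for the probabilities.

```python
from fractions import Fraction
from math import comb

r, b, n = 6, 5, 4   # men, women, committee size

def pmf(x):
    """Exact P(X = x) for X ~ H(6, 5, 4)."""
    return Fraction(comb(r, x) * comb(b, n - x), comb(r + b, n))

# Definition of the mean: sum of x * P(X = x) over all possible values.
mean_from_definition = sum(x * pmf(x) for x in range(n + 1))

# Shortcut formula from the text: n * r / (r + b).
mean_from_formula = Fraction(n * r, r + b)

print(mean_from_definition, mean_from_formula)   # 24/11 24/11
print(round(float(mean_from_formula), 2))        # 2.18
```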

The formula for the variance is fairly complex. You will find it in the Summary of the Discrete Probability Functions Chapter [link pending].
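For reference, the standard variance formula for a hypergeometric distribution, rewritten in this module's $r$, $b$, $n$ notation, is given below. How the referenced summary presents it is not shown here, so the layout is an assumption.

```latex
\sigma^2 = \frac{n\,r\,b\,(r+b-n)}{(r+b)^2\,(r+b-1)}
```

For Example 4, with $r = 6$, $b = 5$, and $n = 4$, this gives $\sigma^2 = \frac{4 \cdot 6 \cdot 5 \cdot 7}{11^2 \cdot 10} = \frac{840}{1210} \approx 0.694$.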

Glossary

Hypergeometric Probability:
A discrete random variable (RV) with characteristics:
  • There is a fixed number of trials.
  • The probability of success is not the same from trial to trial, so it is not Bernoulli trials.
The typical example is sampling from a mixture of two groups of items, when we are interested in only one of the groups. $X$ is defined as the number of successes out of the total number chosen. The notation is: $X \sim H(r, b, n)$, where $r$ = number of items in the group of interest, $b$ = number of items in the group not of interest, and $n$ = number of items chosen.
