Skip to content Skip to navigation

Connexions

You are here: Home » Content » Chebyshev’s Inequality

Navigation

Content Actions

  • Download module PDF
  • Add to ...
    Add the module to:
    • My Favorites
    • A lens
    • An external social bookmarking service
    • My Favorites (What is 'My Favorites'?)
      'My Favorites' is a special kind of lens which you can use to bookmark modules and collections directly in Connexions. 'My Favorites' can only be seen by you, and collections saved in 'My Favorites' can remember the last module you were on. You need a Connexions account to use 'My Favorites'.
    • A lens (What is a lens?)

      Definition of a lens

      Lenses

      A lens is a custom view of Connexions content. You can think of it as a fancy kind of list that will let you see Connexions through the eyes of organizations and people you trust.

      What is in a lens?

      Lens makers point to Connexions materials (modules and collections), creating a guide that includes their own comments and descriptive tags about the content.

      Who can create a lens?

      Any individual Connexions member, a community, or a respected organization.

      What are tags? tag icon

      Tags are descriptors added by lens makers to help label content, attaching a vocabulary that is meaningful in the context of the lens.

    • External bookmarks
  • E-mail the author
  • Rate this module (How does the rating system work?)

    Rating system

    Ratings

    Ratings allow you to judge the quality of modules. If other users have ranked the module then its average rating is displayed below. Ratings are calculated on a scale from one star (Poor) to five stars (Excellent).

    How to rate a module

    Hover over the star that corresponds to the rating you wish to assign. Click on the star to add your rating. Your rating should be based on the quality of the content. You must have an account and be logged in to rate content.

    (0 ratings)

Recently Viewed

This feature requires Javascript to be enabled.

Chebyshev’s Inequality

Module by: Ewa Paszek

Summary: This course is a short series of lectures on Introductory Statistics. Topics covered are listed in the Table of Contents. The notes were prepared by Ewa Paszek and Marek Kimmel. The development of this course has been supported by NSF 0203396 grant.

Note: Your browser may not currently support MathML. See our browser support page for additional details. You can always view the correct math in the PDF version.

Chebyshev’s Inequality

In this paragraph the Chebyshev’s inequality is used to show, in another sense, that the sample mean, x ¯ x ¯ , is a good statistic to use to estimate a population with mean μ μ ; the relative frequency of successes in n Bernoulli trials, y/n y/n , is a good statistic for estimating p; and the empirical distribution function, F n ( x ) F n ( x ) , can be used to estimate the theoretical distribution function F( x ) F( x ) . The effect of the sample size n on these estimates is discussed.

At the beginning, it is showed that the Chebyshev’s inequality gives added significance to the standard deviation in terms of bounding certain probabilities. The inequality is valid for all distributions for which the standard deviation exists. The proof is given for the discrete case, but it holds for the continuous case with integrals replacing summations.

Theorem 1: Chebyshev’s Inequality

If the random variable X has a mean μ μ and variance σ 2 σ 2 , then for every k1 k1 ,

P( | Xμ |kσ ) 1 k 2 . P( | Xμ |kσ ) 1 k 2 .

Proof

Let f( x ) f( x ) denote p.d.f. of X. Then σ 2 =E[ ( Xμ ) 2 ]= xR ( xμ ) 2 f( x ) = xA ( xμ ) 2 f( x ) + xA' ( xμ ) 2 f( x ) , σ 2 =E[ ( Xμ ) 2 ]= xR ( xμ ) 2 f( x ) = xA ( xμ ) 2 f( x ) + xA' ( xμ ) 2 f( x ) ,

where A=( x:| xμ |kσ ) A=( x:| xμ |kσ ) . The second term in the right-hand member of the equation is the sum of nonnegative numbers and thus is greater than or equal to zero, Hence σ 2 xA ( xμ ) 2 f( x ) . σ 2 xA ( xμ ) 2 f( x ) .

However, in A, | xμ |kσ | xμ |kσ so σ 2 xA ( kσ ) 2 f( x )= k 2 σ 2 xA f( x ) . σ 2 xA ( kσ ) 2 f( x )= k 2 σ 2 xA f( x ) .

But the latter summation equals P( XA ) P( XA ) , and thus σ 2 k 2 σ 2 P( XA )= k 2 σ 2 P( | Xμ |kσ ). σ 2 k 2 σ 2 P( XA )= k 2 σ 2 P( | Xμ |kσ ).

That is, P( | Xμ |kσ ) 1 k 2 . P( | Xμ |kσ ) 1 k 2 .

COROLLARY

If ε=kσ ε=kσ , then P( | Xμ |ε ) σ 2 ε 2 . P( | Xμ |ε ) σ 2 ε 2 .

In words, Chebyshev’s inequality states that the probability that X differs from its mean by at least k standard deviations is less than or equal to 1 k 2 1 k 2 . It follows that the probability that X differs from its mean by less than k standard deviations is at least 1 k 2 1 k 2 . That is,

P( | Xμ |<kσ )1 1 k 2 . P( | Xμ |<kσ )1 1 k 2 .

From the corollary, it also follows that P( | Xμ |<ε )1 σ 2 ε 2 . P( | Xμ |<ε )1 σ 2 ε 2 .

Thus Chebyshev’s inequality can be used as a bound for certain probabilities. However, in many instances, the bound is not very close to the true probability.

Example 1

If it is known that X has a mean of 25 and a variance of 16, then, σ=4 σ=4 a lower bound for P( 17<X<33 ) P( 17<X<33 ) is given by

P( 17<X<33 )=P( | X25 |<8 )=P( | Xμ |<2σ )1 1 4 =0.75, P( 17<X<33 )=P( | X25 |<8 )=P( | Xμ |<2σ )1 1 4 =0.75,

and an upper bound for P( | X25 |12 ) P( | X25 |12 ) is found to be

P( | X25 |12 )=P( | Xμ |3σ ) 1 9 . P( | X25 |12 )=P( | Xμ |3σ ) 1 9 .

note that:

Note that the results of the last example hold for any distribution with mean 25 and standard deviation 4. But, even stronger, the probability that any random variable X differs from its mean by 3 or more standard deviations is at most 1/9 by letting k =3 in the theorem. Also the probability that any random variable X differs from its mean by less than 2 standard deviations is at least 3/4 by letting k=2.

Comments, questions, feedback, criticisms?

Send feedback