Skip to content Skip to navigation

Connexions

You are here: Home » Content » The Central Limit Theorem

Navigation

Content Actions

  • Download module PDF
  • Add to ...
    Add the module to:
    • My Favorites
    • A lens
    • An external social bookmarking service
    • My Favorites (What is 'My Favorites'?)
      'My Favorites' is a special kind of lens which you can use to bookmark modules and collections directly in Connexions. 'My Favorites' can only be seen by you, and collections saved in 'My Favorites' can remember the last module you were on. You need a Connexions account to use 'My Favorites'.
    • A lens (What is a lens?)

      Definition of a lens

      Lenses

      A lens is a custom view of Connexions content. You can think of it as a fancy kind of list that will let you see Connexions through the eyes of organizations and people you trust.

      What is in a lens?

      Lens makers point to Connexions materials (modules and collections), creating a guide that includes their own comments and descriptive tags about the content.

      Who can create a lens?

      Any individual Connexions member, a community, or a respected organization.

    • External bookmarks
  • E-mail the author

Recently Viewed

This feature requires Javascript to be enabled.

The Central Limit Theorem

Module by: Don Johnson

Let X l X l denote a sequence of independent, identically distributed, random variables. Assuming they have zero means and finite variances (equaling σ2 σ 2 ), the Central Limit Theorem states that the sum l=1L X l L l 1 L X l L converges in distribution to a Gaussian random variable. 1Ll=1L X l L 0σ2 1 L l 1 L X l L 0 σ 2 Because of its generality, this theorem is often used to simplify calculations involving finite sums of non-Gaussian random variables. However, attention is seldom paid to the convergence rate of the Central Limit Theorem. Kolmogorov, the famous twentieth century mathematician, is reputed to have said, "The Central Limit Theorem is a dangerous tool in the hands of amateurs." Let's see what he meant.

Taking σ2=1 σ 2 1 , the key result is that the magnitude of the difference between Px P x , defined to be the probability that the sum given above exceeds xx, and Qx Q x , the probability that a unit-variance Gaussian random variable exceeds xx, is bounded by a quantity inversely related to the square root of LL (Cramer: Theorem 24). |Px-Qx|cE|X|3σ31L P x Q x c X 3 σ 3 1 L The constant of proportionality c c is a number known to be about 0.8 (Hall: p6). The ratio of absolute third moment of X l X l to the cube of its standard deviation, known as the skew and denoted by γ X γ X , depends only on the distribution of X l X l and is independent of scale. This bound on the absolute error has been shown to be tight (Cramer: pp. 79ff). Using our lower bound for Q· Q · (see (Reference)), we find that the relative error in the Central Limit Theorem approximation to the distribution of finite sums is bounded for x>0 x 0 as

|Px-Qx|Qxc γ X 2πL+x222ifx11+x2xifx>1 P x Q x Q x c γ X 2 L x 2 2 2 x 1 1 x 2 x x 1 (1)
Suppose we require that the relative error not exceed some specific value εε. The normalized (by the standard deviation) boundary xx at which the approximation is evaluated must not violate Lε22πc2 γ X 2x24ifx11+x2x2ifx>1 L ε 2 2 c 2 γ X 2 x 2 4 x 1 1 x 2 x 2 x 1 As shown in Figure 1, the right side of this equation is a monotonically increasing function.
Figure 1: The quantity which governs the limits of validity for numerically applying the Central Limit Theorem on finite numbers of data is shown over a portion of its range. To judge these limits, we must compute the quantity Lε22πc2 γ X L ε 2 2 c 2 γ X , where εε denotes the desired percentage error in the Central Limit Theorem approximation and LL the number of observations. Selecting this value on the vertical axis and determining the value of xx yielding it, we find the normalized ( x=1 x 1 implies unit variance) upper limit on an LL-term sum to which the Central Limit Theorem is guaranteed to apply. Note how rapidly the curve increases, suggesting that large amounts of data are needed for accurate approximation.
Figure 1 (clt.png)

Example 1

If ε=0.1 ε 0.1 and taking c γ X c γ X arbitrarily to be unity (a reasonable value), the upper limit of the preceding equation becomes 1.6×10-3L 1.6-3 L . Examining Figure 1, we find that for L=10000 L 10000 , xx must not exceed 1.17. Because we have normalized to unit variance, this example suggests that the Gaussian approximates the distribution of a ten-thousand term sum only over a range corresponding to a 76% area about the mean. Consequently, the Central Limit Theorem, as a finite-sample distributional approximation, is only guaranteed to hold near the mode of the Gaussian, with huge numbers of observations needed to specify the tail behavior. Realizing this fact will keep us from being ignorant amateurs.

References

  1. H. Cramér. (1970). Random Variables and Probability Distributions. (Third Edition). Cambridge University Press.
  2. P. Hall. (1982). Rates of Convergence in the Central Limit Theorem. In Research Notes in Mathematics. (Vol. 62, p. 6). Pitman Advanced Publishing Program.

Comments, questions, feedback, criticisms?

Send feedback