Central Limit Theorem
Statement
Let be i.i.d. with mean and finite variance . Define the standardized sum
Then converges in distribution to the standard normal :
The moment generating function (MGF) route: if is finite near 0, then , which is the MGF of .
Visualization
Histograms of standardized by , for :
n=1 (2 bars) n=5 (6 bars)
P P
0.5|█ █ 0.3| ██
| | ████
| 0.1| ████████
+--0--1 +--0--5
n=30 (bell emerging) n=100 (near-Gaussian)
P P
0.15| ████ 0.08| ████████
| ████████ | ████████████
| ██████████ | ██████████████
| ████████████ | ████████████████
+----0----30 +------0------100
↑ bell shape clearly visible for n≥30
For , : , . The probability , matching .
Proof Sketch
- Taylor expand the MGF. For the centred variable , write .
- Independence gives product. The MGF of is by independence of the .
- Limit of . The product converges to .
- MGF convergence implies distributional convergence (Lévy continuity theorem via characteristic functions).
- Identify the limit. is the MGF of .
Mathlib v4.28.0 does not yet contain a full general CLT statement. The characteristic function of the Gaussian distribution is available as ProbabilityTheory.charFun_gaussianReal.
Connections
The CLT builds directly on the Strong Law of Large NumbersStrong Law of Large NumbersThe sample mean of i.i.d. integrable random variables converges almost surely to the population meanRead more → and the Characteristic FunctionCharacteristic FunctionThe characteristic function φ_X(t) = E[exp(itX)] uniquely determines the distribution of a random variableRead more → machinery for measuring distributional convergence. The bound on convergence rates is given by the Berry–Esseen theorem, which sharpens Chebyshev's InequalityChebyshev's InequalityA random variable rarely deviates from its mean by more than a few standard deviationsRead more →.
Lean4 Proof
import Mathlib.Probability.Distributions.Gaussian.Real
import Mathlib.MeasureTheory.Measure.CharacteristicFunction
open ProbabilityTheory MeasureTheory Complex
/-- The characteristic function of the standard Gaussian N(0,1) is exp(-t²/2).
This is the key identity at the heart of the CLT (via Lévy's continuity theorem).
Mathlib: `ProbabilityTheory.charFun_gaussianReal` (with μ=0, v=1). -/
theorem clt_key_identity (t : ℝ) :
charFun (gaussianReal (μ := 0) (v := 1)) t = cexp (-(t : ℂ) ^ 2 / 2) := by
rw [charFun_gaussianReal]
simp only [mul_zero, zero_mul, ofReal_one, one_mul]
ring_nfReferenced by
- Characteristic FunctionProbability
- Weak Convergence (Distribution)Probability