Chebyshev's Inequality

This article is not about Chebyshev's sum inequality.

Chebyshev's inequality (or Tchebysheff's inequality or Chebyshev's theorem), named in honor of Pafnuty Chebyshev, is a result in probability theory that gives a lower bound for the probability that a value of a random variable of any distribution with finite variance lies within a certain distance from the variable's mean; equivalently, the theorem provides an upper bound for the probability that values lie outside the same distance from the mean.

The theorem

'Theorem.' Let X be a random variable with mean μ and finite variance σ2. Then for any real number k > 0,
P(\left|X-\mu\right|\geq k\sigma)\leq\frac{1}{k^2}.
Only the cases k > 1 provide useful information. Another consequence of the theorem is that for any distribution with mean μ and finite standard deviation σ, at least half of the values lie in the interval (μ − √2 σ, μ + √2 σ). Typically, the theorem will provide rather loose bounds. However, the bounds provided by Chebyshev's inequality cannot, in general (remaining sound for variables of arbitrary distribution), be improved upon. For example, for any k > 1, the following example (where σ = 1/k) meets the bounds exactly.
\begin{matrix}P(X=-1) & = & 1/2k^2 \\ P(X=0) & = & 1 - 1/k^2 \\ P(X=1) & = & 1/2k^2 \end{matrix}
The theorem can be useful despite loose bounds because it applies to random variables of any distribution, and because these bounds can be calculated knowing no more about the distribution than the mean and variance. Chebyshev's inequality is used for proving the weak law of large numbers.

Variants

A one-tailed variant with k > 0, is
P(X-\mu \geq k\sigma)\leq\frac{1}{1+k^2}.
A stronger result applicable to unimodal probability distributions is the Vysochanskiï-Petunin inequality.

Example application

For illustration, assume Wikipedia articles are on average 1000 characters long with a standard deviation of 200 characters. From Chebyshev's inequality we can then deduce that at least 75% of Wikipedia articles have a length between 600 and 1400 characters (k = 2).

Proof

Markov's inequality is applied with a=k^2, and X as (X-\mu)^2. A direct proof shows why bounds we get are quite loose:
math>P| \geq k\sigma = \int_{\left \{\omega :\, \left | X(\omega)-\mu \right | \geq k\sigma\right \} } 1 \, dP
|\leq \int_{\left \{\omega :\, \left | X(\omega)-\mu \right | \geq k\sigma\right \} } \left (\frac{X-\mu }{k\sigma} \right )^2\, dP
|\leq \frac{1}{k^2\sigma^2}\int_R (X-\mu )^2 dP = \frac{1}{k^2}
First, we increase the 1 in integrand and then we increase the set over which we integrate to all reals, so any probability of not being at the mean or the boundary points loosens the inequality.

See also

 

<< PreviousWord BrowserNext >>
open directory license
places in afghanistan
tahj mowry
analog synthesizer
andrei alexandrescu
jean pierre boyer
schwimmwagen
amphibian vehicle
wyrd
geography of ireland
harry potter and the order of the phoenix
university of trinity college
mountbatten class hovercraft
colovaginoplasty
heikant
gukurahundi
grantham grammar school
foghorn leghorn
alfred moisiu
kor
fanny ardant
geraldo rivera
university of liverpool
caxton
udon
williams revolution
super furry animals
dafydd ieuan
two (the twilight zone)
single (music)
b side
the arrival (the twilight zone)
unicode characters for the arabic alphabet
cian ciaran
stepin fetchit
remix
gruff rhys
audio mixing
guto pryce
huw bunford
ankst
the shelter (the twilight zone)
baby boy
the passersby