mjtsai1974's Dev Blog Welcome to mjt's AI world

The Central Limit Theorem

Prologue To The Central Limit Theorem

The central limit theorem is a refinement of the law of large number. For a large number of random variables X1,X2,...,Xn, with converged expect value and finite variance, the standardization process would settle down ¯Xn in a normal distribution, irrelevant to the original distribution these Xi is belonging to.

Standardizing The Average

Given a large number of random variables Xi belonging to the same sample, with the same expect value μ and variance σ2, the law of large number guarantees the average would approximate to μ.

Here comes the question as what is the distribution of ¯Xn? Since each random variables Xi has the same μ and σ2, it would be a good idea to stablize the expect value and variance of ¯Xn.

We already know E[¯Xn]=μ and Var[¯Xn]=σ2n. What would be the acceptable expect value with regards to the stablized variance?
➀by E[¯Xnμ]=0, we can zerolize the expect value, to be believed the smallest value.
➁next to make the variance stable, suppose there exists any c>0 such that Var[c¯Xn] could be well stablized. If we can factor out whatever the variance residing in the distribution of ¯Xn itself, then there will be a hope. For unknown distribution, this is quiet difficult.
➂but, we would make it easy by taking c=nσ, the mathematic thing guarantees the purity and balance of the variance, since Var[nσ¯Xn]=nσ2Var[¯Xn]=1
Var[nσ(¯Xnμ)]
=Var[nσ¯Xn], we can further stablize the variance in a centered average format.

Above procedure is the standardization or the standardize process.

The Central Limit Theorem

Given X1,X2,…,Xn are identically independent distributed random variables, each has the same expect value μ and variance σ2, which are all finite.
For any n1, let Zn be any random variable, defined by

Zn=¯Xnμσ/n;

then, E[Zn]=0 and Var[Zn]=1.

Zn itself is the standard normal distribution, N(0,1)

, for any a, we have FZn(a)=ɸ(a).

We treat Zn as the standardized ¯Xn.

Example: Illustration Of The Central Limit Theorem

Suppose you are given a random sample of size 500 containing random variables X1,X2,…,X500, all of them coming from the same unknown distribution with each having expect value 2 and variance also 2.

After completing all the 500 runs of test, we get the experiement average of ¯Xn=2.06, do you think it a plausible result?

To answer this question, we have to compute the probability of the case that ¯Xn is greater than or equal to 2.06.
P(¯Xn2.06)
=P(¯Xnμ2.06μ)
=P(¯Xnμσ/n2.06μσ/n)
=P(¯X500μσ/5002.0622/500)μ=2,σ=2
=P(Z5000.95)
=1-P(Z500<0.95)
=1-ɸ(0.95)
0.1711, it indicates that there exists probability of 0.1711 that the average is 0.06 larger than 2, the expect value of the real thing.
Since 0.1711 is quiet a large probability, it is rather weak to say that 2.06 is an abnormal experimental result of average. 2.06 would thus be plausible.