The Central Limit Theorem
12 May 2018Prologue To The Central Limit Theorem
Standardizing The Average
Given a large number of random variables Xi belonging to the same sample, with the same expect value μ and variance σ2, the law of large number guarantees the average would approximate to μ.
Here comes the question as what is the distribution of ¯Xn? Since each random variables Xi has the same μ and σ2, it would be a good idea to stablize the expect value and variance of ¯Xn.
We already know E[¯Xn]=μ and Var[¯Xn]=σ2n. What would be the acceptable expect value with regards to the stablized variance?
➀by E[¯Xn−μ]=0, we can zerolize the expect value, to be believed the smallest value.
➁next to make the variance stable, suppose there exists any c>0 such that Var[c⋅¯Xn] could be well stablized. If we can factor out whatever the variance residing in the distribution of ¯Xn itself, then there will be a hope. For unknown distribution, this is quiet difficult.
➂but, we would make it easy by taking c=√nσ, the mathematic thing guarantees the purity and balance of the variance, since Var[√nσ⋅¯Xn]=nσ2⋅Var[¯Xn]=1
➃Var[√nσ⋅(¯Xn−μ)]
=Var[√nσ⋅¯Xn], we can further stablize the variance in a centered average format.Above procedure is the standardization or the standardize process.
The Central Limit Theorem
Given X1,X2,…,Xn are identically independent distributed random variables, each has the same expect value μ and variance σ2, which are all finite.
Zn=¯Xn−μσ/√n;
For any n≥1, let Zn be any random variable, defined bythen, E[Zn]=0 and Var[Zn]=1.
Zn itself is the standard normal distribution, N(0,1), for any a, we have FZn(a)=ɸ(a).
We treat Zn as the standardized ¯Xn.
Example: Illustration Of The Central Limit Theorem
Suppose you are given a random sample of size 500 containing random variables X1,X2,…,X500, all of them coming from the same unknown distribution with each having expect value 2 and variance also 2.
After completing all the 500 runs of test, we get the experiement average of ¯Xn=2.06, do you think it a plausible result?To answer this question, we have to compute the probability of the case that ¯Xn is greater than or equal to 2.06.
P(¯Xn≥2.06)
=P(¯Xn−μ≥2.06−μ)
=P(¯Xn−μσ/√n≥2.06−μσ/√n)
=P(¯X500−μσ/√500≥2.06−2√2/√500)…μ=2,σ=√2
=P(Z500≥0.95)
=1-P(Z500<0.95)
=1-ɸ(0.95)
≈0.1711, it indicates that there exists probability of 0.1711 that the average is 0.06 larger than 2, the expect value of the real thing.
Since 0.1711 is quiet a large probability, it is rather weak to say that 2.06 is an abnormal experimental result of average. 2.06 would thus be plausible.