Central Limit Theorem

The Central Limit Theorem states that if you take sufficiently large random samples from the population with replacement (whose values are summed), then the distribution of the sample means will be approximately normally distributed.

title: The Sampling Distribution of the Sample Mean (Theorem)
Let $X_1, X_2, \dots , X_n$ be i.i.d. $N(\mu, \sigma^2)$, then
$$S_n = \sum\limits X_i ∼ N(nμ, nσ^2)$$
$$X_n = \frac{1}{n}\sum\limits X_i ∼ N(\mu, \frac{σ^2}{n})$$

I know that this works, but I don’t know the theory on why it works. The teacher says the proof is done with Moment-Generating Functions.

title: CLT
The [[Law of Large Numbers]] tells us that the r.v. $\overline{X}_n$ approaches $\mu$ as $n$ approaches $\infty$. But how does $X_n$ behave?  
The Central Limit Theorem tells us that as $n → \infty$:  
$$X_n ∼ N \left(\mu, \frac{\sigma^{2}}{n}\right) \space OR \space Z = \frac{X_n - \mu}{\frac{\sigma}{\sqrt{n}}} \sim N(0, 1)$$

This is so powerful because $X_{n}$ can model ANY distribution., For example, if $X \sim B in (n, p)$ then $X \sim N (n p, n p (p - 1))$ → wait is this CLT?

If $X_{1}, X_{2}, \dots, X_{n} \sim P o i (λ)$ , then $X_{n} \sim N (λ, \frac{λ}{n})$

Proof

I asked for a proof in class for the CLT which the teacher gave, except I don’t understand it…

Let $X_{1}, X_{2}, \dots, X_{N}$ be i.i.d r.v.s. with $E (X_{i}) = μ$ , $Va r (X_{i}) = σ^{2}$

Let $z = \frac{X - μ}{\frac{σ}{n}}$ , where $\overline{X} = \frac{1}{n} \sum_{i = 1}^{N} X_{i}$ We will show that $lim_{n \to \infty} M_{z} (t) = e^{\frac{t ^{2}}{2}}$ : Let $Y_{i} = \frac{X _{i} - μ}{σ}$ $z = \frac{Y - μ}{\frac{σ}{n}}$ , $Y_{i}$ is i.i.d with $E (Y_{i}) = 0$ , $Va r (Y_{i}) = 1$

Then, we use the Moment-Generating Function $M_{Z} (t) = E [e^{t}] = π_{i = 1}^{n} E [e^{t \frac{y _{i}}{n}}] = [M_{y_{i}} \frac{t}{n}]^{n}$ Taking the limits, we have (2nd step to 3rd step jump is by Taylor Approximation, it seems by magic) $lim_{n \to \infty} M_{Z} (t) = lim_{n \to \infty} [M_{y_{i}} (\frac{t}{n})]^{n} = lim_{n \to \infty} [1 + 0 + \frac{\frac{t ^{2}}{2}}{n}]^{n} = e^{\frac{t ^{2}}{2}}$

The central limit theorem is super interesting! Learned it while at Ericsson. Based around the Ericsson. Based around the Law of Large Numbers.

In my laymanns terms, take any distribution and randomly sample from it. Say if you sample at least 30 times, and you take the sum of those sampled values. Then repeat that process like 1000 times. If you plot the sums of those samples, it will follow a Normal Distribution.

The Continuity Correction

To convert discrete to continuous, we can apply a continuity correction to have a better approximation…? $P (X = 4000) = P (3999.5 < X < 4000.5)$

To generate a Random Normal Number

Obtained from Xixian at Ericsson. I still don’t know enough to understand why this works.

Suppose $x_{1}$ , $x_{2}$ , and $x_{N}$ are independent samples chosen from the uniform distribution with the mean $μ$ and variance $σ^{2}$ . Let $y$ $y = \frac{( \sum _{i = 1}^{N} x _{i} - N * μ )}{N * σ )}$

Where

$σ$ is the real Variance
$μ$ is the real mean of the distribution, NOT of the sample. (I made this fatal mistake in the implementation)

Then $y$ is a random variable with a standard normal distribution if $N$ is sufficient large, e.g, $N = 30$ is usually the number used

🛠️ Steven Gong

Table of Contents

Central Limit Theorem

Proof

The Continuity Correction

To generate a Random Normal Number

Graph View

Backlinks