The distribution of X and the central limit theorem

The distribution of X and the central limit theorem

Distribution of X

  • Random variables are variables linked to a random event. The distribution of X describes the possible outcomes of the random variable X, along with their associated probabilities.

  • The probability distribution of a variable is detailed by the probability function, which provides the probabilities of discrete outcomes, or the probability density function for continuous variables.

  • A discrete probability distribution or probability mass function (pmf) lists the exact probabilities of discrete outcomes, while the probability density function (pdf) provides probabilities for ranges of outcomes for continuous outcomes.

  • The expected value, E(X), of a random variable X is computed as the sum of the product of each outcome and its associated probability for discrete variables, or integrated over the range for continuous variables.

  • The variance, Var(X), measures the dispersion of the random variable from its expected value. It is calculated as the expected value of the squared deviation from the mean.

  • The standard deviation is the square root of the variance, providing a measure of dispersion in the same units as the random variable.

Central Limit Theorem

  • The Central Limit Theorem (CLT) is a fundamental theorem in probability theory and statistics which states that the distribution of the sum (or average) of a large number of independent, identically distributed random variables approaches a normal distribution, regardless of the shape of the original distribution.

  • This theorem has immense importance due to the ubiquity of the Normal Distribution in applied science, engineering, mathematics and natural science.

  • The central limit theorem explains why many distributions tend to be close to the normal distribution. The key factor is that the average of a large number of variables, irrespective of the original distribution, follows a normal distribution.

  • Only two critical conditions are required for the theorem to hold: The random variables must be identically distributed, and they must be independent of each other.

  • The standard normal distribution, also known as the z-distribution, is a special case of the normal distribution where the mean is 0 and the standard deviation is 1.

  • The concept of the central limit theorem is the logical foundation for many statistical procedures, including hypothesis testing and confidence intervals, which both assume normal distribution.

  • Practically, it allows us to make significant inferences about population parameters using sample data.