Home / AP Statistics 5.3 The Central Limit Theorem Study Notes

AP Statistics 5.3 The Central Limit Theorem Study Notes

AP Statistics 5.3 The Central Limit Theorem Study Notes- New syllabus

AP Statistics 5.3 The Central Limit Theorem Study Notes -As per latest AP Statistics Syllabus.

LEARNING OBJECTIVE

  • Probabilistic reasoning allows us to anticipate patterns in data

Key Concepts:

  • Sampling Distributions and the Central Limit Theorem

AP Statistics -Concise Summary Notes- All Topics

Sampling Distributions and the Central Limit Theorem

Sampling Distributions and the Central Limit Theorem

 Sampling Distribution

A sampling distribution is the probability distribution of a statistic (such as the sample mean, sample proportion, or sample variance) based on all possible random samples of a fixed size taken from the same population.

  • It shows how the statistic would vary if we repeatedly took samples of the same size.
  • The variability of a sampling distribution decreases as sample size increases because larger samples reduce sampling error.

Example: If we take many samples of size \( n = 50 \) from a population with mean \( \mu = 100 \), the average values from each sample will not be identical, but their distribution forms the sampling distribution of the sample mean.

Central Limit Theorem (CLT)

The CLT states that if the sample size is sufficiently large, the sampling distribution of the sample mean \( \bar{X} \) will be approximately normal, regardless of the population’s distribution.

Parameters of the Sampling Distribution of the Mean:

    • Mean: \( \mu_{\bar{X}} = \mu \) (the mean of the sample means equals the population mean).
    • Standard Deviation (Standard Error): \( \sigma_{\bar{X}} = \dfrac{\sigma}{\sqrt{n}} \), where \( \sigma \) is the population standard deviation and \( n \) is the sample size.
  • As \( n \) increases, the sampling distribution becomes more tightly clustered around the population mean.

Example: Even if exam scores in a class are skewed, the distribution of average scores from many random samples of 40 students will look approximately normal.

Conditions for the CLT

  • Independence: The sampled values must be independent. This is usually satisfied if sampling is random and the population is large relative to the sample size (rule of thumb: population at least 10 times the sample size).
  • Sample Size: The sample size must be sufficiently large.
    • If the population distribution is normal, any sample size works.
    • If the population is not normal, \( n \geq 30 \) is usually considered large enough.

 Randomization Distribution

A randomization distribution is created by simulating the process of random assignment under the assumption that the null hypothesis is true.

  • For experiments, this involves repeatedly and randomly reallocating the observed responses to treatment groups to see what kinds of differences could occur just by chance.
  • This forms the basis for significance testing: if the observed statistic is very unlikely compared to this randomization distribution, we conclude that the treatment had an effect.

Example: In a drug experiment, we could shuffle “improvement” responses among treatment and placebo groups thousands of times to build a randomization distribution of differences in means.

 Simulating a Sampling Distribution

When it is impractical to calculate a theoretical sampling distribution, we can use computer simulation to approximate it.

  • The process:
    1. Take a random sample from the population.
    2. Compute the statistic of interest (e.g., mean, proportion).
    3. Repeat steps 1 and 2 many times (hundreds or thousands).
    4. Plot the distribution of these statistics. This approximates the sampling distribution.

Example: If we simulate rolling a fair die 50 times and calculate the average, then repeat this 1,000 times, the distribution of those averages will be approximately normal (by the CLT).

Example 

A population has mean \( \mu = 50 \) and standard deviation \( \sigma = 12 \). Suppose we take random samples of size \( n = 36 \).

Find the mean and standard deviation of the sampling distribution of the sample mean \( \bar{X} \).

▶️ Answer / Explanation

Step 1: Recall the formulas for the sampling distribution:

  • Mean: \( \mu_{\bar{X}} = \mu \)
  • Standard deviation: \( \sigma_{\bar{X}} = \dfrac{\sigma}{\sqrt{n}} \)

Step 2: Substitute values:

  • \( \mu_{\bar{X}} = 50 \)
  • \( \sigma_{\bar{X}} = \dfrac{12}{\sqrt{36}} = \dfrac{12}{6} = 2 \)

Answer: The sampling distribution has mean \( \mu_{\bar{X}} = 50 \) and standard deviation \( \sigma_{\bar{X}} = 2 \).

Example 

Heights of adult women follow a distribution with mean \( \mu = 64 \) inches and standard deviation \( \sigma = 3 \) inches. A random sample of \( n = 36 \) women is taken.

What is the probability that the sample mean height \( \bar{X} \) is greater than 65 inches?

▶️ Answer / Explanation

Step 1: Parameters of the sampling distribution:

  • \( \mu_{\bar{X}} = 64 \)
  • \( \sigma_{\bar{X}} = \dfrac{3}{\sqrt{36}} = 0.5 \)

Step 2: Standardize 65:

\( z = \dfrac{65 – 64}{0.5} = \dfrac{1}{0.5} = 2 \).

Step 3: Find probability:

\( P(\bar{X} > 65) = P(Z > 2) = 0.0228 \).

Answer: The probability is approximately 2.28%.

Example

A study compares average test scores between students who used a new study app and those who did not. The observed difference in means is 4 points (app group higher). To test whether this could happen by chance, researchers repeatedly shuffled the scores between the two groups and calculated new differences in means.

 If only 2% of the simulated differences are greater than or equal to 4, what conclusion should be made?

▶️ Answer / Explanation

Step 1: This is a randomization distribution under the null hypothesis (no treatment effect).

Step 2: The observed difference of 4 is rare compared to the simulated differences (only 2% as extreme).

Step 3: Since 2% is less than the usual significance level (5%), the result is statistically significant.

Answer: There is strong evidence that the study app improves test scores.

Scroll to Top