AP Statistics 5.8 Sampling Distributions for Differences in Sample Means Study Notes

AP Statistics 5.8 Sampling Distributions for Differences in Sample Means Study Notes- New syllabus

AP Statistics 5.8 Sampling Distributions for Differences in Sample Means Study Notes -As per latest AP Statistics Syllabus.

LEARNING OBJECTIVE

Probabilistic reasoning allows us to anticipate patterns in data.

Key Concepts:

Determine Parameters of a Sampling Distribution for a Difference in Sample Means
Approximate Normality for $\bar{x}_1 – \bar{x}_2$
Contextual Interpretation of Difference in Sample Means

AP Statistics -Concise Summary Notes- All Topics

Determine Parameters of a Sampling Distribution for a Difference in Sample Means

Statistic of interest:

The difference between two independent sample means, $ \bar{x}_1 – \bar{x}_2 $.

Mean of the sampling distribution:

$ \mu_{\bar{x}_1 – \bar{x}_2} = \mu_1 – \mu_2 $

Interpretation: On average, the difference between sample means equals the difference between the population means.

Standard error of the sampling distribution:

$ \sigma_{\bar{x}_1 – \bar{x}_2} = \sqrt{\dfrac{\sigma_1^2}{n_1} + \dfrac{\sigma_2^2}{n_2}} $ where $ \sigma_1, \sigma_2 $ are the population standard deviations, and $ n_1, n_2 $ are the sample sizes.

Shape: If both populations are normal, the distribution of $ \bar{x}_1 – \bar{x}_2 $ is normal. If not, the Central Limit Theorem ensures approximate normality when $ n_1 $ and $ n_2 $ are sufficiently large.

Conditions: Samples must be independent, and each sample should be random. For finite populations, the sample size should not exceed $10\%$ of the population size.

Example:

A researcher compares the average test scores of two schools.

School A population: $ \mu_1 = 75 $, $ \sigma_1 = 10 $, sample size $ n_1 = 40 $.

School B population: $ \mu_2 = 70 $, $ \sigma_2 = 12 $, sample size $ n_2 = 50 $.

Find the mean and standard error of the sampling distribution of $ \bar{x}_1 – \bar{x}_2 $.

▶️ Answer / Explanation

Step 1: Mean

$ \mu_{\bar{x}_1 – \bar{x}_2} = \mu_1 – \mu_2 = 75 – 70 = 5 $.

Interpretation: On average, School A’s sample mean score is 5 points higher than School B’s.

Step 2: Standard Error

$ \sigma_{\bar{x}_1 – \bar{x}_2} = \sqrt{\dfrac{\sigma_1^2}{n_1} + \dfrac{\sigma_2^2}{n_2}} = \sqrt{\dfrac{10^2}{40} + \dfrac{12^2}{50}} $.

$ = \sqrt{\dfrac{100}{40} + \dfrac{144}{50}} = \sqrt{2.5 + 2.88} = \sqrt{5.38} \approx 2.32 $.

Final Answer: Mean = 5, Standard Error ≈ 2.32.

Example

A nutritionist studies the average daily calorie intake of two groups.

Group 1: $ \mu_1 = 2200 $, $ \sigma_1 = 400 $, sample size $ n_1 = 64 $.

Group 2: $ \mu_2 = 2100 $, $ \sigma_2 = 500 $, sample size $ n_2 = 100 $.

Find the mean and standard error of the sampling distribution of $ \bar{x}_1 – \bar{x}_2 $.

▶️ Answer / Explanation

Step 1: Mean

$ \mu_{\bar{x}_1 – \bar{x}_2} = 2200 – 2100 = 100 $.

Interpretation: On average, Group 1 consumes 100 more calories per day than Group 2.

Step 2: Standard Error

$ \sigma_{\bar{x}_1 – \bar{x}_2} = \sqrt{\dfrac{400^2}{64} + \dfrac{500^2}{100}} $.

$ = \sqrt{\dfrac{160000}{64} + \dfrac{250000}{100}} = \sqrt{2500 + 2500} = \sqrt{5000} \approx 70.71 $.

Final Answer: Mean = 100, Standard Error ≈ 70.71.

Approximate Normality for $\bar{x}_1 – \bar{x}_2$

If both populations are normal Then the sampling distribution of $ \bar{x}_1 – \bar{x}_2 $ is normal, regardless of sample sizes.

If populations are not normal: By the Central Limit Theorem (CLT), the sampling distribution of $ \bar{x}_1 – \bar{x}_2 $ will be approximately normal if both sample sizes are sufficiently large (commonly $ n_1, n_2 \geq 30 $).
Independence condition: The two samples must be independent. If sampling without replacement, sample sizes should be no more than 10% of the population sizes.

Approximate normality depends on (1) shape of population distributions and (2) size of samples.

Example:

Researchers compare average study times of two groups of students.

Group 1: population not normal, $ n_1 = 40 $.
Group 2: population not normal, $ n_2 = 35 $.

Can the sampling distribution of $ \bar{x}_1 – \bar{x}_2 $ be treated as approximately normal?

▶️ Answer / Explanation

Step 1: Populations are not normal. So we must use the CLT.

Step 2: Both sample sizes are large enough ($ n_1 = 40 $, $ n_2 = 35 $, both ≥ 30).

Step 3: Samples are independent (assumed random and from different groups).

Conclusion: By the Central Limit Theorem, the sampling distribution of $ \bar{x}_1 – \bar{x}_2 $ can be described as approximately normal.

Contextual Interpretation of Difference in Sample Means

Mean ($ \mu_{\bar{x}_1 – \bar{x}_2} $):

Equal to $ \mu_1 – \mu_2 $.

Interpretation: On average, the difference between sample means equals the true difference between the population means.

Standard Error ($ \sigma_{\bar{x}_1 – \bar{x}_2} $):

$ \sigma_{\bar{x}_1 – \bar{x}_2} = \sqrt{\dfrac{\sigma_1^2}{n_1} + \dfrac{\sigma_2^2}{n_2}} $

Interpretation: Describes how much the difference in sample means typically varies from the difference in population means.

Shape: If conditions for normality are met, we can use z-scores and normal probability calculations for inference about $ \bar{x}_1 – \bar{x}_2 $.

Contextual Interpretation: Probabilities must be expressed in terms of the original populations being studied (e.g., test scores, weights, times).

Example:

Suppose average math test scores are being compared between two schools. – School A: $ \mu_1 = 78 $, $ \sigma_1 = 12 $, $ n_1 = 36 $. – School B: $ \mu_2 = 74 $, $ \sigma_2 = 10 $, $ n_2 = 49 $.

Find the mean and standard error of $ \bar{x}_1 – \bar{x}_2 $. Interpret the results.

▶️ Answer / Explanation

Step 1: Mean

$ \mu_{\bar{x}_1 – \bar{x}_2} = \mu_1 – \mu_2 = 78 – 74 = 4 $. Interpretation: On average, School A’s sample mean test score is 4 points higher than School B’s.

Step 2: Standard Error

$ \sigma_{\bar{x}_1 – \bar{x}_2} = \sqrt{\dfrac{12^2}{36} + \dfrac{10^2}{49}} = \sqrt{\dfrac{144}{36} + \dfrac{100}{49}} $.

= $ \sqrt{4 + 2.04} = \sqrt{6.04} \approx 2.46 $.

Interpretation: The difference in sample means typically varies by about 2.46 points from the true difference in population means.

Final Conclusion: The sampling distribution of $ \bar{x}_1 – \bar{x}_2 $ has mean 4 and standard error ≈ 2.46. In context, this means School A’s average scores are expected to be about 4 points higher, but sample-to-sample variation is around 2.5 points.

AP Statistics 5.8 Sampling Distributions for Differences in Sample Means Study Notes

AP Statistics 5.8 Sampling Distributions for Differences in Sample Means Study Notes- New syllabus

LEARNING OBJECTIVE

Determine Parameters of a Sampling Distribution for a Difference in Sample Means

Approximate Normality for $\bar{x}_1 - \bar{x}_2$

Contextual Interpretation of Difference in Sample Means

Resources

Members

Company