AP Statistics 8.3 Carrying Out a Chi-Square Test for Goodness of Fit Study Notes

LEARNING OBJECTIVE

The chi-square distribution may be used to model variation.

Key Concepts:

Chi-Square Test Statistic for Goodness-of-Fit
Determine the P-Value for a Chi-Square Goodness-of-Fit Significance Test
Interpret the P-Value for the Chi-Square Goodness-of-Fit Test
Justifying a Claim Based on a Chi-Square Goodness-of-Fit Test

Chi-Square Test Statistic for Goodness-of-Fit

The chi-square test statistic measures how far the observed counts are from the expected counts under the null hypothesis. The larger the differences, the more evidence against \(H_0\).

Formula:

\(\displaystyle \chi^2 = \sum_{i=1}^k \frac{(O_i - E_i)^2}{E_i}\)

\(O_i\) = observed count for category \(i\)
\(E_i\) = expected count for category \(i\) under \(H_0\)
\(k\) = number of categories

Key Points:

Each term \((O_i - E_i)^2 / E_i\) measures the squared difference between observed and expected counts, scaled by the expected count.
The sum of these terms gives the test statistic \(\chi^2\), which is compared to a chi-square distribution with \(df = k - 1\) degrees of freedom for goodness-of-fit tests.

Example

A school claims that students’ favorite subjects are equally preferred among Math, Science, English, and History. A survey of 60 students gives observed counts:

Math: 14
Science: 18
English: 16
History: 12

Expected counts under \(H_0\) (equal preference): 60 × 0.25 = 15 for each subject.

Calculate the chi-square test statistic.

▶️ Answer / Explanation

Step 1: Apply the formula:

\(\displaystyle \chi^2 = \sum \frac{(O_i - E_i)^2}{E_i}\)

Step 2: Calculate each term:

Math: \((14 - 15)^2 / 15 = 1 / 15 \approx 0.067\)
Science: \((18 - 15)^2 / 15 = 9 / 15 = 0.6\)
English: \((16 - 15)^2 / 15 = 1 / 15 \approx 0.067\)
History: \((12 - 15)^2 / 15 = 9 / 15 = 0.6\)

Step 3: Sum the terms:

\(\chi^2 \approx 0.067 + 0.6 + 0.067 + 0.6 = 1.334\) (summing the rounded terms; the unrounded value is \(20/15 \approx 1.333\)).

Step 4: Interpret the result. The chi-square statistic of 1.334 measures how much the observed counts differ from the expected counts. This value can now be compared to a chi-square distribution with \(df = k - 1 = 4 - 1 = 3\) to determine whether the difference is statistically significant.
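The arithmetic in this example can be checked with a few lines of Python. This is a minimal sketch, not part of the original notes; the function name `chi_square_statistic` and the variable names are illustrative choices. Because it sums the unrounded terms, it reports 1.333 rather than the hand-rounded 1.334.

```python
def chi_square_statistic(observed, expected):
    """Return the chi-square statistic and the per-category contributions."""
    if len(observed) != len(expected):
        raise ValueError("observed and expected must have the same length")
    # Each contribution is (O_i - E_i)^2 / E_i.
    contributions = [(o - e) ** 2 / e for o, e in zip(observed, expected)]
    return sum(contributions), contributions


# Counts from the favorite-subject survey above.
subjects = ["Math", "Science", "English", "History"]
observed = [14, 18, 16, 12]
n = sum(observed)
expected = [n * 0.25] * len(observed)  # equal preference under H0: 15 each

statistic, contributions = chi_square_statistic(observed, expected)
df = len(observed) - 1  # k - 1 degrees of freedom

for name, term in zip(subjects, contributions):
    print(f"{name}: {term:.3f}")
print(f"chi-square = {statistic:.3f}, df = {df}")
```

Printing the individual contributions is useful on its own: the categories with the largest terms (here Science and History) are the ones that deviate most from the expected counts.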

Determine the P-Value for a Chi-Square Goodness-of-Fit Significance Test

The p-value is the probability of obtaining a chi-square statistic at least as large as the observed value, assuming the null hypothesis \( H_0 \) is true.

Used in the goodness-of-fit test to assess whether the observed distribution matches the expected distribution.
A smaller p-value indicates stronger evidence that the observed frequencies differ from the expected frequencies.

Steps to Determine the P-Value:

Calculate the chi-square statistic: \( \displaystyle \chi^2 = \sum \dfrac{(O_i - E_i)^2}{E_i} \)
Determine the degrees of freedom: \( df = k - 1 \), where \( k \) is the number of categories.
Find the p-value: Using the chi-square distribution, compute \( P(\chi^2 \ge \chi^2_{\text{observed}}) \).

Notes:

The p-value is found from the right-tail area of the chi-square curve.
Larger \( \chi^2 \) → smaller p-value → more evidence against \( H_0 \).
All expected counts should be ≥ 5 for the approximation to be valid.

Example:

Observed counts of favorite subjects among 60 students:

Math = 14, Science = 18, English = 16, History = 12

Expected counts = 15 each. Chi-square statistic = \( \chi^2 = 1.334 \); \( df = 3 \).

Determine the p-value.

▶️ Answer / Explanation

Use the chi-square distribution with \( df = 3 \): find \( P(\chi^2 \ge 1.334) \).
From table/software → p-value ≈ 0.72.
Interpretation: If \( H_0 \) is true, a chi-square statistic of 1.334 or larger would occur in about 72% of random samples of this size.
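The table/software lookup can be reproduced with the Python standard library. The sketch below is an illustration under stated assumptions, not the method the notes prescribe: the helper name `chi_square_p_value` is hypothetical, and it relies on the known closed-form expressions for the chi-square right-tail area when the degrees of freedom are a positive integer. In practice a statistics package (for example `scipy.stats.chi2.sf`) or a calculator gives the same area.

```python
import math


def chi_square_p_value(statistic, df):
    """Right-tail area P(X >= statistic) for a chi-square variable with integer df."""
    if int(df) != df or df < 1:
        raise ValueError("df must be a positive integer")
    df = int(df)
    if statistic <= 0:
        return 1.0  # the whole distribution lies to the right of 0
    y = statistic / 2.0
    if df % 2 == 0:
        # Even df: exp(-y) * sum_{j=0}^{df/2 - 1} y^j / j!
        total = sum(y ** j / math.factorial(j) for j in range(df // 2))
        return math.exp(-y) * total
    # Odd df: erfc(sqrt(y)) + exp(-y) * sum_{j=1}^{(df-1)/2} y^(j - 1/2) / Gamma(j + 1/2)
    total = sum(
        y ** (j - 0.5) / math.gamma(j + 0.5) for j in range(1, (df - 1) // 2 + 1)
    )
    return math.erfc(math.sqrt(y)) + math.exp(-y) * total


# Values from the example above.
p_value = chi_square_p_value(1.334, 3)
print(f"p-value = {p_value:.2f}")  # prints: p-value = 0.72
```

Only the right tail is used because large values of the statistic, in either direction of deviation, are what count as evidence against \( H_0 \).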

Interpret the P-Value for the Chi-Square Goodness-of-Fit Test

Interpreting the p-value tells us whether the observed data provide convincing evidence that the actual distribution differs from the expected model.

Small p-value (\( p \le \alpha \)) → Reject \( H_0 \): Observed differences are unlikely under the assumed distribution.
Large p-value (\( p > \alpha \)) → Fail to Reject \( H_0 \): Observed differences are consistent with random variation.
The p-value does not give the probability that \( H_0 \) is true; it shows how compatible the data are with \( H_0 \).

Decision Guidelines:

P-Value	Decision	Interpretation
\( p \le \alpha \)	Reject \( H_0 \)	Evidence that the distribution differs from expected.
\( p > \alpha \)	Fail to Reject \( H_0 \)	Observed differences are consistent with random chance.
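The decision rule in the table can be written as a one-line comparison. The sketch below is illustrative only; the function name `decide` and the default significance level of 0.05 are assumptions, not something the notes specify.

```python
def decide(p_value, alpha=0.05):
    """Return the test decision for a p-value at significance level alpha."""
    if not 0 <= p_value <= 1:
        raise ValueError("p_value must be between 0 and 1")
    if not 0 < alpha < 1:
        raise ValueError("alpha must be strictly between 0 and 1")
    # Reject H0 when p <= alpha; otherwise fail to reject.
    return "reject H0" if p_value <= alpha else "fail to reject H0"


print(decide(0.72))  # fail to reject H0
print(decide(0.03))  # reject H0
```

Note that the boundary case \( p = \alpha \) leads to rejection, matching the \( p \le \alpha \) row of the table.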

Example:

From the previous test, \( \chi^2 = 1.334 \), \( df = 3 \), p-value = 0.72, \( \alpha = 0.05 \).

Interpret the p-value.

▶️ Answer / Explanation

Since \( p = 0.72 > 0.05 \), we fail to reject \( H_0 \).
The observed subject preferences could easily occur by random chance if students are equally likely to choose any subject.

Justifying a Claim Based on a Chi-Square Goodness-of-Fit Test

A chi-square goodness-of-fit test allows us to assess whether observed categorical data are consistent with a claimed distribution. The results can be used to justify or refute a claim about the population proportions.

Steps for Justifying a Claim:

State hypotheses:
- \(H_0\): The population proportions are as claimed.
- \(H_a\): At least one population proportion differs from the claim.
Check conditions: Random sample, independent observations, and expected counts ≥ 5.
Calculate the chi-square statistic: \(\displaystyle \chi^2 = \sum \frac{(O_i - E_i)^2}{E_i}\).
Determine the p-value: Find \(P(\chi^2 \ge \chi^2_\text{observed})\) using the chi-square distribution with \(df = k - 1\).
Compare p-value to significance level (\(\alpha\)):
- If \(p \le \alpha\): Reject \(H_0\). There is sufficient evidence to conclude that the population does not match the claimed distribution.
- If \(p > \alpha\): Fail to reject \(H_0\). There is insufficient evidence to refute the claim; observed variation could be due to chance.
Contextual interpretation: Always state the conclusion in terms of the population and the claim.

Notes:

The chi-square test does not prove that the null hypothesis is true; it only assesses whether the observed data are consistent with the claimed distribution.
A small p-value provides strong evidence against the claim, while a large p-value indicates that the claim is plausible.

Example

A candy company claims that its four-color candy packs contain equal numbers of red, green, blue, and yellow candies. A random sample of 80 candies yields:

Red: 18
Green: 22
Blue: 20
Yellow: 20

Expected counts under \(H_0\): 80 × 0.25 = 20 for each color.

Chi-square statistic: \(\chi^2 = \frac{(18 - 20)^2 + (22 - 20)^2 + (20 - 20)^2 + (20 - 20)^2}{20} = \frac{8}{20} = 0.4\), \(df = 3\), p-value ≈ 0.94.

Can we justify the company’s claim about equal proportions of candy colors?

▶️ Answer / Explanation

Step 1: Compare the p-value to the significance level:

Assume \(\alpha = 0.05\). The p-value (0.94) is much larger than 0.05.

Step 2: Decision: Fail to reject \(H_0\).

Step 3: Contextual interpretation:

There is insufficient evidence to conclude that the proportions of candy colors differ from equal. The observed variation is consistent with random chance, so the company’s claim of equal proportions is plausible given this sample, although the test does not prove that the claim is true.
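The full procedure listed in this section can be run end to end on the candy sample. The code below is a minimal sketch under stated assumptions: the names `goodness_of_fit_test` and `chi_square_p_value` are hypothetical, the p-value helper uses the closed-form right-tail expressions for integer degrees of freedom, and only the expected-count condition is checked, because randomness and independence depend on how the sample was collected rather than on the counts.

```python
import math


def chi_square_p_value(statistic, df):
    """Right-tail chi-square area for a positive integer df (closed-form sums)."""
    if statistic <= 0:
        return 1.0
    y = statistic / 2.0
    if df % 2 == 0:
        return math.exp(-y) * sum(y ** j / math.factorial(j) for j in range(df // 2))
    tail = sum(
        y ** (j - 0.5) / math.gamma(j + 0.5) for j in range(1, (df - 1) // 2 + 1)
    )
    return math.erfc(math.sqrt(y)) + math.exp(-y) * tail


def goodness_of_fit_test(observed, claimed_proportions, alpha=0.05):
    """Carry out the goodness-of-fit steps and return the intermediate results."""
    if len(observed) != len(claimed_proportions) or len(observed) < 2:
        raise ValueError("need matching lists with at least two categories")
    if abs(sum(claimed_proportions) - 1.0) > 1e-9:
        raise ValueError("claimed proportions must sum to 1")
    n = sum(observed)
    expected = [n * p for p in claimed_proportions]  # E_i = n * p_i under H0
    large_counts_ok = all(e >= 5 for e in expected)  # expected counts >= 5
    statistic = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    df = len(observed) - 1
    p_value = chi_square_p_value(statistic, df)
    decision = "reject H0" if p_value <= alpha else "fail to reject H0"
    return {
        "expected": expected,
        "large_counts_ok": large_counts_ok,
        "statistic": statistic,
        "df": df,
        "p_value": p_value,
        "decision": decision,
    }


# Candy sample: red, green, blue, yellow; claimed proportions are equal.
result = goodness_of_fit_test([18, 22, 20, 20], [0.25, 0.25, 0.25, 0.25])
print(f"chi-square = {result['statistic']:.1f}, df = {result['df']}")
print(f"p-value = {result['p_value']:.2f}, decision: {result['decision']}")
```

Recomputed from the sample counts, the statistic is 0.4 with 3 degrees of freedom and the p-value is about 0.94, so the decision at \(\alpha = 0.05\) is to fail to reject \(H_0\).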
