Home / AP_Statistics_2022_FRQ

Question 1

A biologist gathered data on the length, in millimeters (mm), and the mass, in grams (g), for 11 bullfrogs. The data are shown in Plot 1.
(a) Based on the scatterplot, describe the relationship between mass and length, in context.
From the data, the biologist calculated the least-squares regression line for predicting mass from length. The least-squares regression line is shown in Plot 2.
predicted mass = \(-546 + 6.086\) (length)
(b) Identify and interpret the slope of the least-squares regression line in context.
(c) Interpret the coefficient of determination of the least-squares regression line, \( r^2 \approx 0.819 \), in context.
(d) From Plot 2, consider the residuals of the 11 bullfrogs.
(i) Based on the plot, approximately what is the length and mass of the bullfrog with the largest absolute value residual?
(ii) Does the least-squares regression line overestimate or underestimate the mass of the bullfrog identified in part (d-i)? Explain your answer.

Most-appropriate topic codes (CED):

TOPIC 2.4: Representing the Relationship Between Two Quantitative Variables — part (a)
TOPIC 2.8: Least Squares Regression — parts (b), (d)
TOPIC 2.5: Correlation — part (c)
▶️ Answer/Explanation
Detailed solution

(a)
The scatterplot reveals a strong, positive, roughly linear association between the mass and length of bullfrogs. There are no points that seriously deviate from the straight-line pattern exhibited by most of the points in the plot.

(b)
The slope of the least-squares regression line is \( 6.086 \). This indicates that for each additional millimeter of length, the predicted mass of a bullfrog increases by 6.086 grams.

(c)
The coefficient of determination \( r^2 \approx 0.819 \) indicates that 81.9% of the variation in bullfrog mass can be explained by the linear relationship with bullfrog length as described by the least-squares regression line.

(d)
(i) The bullfrog with the largest absolute value residual has a length of approximately 162 mm and a mass of approximately 356 g.
(ii) The least-squares regression line overestimates the mass of this bullfrog. This is evident because the point for this bullfrog is below the regression line in Plot 2, meaning the actual mass is less than the predicted mass.

Question 2

A dermatologist will conduct an experiment to investigate the effectiveness of a new drug to treat acne. The dermatologist has recruited \(36\) pairs of identical twins. Each person in the experiment has acne and each person in the experiment will receive either the new drug or a placebo. After each person in the experiment uses either the new drug or the placebo for \(2\) weeks, the dermatologist will evaluate the improvement in acne severity for each person on a scale from \(0\) (no improvement) to \(100\) (complete cure).
(a) Identify the treatments, experimental units, and response variable of the experiment.
  • Treatments:
  • Experimental units:
  • Response variable:
Each twin in the experiment has a severity of acne similar to that of the other twin. However, the severity of acne differs from one twin pair to another.
(b) For the dermatologist’s experiment, describe a statistical advantage of using a matched-pairs design where twins are paired rather than using a completely randomized design.
(c) For the dermatologist’s experiment, describe how the treatments can be randomly assigned to people using a matched-pairs design in which twins are paired.

Most-appropriate topic codes (CED):

TOPIC 3.5: Introduction to Experimental Design — parts (a), (c)
TOPIC 3.6: Selecting an Experimental Design — part (b)
▶️ Answer/Explanation
Detailed solution

(a)

  • Treatments: The new drug and the placebo.
  • Experimental units: The \(72\) people participating in the experiment.
  • Response variable: The improvement in acne severity, measured on a scale from \(0\) to \(100\).

(b)
The severity of acne differs from one twin pair to another. This initial acne severity is a potential confounding variable. In a completely randomized design, this variability between pairs could obscure the true effect of the treatment.

A statistical advantage of using a matched-pairs design is that it controls for the variability in initial acne severity between the pairs of twins. By pairing individuals who are similar (in this case, identical twins), the design isolates the effect of the treatment from the effect of initial severity. This reduction in variability provides a more precise estimate of the treatment effect (the difference in improvement between the drug and the placebo) and increases the statistical power of the experiment, making it easier to detect a significant difference between the drug and the placebo if one truly exists.

(c)
The random assignment should be performed separately for each of the \(36\) pairs of twins. One valid method is as follows:

For each pair of twins, flip a fair coin.

  • If the coin lands on heads, one twin (e.g., the older twin) receives the new drug and the other twin receives the placebo.
  • If the coin lands on tails, the assignment is reversed: the same twin receives the placebo and the other receives the new drug.

This process is repeated for all \(36\) pairs, ensuring that within each pair, each twin has a \(50\%\) chance of receiving either treatment, and the assignment for one pair is independent of all other pairs.

Question 3

A machine at a manufacturing company is programmed to fill shampoo bottles such that the amount of shampoo in each bottle is normally distributed with mean \(0.60\) liter and standard deviation \(0.04\) liter. Let the random variable \(A\) represent the amount of shampoo, in liters, that is inserted into a bottle by the filling machine.
(a) A bottle is considered to be underfilled if it has less than \(0.50\) liter of shampoo. Determine the probability that a randomly selected bottle of shampoo will be underfilled. Show your work.
After the bottles are filled, they are placed in boxes of \(10\) bottles per box. After the bottles are placed in the boxes, several boxes are placed in a crate for shipping to a beauty supply warehouse. The manufacturing company’s contract with the beauty supply warehouse states that one box will be randomly selected from a crate. If \(2\) or more bottles in the selected box are underfilled, the entire crate will be rejected and sent back to the manufacturing company.
(b) The beauty supply warehouse manager is interested in the probability that a crate shipped to the warehouse will be rejected. Assume that the amounts of shampoo in the bottles are independent of each other.
(i) Define the random variable of interest for the warehouse manager and state how the random variable is distributed.
(ii) Determine the probability that a crate will be rejected by the warehouse manager. Show your work.
To reduce the number of crates rejected by the beauty supply warehouse manager, the manufacturing company is considering adjusting the programming of the filling machine so that the amount of shampoo in each bottle is normally distributed with mean \(0.56\) liter and standard deviation \(0.03\) liter.
(c) Would you recommend that the manufacturing company use the original programming of the filling machine or the adjusted programming of the filling machine? Provide a statistical justification for your choice.

Most-appropriate topic codes (CED):

Topic 1.10: The Normal Distribution — Part (a), Part (c)
Topic 4.10: Introduction to the Binomial Distribution — Part (b)
Topic 4.11: Parameters for a Binomial Distribution — Part (b), Part (c)
▶️ Answer/Explanation
Detailed solution

(a)
The amount of shampoo, \( A \), is normally distributed with \( \mu = 0.60 \) and \( \sigma = 0.04 \).
Calculate the z-score:
\( z = \frac{0.50 – 0.60}{0.04} = -2.5 \)
Find the probability:
\( P(A < 0.50) = P(Z < -2.5) \approx 0.0062 \)
So, \(\boxed{P \approx 0.0062}\).

(b)
(i) Let \( X \) be the number of underfilled bottles in a box of \( 10 \). Then \( X \sim \mathrm{Binomial}(n = 10, p = 0.0062) \).
(ii) A crate is rejected if \( X \ge 2 \). Using the complement rule:
\( P(X \ge 2) = 1 – P(X \le 1) = 1 – [P(X=0) + P(X=1)] \)
\( P(X=0) = \binom{10}{0}(0.0062)^0(0.9938)^{10} \approx 0.9396 \)
\( P(X=1) = \binom{10}{1}(0.0062)^1(0.9938)^9 \approx 0.0584 \)
\( P(X \ge 2) = 1 – (0.9396 + 0.0584) = 0.002 \) (more precisely \( 0.0017 \))
So, \(\boxed{P \approx 0.0017}\).

(c)
For the original programming:
\( P(A < 0.5) = P\left(Z < \frac{0.5 – 0.60}{0.04}\right) = P(Z < -2.5) \approx 0.0062 \).
For the adjusted programming:
\( P(A < 0.5) = P\left(Z < \frac{0.5 – 0.56}{0.03}\right) = P(Z < -2.0) \approx 0.0228 \).

Because the probability of an underfilled bottle is greater for the adjusted programming (\( 0.0228 \) vs \( 0.0062 \)), this would result in more rejected shipments. ✅ Recommendation: The company should continue with the original machine programming because it yields a lower probability of underfilling and therefore fewer rejected shipments.

Question 4

A survey conducted by a national research center asked a random sample of \( 920 \) teenagers in the United States how often they use a video streaming service. From the sample, \( 59\% \) answered that they use a video streaming service every day.
(a) Construct and interpret a \( 95\% \) confidence interval for the proportion of all teenagers in the United States who would respond that they use a video streaming service every day.
(b) Based on the confidence interval in part (a), do the sample data provide convincing statistical evidence that the proportion of all teenagers in the United States who would respond that they use a video streaming service every day is not \( 0.5 \)? Justify your answer.

Most-appropriate topic codes (CED):

TOPIC 6.2: Constructing a Confidence Interval for a Population Proportion — part (a)
TOPIC 6.3: Justifying a Claim Based on a Confidence Interval for a Population Proportion — part (b)
TOPIC 6.1: Introducing Statistics: Why Be Normal? — conditions check
▶️ Answer/Explanation
Detailed solution

(a)
State: We will construct a one-sample z-interval for a population proportion.
Let \( p \) = the true proportion of all teenagers in the United States who would respond that they use a video streaming service every day.

Plan: We verify the conditions:
Random: The sample was randomly selected.
10% Condition: \( 920 \) is less than 10% of all teenagers in the United States.
Large Counts: \( n\hat{p} = 920 \times 0.59 = 542.8 \geq 10 \) and \( n(1-\hat{p}) = 920 \times 0.41 = 377.2 \geq 10 \)

Do: The \( 95\% \) confidence interval is:
\( \hat{p} \pm z^*\sqrt{\frac{\hat{p}(1-\hat{p})}{n}} = 0.59 \pm 1.96\sqrt{\frac{0.59 \times 0.41}{920}} \)
\( = 0.59 \pm 1.96\sqrt{0.000263} \approx 0.59 \pm 1.96 \times 0.0162 \approx 0.59 \pm 0.0318 \)
The interval is \( (0.5582, 0.6218) \)

Conclude: We are \( 95\% \) confident that the true proportion of all teenagers in the United States who would respond that they use a video streaming service every day is between \( 0.558 \) and \( 0.622 \).

(b)
Yes, the sample data provide convincing statistical evidence that the proportion is not \( 0.5 \).
Justification: The \( 95\% \) confidence interval \( (0.558, 0.622) \) does not contain \( 0.5 \). Since \( 0.5 \) is not a plausible value for the population proportion based on this interval, we have convincing evidence that the true proportion is different from \( 0.5 \).

Question 5

Studies have shown that foods rich in compounds known as flavonoids help lower blood pressure. Researchers conducted a study to investigate whether there was a greater reduction in blood pressure for people who consumed dark chocolate, which contains flavonoids, than people who consumed white chocolate, which does not contain flavonoids.
Twenty-five healthy adults agreed to participate in the study and add \(3.5\) ounces of chocolate to their daily diets. Of the \(25\) participants, \(13\) were randomly assigned to the dark chocolate group and the rest were assigned to the white chocolate group. All participants had their blood pressure recorded, in millimeters of mercury (mmHg), before adding chocolate to their daily diets and again \(30\) days after adding chocolate to their daily diets.
The reduction in blood pressure (before minus after) for each of the participants in the two groups is shown in the dotplots below.
(a) Determine and compare the medians of the reduction in blood pressure for the two groups.
The researchers found the mean reduction in blood pressure for those who consumed dark chocolate is \(\bar{x}_{\text{dark}} = 6.08\) mmHg and the mean reduction in blood pressure for those who consumed white chocolate is \(\bar{x}_{\text{white}} = 0.42\) mmHg.
(b) One researcher indicated that because the difference in sample means of \(5.66\) mmHg is greater than \(0\) there is convincing statistical evidence to conclude that the population mean blood pressure reduction for those who consume dark chocolate is greater than for those who consume white chocolate. Why might the researcher’s conclusion, based only on the difference in sample means of \(5.66\) mmHg, not necessarily be true?
A simulation was conducted to investigate whether there is a greater reduction of blood pressure for those who consume dark chocolate than for those who consume white chocolate. The simulation was conducted under the assumption that no difference exists. The results of \(120\) trials of the simulation are shown in the following dotplot.
(c) Use the results of the simulation to determine whether the results from the \(25\) participants in the study provide convincing statistical evidence, at a \(5\) percent level of significance, that adding dark chocolate to a daily diet will result in a greater reduction in blood pressure, on average, than adding white chocolate to a daily diet. Justify your answer.

Most-appropriate topic codes (CED):

TOPIC 1.9: Comparing Distributions of a Quantitative Variable — part (a)
TOPIC 5.8: Sampling Distributions for Differences in Sample Means — part (b)
TOPIC 6.5: Interpreting p-Values — part (c)
TOPIC 6.6: Concluding a Test for a Population Proportion — part (c)
▶️ Answer/Explanation
Detailed solution

(a)
To find the medians, we locate the middle value for each group.
Dark Chocolate Group: There are \(13\) participants. The median is the value of the \(\frac{13+1}{2} = 7^{\text{th}}\) observation. Counting from the left on the dotplot, the \(7^{\text{th}}\) value is \(7\) mmHg.
White Chocolate Group: There are \(25 – 13 = 12\) participants. The median is the average of the \(\frac{12}{2} = 6^{\text{th}}\) and \(7^{\text{th}}\) observations. Counting from the left, the \(6^{\text{th}}\) value is \(-1\) mmHg and the \(7^{\text{th}}\) value is \(1\) mmHg. The median is \(\frac{-1 + 1}{2} = 0\) mmHg.

Comparison: The median reduction in blood pressure for the dark chocolate group (\(7\) mmHg) is greater than the median reduction for the white chocolate group (\(0\) mmHg).

(b)
The researcher’s conclusion is not necessarily true because the observed difference in sample means (\(5.66\) mmHg) could be due to sampling variability. The random assignment of participants into two groups can result in a difference between the groups by chance alone, even if there is no true difference in the effects of the two types of chocolate. A single sample difference is not sufficient evidence. A formal statistical inference procedure is needed to determine the probability of observing a difference this large or larger purely by chance, assuming no real effect exists.

(c)
The observed difference in the mean reduction in blood pressure between the two groups is \(\bar{x}_{\text{dark}} – \bar{x}_{\text{white}} = 6.08 – 0.42 = 5.66\) mmHg.

The simulation was conducted under the assumption of no difference. We need to find the probability of observing a difference of \(5.66\) mmHg or greater in this simulation. Looking at the dotplot of simulation results, we count the number of trials where the simulated difference in means was \(5.66\) or more. There is one dot at \(6\), one at \(7\), and one at \(8\). Thus, \(3\) out of the \(120\) trials resulted in a difference of \(5.66\) mmHg or greater.

The estimated p-value is \(P(\text{difference} \geq 5.66) = \frac{3}{120} = 0.025\).

Since the p-value of \(0.025\) is less than the significance level of \(\alpha = 0.05\), we reject the null hypothesis. There is convincing statistical evidence that adding dark chocolate to a daily diet will result in a greater mean reduction in blood pressure than adding white chocolate to a daily diet for people similar to those in this study.

Question 6

To compare success rates for treating allergies at two clinics that specialize in treating allergy sufferers, researchers selected random samples of patient records from the two clinics. The following table summarizes the data.
 Clinic AClinic BTotal
Unsuccessful treatment\( 51 \)\( 33 \)\( 84 \)
Successful treatment\( 88 \)\( 35 \)\( 123 \)
Total\( 139 \)\( 68 \)\( 207 \)
(a) (i) Complete the following table by recording the relative frequencies of successful and unsuccessful treatments at each clinic.
 Clinic AClinic B
Unsuccessful treatment  
Successful treatment  
(ii) Based on the relative frequency table in part (a-i), which clinic is more successful in treating allergy sufferers? Justify your answer.
(b) Based on the design of the study, would a statistically significant result allow the researchers to conclude that receiving treatments at the clinic you selected in part (a-i) causes a higher percentage of successful treatments than at the other clinic? Explain your answer.
A physician who worked at both clinics believed that it was important to separate the patients in the study by severity of the patient’s allergy (severe or mild). The physician constructed the following mosaic plot.
Based on the mosaic plot, the physician concluded the following:
For mild allergy sufferers, Clinic B was more successful in treating allergies.
For severe allergy sufferers, Clinic B was more successful in treating allergies.
(c) (i) For each clinic, which allergy severity is treated more successfully? Justify your answer.
• Clinic A:
• Clinic B:
(ii) For each clinic, which allergy severity is more likely to be treated? Justify your answer.
• Clinic A:
• Clinic B:
(d) Using your answers from part (c), give a reasonable explanation of why the more successful clinic identified in part (a-ii) is the same as or different from the physician’s conclusion that Clinic B is more successful in treating both severe and mild allergies.

Most-appropriate topic codes (CED):

TOPIC 2.3: Statistics for Two Categorical Variables — parts (a), (c)
TOPIC 3.1: Introducing Statistics: Do the Data We Collected Tell the Truth? — part (b)
TOPIC 8.4: Expected Counts in Two-Way Tables — parts (c), (d)
▶️ Answer/Explanation
Detailed solution

(a)
(i) Relative frequency table:

 Clinic AClinic B
Unsuccessful treatment\( \frac{51}{139} \approx 0.367 \)\( \frac{33}{68} \approx 0.485 \)
Successful treatment\( \frac{88}{139} \approx 0.633 \)\( \frac{35}{68} \approx 0.515 \)

(ii) Clinic A is more successful. Clinic A has a higher success rate (\( 63.3\% \)) compared to Clinic B (\( 51.5\% \)).

(b)
No, a statistically significant result would not allow researchers to conclude causation. This is an observational study (not a randomized experiment) where patients were not randomly assigned to clinics. There may be confounding variables, such as allergy severity, that affect both the clinic a patient attends and the success of treatment.

(c)
(i)
Clinic A: More successful in treating mild allergies (\( \frac{78}{104} = 0.75 \)) than severe (\( \frac{10}{35} \approx 0.286 \))
Clinic B: More successful in treating mild allergies (\( \frac{11}{12} \approx 0.917 \)) than severe (\( \frac{24}{56} \approx 0.429 \))

(ii)
Clinic A: More likely to treat mild allergy sufferers (\( \frac{104}{139} \approx 0.748 \)) than severe (\( \frac{35}{139} \approx 0.252 \))
Clinic B: More likely to treat severe allergy sufferers (\( \frac{56}{68} \approx 0.824 \)) than mild (\( \frac{12}{68} \approx 0.176 \))

(d)
The overall success rate favored Clinic A in part (a-ii), which is different from the physician’s conclusion. This is an example of Simpson’s Paradox. Clinic A treats mostly mild cases (\( 74.8\% \)) which have higher success rates, while Clinic B treats mostly severe cases (\( 82.4\% \)) which have lower success rates. When we look within each severity level, Clinic B actually has higher success rates for both mild (\( 91.7\% \) vs \( 75.0\% \)) and severe (\( 42.9\% \) vs \( 28.6\% \)) allergies. The overall success rates are misleading because the clinics treat different types of patients.

Scroll to Top