AP Statistics 1.7 Summary Statistics for a Quantitative Variable Study Notes
AP Statistics 1.7 Summary Statistics for a Quantitative Variable- New syllabus
AP Statistics 1.7 Summary Statistics for a Quantitative Variable Study Notes -As per latest AP Statistics Syllabus.
LEARNING OBJECTIVE
- Graphical representations and statistics allow us to identify and represent key features of data.
Key Concepts:
- Summary Statistics for a Quantitative Variable
Summary Statistics for a Quantitative Variable
Summary Statistics for a Quantitative Variable
To describe a quantitative distribution numerically, we use measures of center, spread (variability), and position. These are called summary statistics.
Measures of Center
Mean (Arithmetic Average):
\( \displaystyle \bar{x} = \dfrac{\text{sum of data values}}{\text{number of values}} \)
- Sensitive to outliers and skewed data.
- Best for symmetric distributions with no extreme values.
Median:
- The middle value when data are ordered.
- If even number of observations → average of the two middle values.
- Resistant to outliers and skewness.
- Best for skewed or data with outliers.
Mode:
- The most frequently occurring value or values.
- Useful mainly for categorical or discrete data; not as common in AP Stats analysis of quantitative data.
Measures of Spread (Variability)
Range:
\( \text{Range} = \text{Maximum} – \text{Minimum} \)
- Simple, but influenced by outliers.
Interquartile Range (IQR):
\( IQR = Q_3 – Q_1 \)
- Measures the spread of the middle 50% of the data.
- Resistant to outliers.
Variance:
\( s^2 = \dfrac{\sum (x_i – \bar{x})^2}{n-1} \)
- Average squared deviation from the mean.
Standard Deviation:
\( s = \sqrt{ \dfrac{\sum (x_i – \bar{x})^2}{n-1} } \)
- Measures typical distance of data points from the mean.
- Sensitive to outliers and skewness.
Measures of Position
Quartiles:
- \( Q_1 \) = median of lower half (25th percentile).
- \( Q_3 \) = median of upper half (75th percentile).
Percentiles:
The \(k\)-th percentile is a value where \(k\%\) of the data are at or below it.
Z-Score (Standardized Score):
\( z = \dfrac{x – \bar{x}}{s} \)
- Tells how many standard deviations a value is from the mean.
- Positive \(z\) → above mean; Negative \(z\) → below mean.
- Helps compare values from different distributions.
Important Note
When describing data:
- Use mean and standard deviation for symmetric distributions without outliers.
- Use median and IQR for skewed distributions or when outliers are present.
Example :
The exam scores of 8 students are:
72, 75, 78, 80, 82, 84, 85, 90
Compute the mean, median, range, IQR, and standard deviation. Comment on which measures are most appropriate.
▶️ Answer/Explanation
Step 1: Mean
\( \bar{x} = \dfrac{72 + 75 + 78 + 80 + 82 + 84 + 85 + 90}{8} = \dfrac{646}{8} = 80.75 \)
Step 2: Median
Ordered data → middle two values = 80 and 82. Median = \( \dfrac{80+82}{2} = 81 \).
Step 3: Range
Range = \( 90 – 72 = 18 \).
Step 4: Quartiles & IQR
- Lower half (72, 75, 78, 80) → \( Q_1 = \dfrac{75+78}{2} = 76.5 \).
- Upper half (82, 84, 85, 90) → \( Q_3 = \dfrac{84+85}{2} = 84.5 \).
- \( IQR = Q_3 – Q_1 = 84.5 – 76.5 = 8 \).
Step 5: Standard Deviation
\( s = \sqrt{ \dfrac{\sum (x_i – \bar{x})^2}{n-1} } \). Deviations squared → 76.56, 33.06, 7.56, 0.56, 1.56, 10.56, 18.06, 85.56. Sum = 233.5. \( s = \sqrt{ \dfrac{233.5}{7} } = \sqrt{33.36} \approx 5.77 \).
Example :
The daily screen times (in hours) of 10 students are:
1, 2, 2, 3, 3, 4, 4, 5, 6, 15
Compute the mean, median, range, IQR, and standard deviation. Identify the best summary statistics.
▶️ Answer/Explanation
Step 1: Mean
\( \bar{x} = \dfrac{1+2+2+3+3+4+4+5+6+15}{10} = \dfrac{45}{10} = 4.5 \).
Step 2: Median
Ordered data → middle two = 3 and 4. Median = \( \dfrac{3+4}{2} = 3.5 \).
Step 3: Range
Range = \( 15 – 1 = 14 \).
Step 4: Quartiles & IQR
- Lower half (1, 2, 2, 3, 3) → median = 2 → \( Q_1 = 2 \).
- Upper half (4, 4, 5, 6, 15) → median = 5 → \( Q_3 = 5 \).
- \( IQR = Q_3 – Q_1 = 5 – 2 = 3 \).
Step 5: Standard Deviation
Deviations squared (from mean 4.5) → 12.25, 6.25, 6.25, 2.25, 2.25, 0.25, 0.25, 0.25, 2.25, 110.25. Sum = 142.5. \( s = \sqrt{ \dfrac{142.5}{9} } = \sqrt{15.83} \approx 3.98 \).
Step 6: Outlier Check
\( Q_1 – 1.5 \times IQR = 2 – 4.5 = -2.5 \). \( Q_3 + 1.5 \times IQR = 5 + 4.5 = 9.5 \). Any value > 9.5 is an outlier → 15 is an outlier.