AP Statistics 1.3 Representing a Categorical Variable with Tables Study Notes
AP Statistics 1.3 Representing a Categorical Variable with Tables- New syllabus
AP Statistics 1.3 Representing a Categorical Variable with Tables Study Notes -As per latest AP Statistics Syllabus.
LEARNING OBJECTIVE
- Graphical representations and statistics allow us to identify and represent key features of data.
Key Concepts:
- Representing a Categorical Variable with Tables
- How to Describe Categorical Data
Represent Categorical Data Using Frequency or Relative Frequency Tables
Purpose of Tables:
- To organize raw categorical data into a clear and interpretable format.
- Helps identify patterns, compare categories, and prepare data for graphical representation (bar graphs, pie charts).
- Summarizes large amounts of data in a compact structure.
Types of Tables for Categorical Data:
Frequency Table:
- Lists each category of the variable.
- Shows the count (frequency) of observations that fall into each category.
- Good for showing raw numbers but less useful for comparisons when sample sizes differ.
Relative Frequency Table:
- Lists each category along with the proportion (or percentage) of the total sample in that category.
- Formula: \( \text{Relative Frequency} = \dfrac{\text{Category Frequency}}{\text{Total Frequency}} \)
- Relative frequencies can be expressed as decimals, fractions, or percentages.
- Useful for comparing distributions across groups of different sizes.
Cumulative Frequency Table:
- Shows the running total of frequencies up to a certain category.
- More often used with ordered categorical or quantitative data.
Two-Way (Contingency) Table:
- Summarizes the relationship between two categorical variables.
- Displays joint frequencies (counts of individuals that fall into a combination of categories).
- Forms the basis for conditional relative frequencies and association analysis.
Steps in Creating a Frequency/Relative Frequency Table:
- Identify all unique categories of the variable.
- Count the number of individuals in each category (frequency).
- Compute relative frequency by dividing each frequency by the total number of observations.
- Verify that the sum of relative frequencies equals 1 (or 100%).
Key Advantages of Using Tables:
- Organizes data systematically for analysis.
- Relative frequency tables allow fair comparison between datasets of different sizes.
- Provides the foundation for constructing graphs (bar graphs, pie charts, segmented bar charts).
- Helps detect dominant and less common categories quickly.
Example:
A survey asked 30 students about their favorite type of movie. Results: 12 chose Action, 8 chose Comedy, 6 chose Drama, and 4 chose Horror. Create a frequency and relative frequency table.
▶️ Answer/Explanation
Step 1: Organize into a frequency table.
Movie Type | Frequency | Relative Frequency |
---|---|---|
Action | 12 | \( \dfrac{12}{30} = 0.40 \) → 40% |
Comedy | 8 | \( \dfrac{8}{30} \approx 0.27 \) → 26.7% |
Drama | 6 | \( \dfrac{6}{30} = 0.20 \) → 20% |
Horror | 4 | \( \dfrac{4}{30} \approx 0.13 \) → 13.3% |
Total | 30 | 100% |
Step 2: Interpretation.
- 40% of students prefer Action movies, the most popular category.
- Only 13.3% prefer Horror movies, the least popular.
Final Point: Frequency tables summarize raw counts, while relative frequency tables allow comparisons across groups of different sizes.
How to Describe Categorical Data
How to Describe Categorical Data:
- Identify which category has the highest or lowest frequency (most or least common).
- Use relative frequencies (percentages) to make comparisons between groups or datasets.
- Comment on overall trends — whether data are evenly distributed or skewed toward certain categories.
- Use cumulative or two-way tables to describe totals, relationships, or patterns between variables.
Example:
A class survey of 25 students asked for their preferred type of smartphone: 10 chose Apple, 7 chose Samsung, 5 chose OnePlus, and 3 chose Other. Create a frequency and relative frequency table and describe the results.
▶️ Answer/Explanation
Step 1: Organize into a frequency table with relative frequencies.
Smartphone Brand | Frequency | Relative Frequency |
---|---|---|
Apple | 10 | \( \dfrac{10}{25} = 0.40 \) → 40% |
Samsung | 7 | \( \dfrac{7}{25} = 0.28 \) → 28% |
OnePlus | 5 | \( \dfrac{5}{25} = 0.20 \) → 20% |
Other | 3 | \( \dfrac{3}{25} = 0.12 \) → 12% |
Total | 25 | 100% |
Step 2: Description and Interpretation.
- 40% of students prefer Apple, the most popular choice.
- Only 12% prefer Other brands, the least popular category.
- Samsung (28%) and OnePlus (20%) show moderate popularity.
Final Point: Describing data focuses on interpreting what frequencies reveal — such as popularity, trends, and comparisons.
Example:
A group of 40 students was asked how many books they read last month. The data were grouped into categories: 0–1 books (8 students), 2–3 books (12 students), 4–5 books (10 students), 6–7 books (6 students), and 8+ books (4 students). Construct and describe a cumulative frequency table.
▶️ Answer/Explanation
Step 1: Frequency and cumulative frequency table.
Books Read | Frequency | Cumulative Frequency |
---|---|---|
0–1 | 8 | 8 |
2–3 | 12 | 20 |
4–5 | 10 | 30 |
6–7 | 6 | 36 |
8+ | 4 | 40 |
Step 2: Description.
- 20 students read 3 or fewer books.
- 30 students read 5 or fewer books.
- All 40 students are represented by the final row.
Final Point: Cumulative frequency helps describe totals and locate medians or quartiles.