IB Mathematics AI SL Concepts of population, sample MAI Study Notes - New Syllabus
IB Mathematics AI SL Concepts of population, sample MAI Study Notes
LEARNING OBJECTIVE
- Concepts of population, sample, random sample, discrete and continuous data.
Key Concepts:
- Sampling
- Data Collection
- Statistical Measures
- IBDP Maths AI SL- IB Style Practice Questions with Answer-Topic Wise-Paper 1
- IBDP Maths AI SL- IB Style Practice Questions with Answer-Topic Wise-Paper 2
- IB DP Maths AI HL- IB Style Practice Questions with Answer-Topic Wise-Paper 1
- IB DP Maths AI HL- IB Style Practice Questions with Answer-Topic Wise-Paper 2
- IB DP Maths AI HL- IB Style Practice Questions with Answer-Topic Wise-Paper 3
BASIC CONCEPTS OF STATISTICS
Statistics is the science and practice of collecting, organizing, presenting, analyzing, and interpreting data to help in making more effective decisions and draw meaningful conclusions from complex datasets. It is widely applied across various fields.

Data, the raw material of statistics, can originate from a Population, which is the complete set of all individuals or items of interest, or a Sample, a subset of the population selected for study. A Random Sample is particularly important as it ensures each member of the population has an equal chance of selection, helping to minimize bias.
Data can be broadly classified into different types:
- Qualitative Data: Descriptive data that represents attributes or characteristics that are not numerical (e.g., gender, eye color).
- Quantitative Data: Numerical data that represents counts or measurements. This can be further divided into:- Discrete Data: Data that can only take specific, separate values, often resulting from counting.- Examples: Number of students in a class, the number of cars in a parking lot.
 
- Continuous Data: Data that can take any value within a given range or interval, typically resulting from measurement.- Examples: Height, weight, temperature.
 
 
- Discrete Data: Data that can only take specific, separate values, often resulting from counting.
A Variable is a characteristic of interest that can have different values for different elements. An Observation is the set of measurements collected for a particular element.
DATA COLLECTION METHODS
Data Collection Methods
Collecting relevant and reliable data is a critical first step in statistics. Common methods include:
- Surveys and Questionnaires: Gathering information by asking a set of predefined questions to a group of individuals. These can be administered online, by phone, in person, or by mail.
- Interviews: Conducting in-depth discussions to gather detailed information on experiences, opinions, or behaviors.
- Observations: Systematically watching and recording behaviors or phenomena.
- Experiments: Conducting controlled tests to investigate cause-and-effect relationships.
- Secondary Data Analysis: Utilizing data that has already been collected and published by others (e.g., government reports, academic journals, existing databases).
Bias in Sampling and Data Reliability
As mentioned earlier, selecting a sample which is not biased is crucial for drawing accurate conclusions about the population. Bias can occur through flawed sampling methods or issues in data collection. Reliability of data sources and bias in sampling also involves addressing practical issues like dealing with missing data and errors in the recording of data.
INTERPRETATION OF OUTLIERS
Definition:
An outlier is a data point that differs significantly from other observations in a dataset. Outliers may occur due to variability in the data, measurement errors, or indicate a novel finding.
How to Detect Outliers:
- Using IQR (Interquartile Range):
 A value is considered an outlier if it lies: \[ \text{Below } Q_1 – 1.5 \times \text{IQR} \quad \text{or} \quad \text{Above } Q_3 + 1.5 \times \text{IQR} \] where IQR = \( Q_3 – Q_1 \)
- Using z-scores:
 A data point is an outlier if its z-score is beyond ±3 (in standard normal distribution context).
Interpretation of Outliers:
- Data Entry Error: An outlier could be the result of a mistake (e.g., typo, misreading).
- Natural Variability: The value may be valid but rare (e.g., exceptionally high test score).
- Different Group: The outlier may belong to a different population or subgroup.
- Impact on Statistical Measures:- Can distort the mean and standard deviation
- Less impact on median and IQR
- Can influence the shape of distributions (e.g., skewness)
 
What to Do with Outliers:
- Investigate the cause before removing.
- Determine if the outlier is valid and meaningful.
- If not valid, consider removing or adjusting the value.
- Consider using robust statistics like median or trimmed mean if outliers exist.
| Example Given the following data set: 7, 8, 9, 10, 11, 12, 13, 14, 15, 50 ▶️ Show SolutionSolution: Order the data (already sorted) Q1 (25th percentile): Median of lower half = (8 + 9) ÷ 2 = 8.5 The number 50 is an unusually high value in this dataset. | 
SAMPLING TECHNIQUES
Sampling Techniques and Their Effectiveness:
- Simple Random Sampling:- Method: Each member of the population has an equal chance of being selected (e.g., drawing names from a hat).
- Example (Population 100,000, Sample 100): Select 100 people randomly from the entire group.
- Effectiveness: Fair; generally considered effective in minimizing bias. Can be time-consuming.
 
- Systematic Sampling:- Method: Select a random starting point and then choose every k-th member of the population, where k is the ratio of population size to sample size.
- Example (Population 100,000, Sample 100): k = 100,000 / 100 = 1000. Pick a random start (e.g., 8th person) and select the 8th, 1008th, 2008th, etc.
- Effectiveness: Can be efficient. Risk of bias if there’s a periodic pattern in the population that aligns with the sampling interval (e.g., if every 1000th person has a specific characteristic).
 
- Stratified Sampling:- Method: Divide the population into subgroups (strata) based on relevant characteristics (e.g., gender, age groups). Then, draw a sample from each stratum.
 
- Quota Sampling:- Method: Similar to stratified sampling, the population is divided into subgroups. However, a specific number (quota) of individuals is selected from each subgroup, often proportionally to their representation in the population.
 
- Convenience Sampling:- Method: Selecting individuals who are easily accessible or convenient to the researcher.
- Example: Surveying only the students in your class.
- Effectiveness: Easy to implement but highly likely to be biased as the sample is unlikely to be representative of the population.
 


