Home / IB DP Maths 2026, 2027 & 2028 / Application and Interpretation HL / IB Mathematics AI AHL Design of valid data collection methods MAI Study Notes

IB Mathematics AI AHL Design of valid data collection methods MAI Study Notes - New Syllabus

IB Mathematics AI AHL Design of valid data collection methods MAI Study Notes

LEARNING OBJECTIVE

  • Design of valid data collection methods, such as surveys and questionnaires.

Key Concepts: 

  •  Selecting/Choosing relevant variables from many variables.
  • Categorizing numerical data 
  • Definition of reliability and validity. 

MAI HL and SL Notes – All topics

SURVEY AND QUESTIONNAIRE DESIGN

Purpose:
To gather structured, unbiased, and relevant information from participants.

Key Features of Good Design:

  • Unbiased phrasing (avoid leading questions)
  • Structured layout (clear sections and question flow)
  • Consistent answer choices (e.g., Likert scale)
  • Precision in questioning (specific and measurable)

Example:

  • Bad: “Don’t you think the AI tutor is great?”

  • Good: “On a scale of 1–5, how would you rate the effectiveness of the AI tutor in helping you understand mathematical concepts?”

Tip:
Pilot Test the survey on a small group to check clarity and identify issues.

VARIABLE SELECTION

Definition:

Choosing the most relevant variables (also called features) for your AI model or statistical analysis.

Goals:

  • Maximize relevance to the outcome
  • Eliminate irrelevant or redundant variables
  • Consider cost and feasibility of measurement

Example:
Predicting student performance in math:

  •  Relevant: Previous grades, time on homework, attendance
  •  Irrelevant: Hair color, favorite food, number of siblings

DATA SELECTION FOR ANALYSIS

Importance:

Only appropriate, clean, and representative data should be analyzed to ensure valid outcomes.

Key Criteria:

  • Accuracy (no errors)
  • Completeness (no missing values)
  • Consistency (uniform format)
  • Relevance (aligned with research goals)
  • Representativeness (sample mirrors the target population)

Note:

In AI: “Garbage in, garbage out.” Low-quality data leads to unreliable models.

CHI-SQUARED TABLE CATEGORIZATION

Used for:

Analyzing categorical data to test for independence or goodness of fit.

Good Practices:

  • Expected frequencies > 5 for each cell (otherwise results are unreliable)
  • Logical grouping of data (e.g., age ranges: 0–10, 11–20)
  • Avoid:
    • Too few categories → loss of detail
    • Too many → small expected frequencies → reduced statistical power

DEGREES OF FREEDOM IN CHI-SQUARED TESTS

Definition:

Number of independent values that can vary in a calculation.

Goodness of Fit Test:

$\text{df} = \text{number of categories} – 1 – \text{number of parameters estimated}$

Test for Independence (contingency table):

$\text{df} = (\text{rows} – 1) \times (\text{columns} – 1)$

Common Mistake:

Forgetting to subtract estimated parameters → incorrect df → wrong conclusions.

RELIABILITY AND VALIDITY

Definition of Reliability

Reliability = Consistency

A reliable method yields consistent results under the same conditions.

Types of Reliability Tests (with Examples)

TypeDescriptionExample
Test–RetestSame test, same group, different timesA university gives the same math test at the beginning and end of a semester (with no new teaching). High correlation = high test–retest reliability.
Parallel FormsTwo equivalent testsDriving license exams with two question versions. Similar scores = strong reliability.
Inter-RaterMultiple raters, same taskGymnastics judges score routines independently. High agreement (e.g., Cohen’s Kappa) = strong reliability.
Internal ConsistencySimilarity of items within one testA 20-question anxiety test shows strong correlation between items (via Cronbach’s Alpha), indicating internal consistency.

Definition of Validity

Validity = Accuracy

How well a method measures what it is supposed to measure.

Types of Validity Tests (with Examples)

TypeDescriptionExample
Content ValidityCovers all aspects of the constructA physics exam is reviewed by experts to ensure it tests all key topics like Newton’s laws and motion.
Criterion ValidityCorrelation with external outcomeA company uses an aptitude test and compares scores with job performance after 6 months (predictive validity).
Construct ValidityMeasures intended theoretical conceptA new depression scale correlates with brain scans and established tools → shows construct validity.
Face ValidityAppears to measure the right thingA customer satisfaction survey asks about delivery, pricing, and service. People easily recognize its purpose.

Distinguishing Between Reliability and Validity

ConceptMeaningCan it exist without the other?Example
ReliabilityConsistencyYes – a test can be reliable but not validA miscalibrated scale always shows 5kg too heavy → reliable but invalid
ValidityAccuracyNo – a valid test must be reliableA tutor gives very different scores each time → not reliable, so can’t be valid

Conclusion:

  • Reliability ≠ Validity
  • A method must be reliable to be valid, but reliable methods aren’t always valid
Scroll to Top