Home / IB Mathematics SL 4.4 Linear correlation of bivariate data AI SL Paper 2 – Exam Style Questions

IB Mathematics SL 4.4 Linear correlation of bivariate data AI SL Paper 2 - Exam Style Questions - New Syllabus

Question

The mean annual temperatures for Earth, recorded at fifty-year intervals, are shown in the table below. Tami and Thandizo are investigating linear models to represent this global warming trend.
Year (\( x \))\( 1708 \)\( 1758 \)\( 1808 \)\( 1858 \)\( 1908 \)\( 1958 \)\( 2008 \)
Temperature \( ^\circ\text{C} \) (\( y \))\( 8.73 \)\( 9.22 \)\( 9.10 \)\( 9.12 \)\( 9.13 \)\( 9.45 \)\( 9.76 \)
Tami creates a linear model by finding the equation of the straight line passing through the points \( (1708, 8.73) \) and \( (1958, 9.45) \).
(a) Calculate the gradient of Tami’s straight line.
(b) (i) Interpret the meaning of the gradient in the context of global temperatures.
(ii) State the appropriate units for this gradient.
(c) Find the equation of Tami’s line in the form \( y = mx + c \).
(d) Use Tami’s model to estimate the mean annual temperature in the year \( 2000 \).
Thandizo decides to use all the data points to perform a linear regression.
(e) (i) Determine the equation of the regression line \( y \) on \( x \).
(ii) Find the value of \( r \), the Pearson’s product-moment correlation coefficient.
(f) Use Thandizo’s model to estimate the mean annual temperature in the year \( 2000 \).
Thandizo uses his regression line to predict the year when the mean annual temperature will first exceed \( 15 ^\circ\text{C} \).
(g) State two reasons why Thandizo’s prediction may not be valid.

Most-appropriate topic codes (IB Mathematics AI SL 2025):

SL 2.1: Linear functions and their graphs — parts (a), (c), (d)
SL 4.4: Linear regression and Pearson’s correlation coefficient — part (e)
SL 4.4: Interpolation and extrapolation; regression line interpretation — parts (b), (f), (g)
▶️ Answer/Explanation

(a)
\( m = \frac{9.45 – 8.73}{1958 – 1708} = \frac{0.72}{250} \).
\(\boxed{0.00288}\)

(b)
(i) The gradient represents the average increase in temperature per year.
(ii) Units: \(\boxed{^\circ\text{C}/\text{year}}\)

(c)
Using \( y – y_1 = m(x – x_1) \):
\( y – 8.73 = 0.00288(x – 1708) \)
\( y = 0.00288x – 4.91904 + 8.73 \)
\(\boxed{y = 0.00288x + 3.81}\)

(d)
\( y = 0.00288(2000) + 3.81 = 5.76 + 3.81 \).
\(\boxed{9.57 ^\circ\text{C}}\)

(e)
Using a GDC for linear regression on all 7 data points:
(i) \(\boxed{y = 0.00256x + 4.46}\)
(ii) \(\boxed{r \approx 0.861}\)

(f)
\( y = 0.00256(2000) + 4.46 = 5.12 + 4.46 \).
\(\boxed{9.58 ^\circ\text{C}}\)

(g)
1. Extrapolation: \( 15 ^\circ\text{C} \) is well outside the range of observed data; linear trends may not continue indefinitely.
2. Regression Direction: The line \( y \) on \( x \) is used to predict temperature from years. To predict years from temperature, an \( x \) on \( y \) regression line should ideally be used.

Markscheme

(a) \( 0.00288 \) M1A1

(b) (i) Yearly change in temperature A1, (ii) \( ^\circ\text{C}/\text{year} \) A1

(c) \( y = 0.00288x + 3.81 \) M1A1

(d) \( 9.57 ^\circ\text{C} \) M1A1

(e) (i) \( y = 0.00256x + 4.46 \), (ii) \( 0.861 \) M1A1A1

(f) \( 9.58 ^\circ\text{C} \) M1A1

(g) Extrapolation and Regression direction A1A1

Scroll to Top