Home / IB DP Maths / IB Math Analysis and Approach HL / MAA HL Study Notes / Intersections of a line with a planes Study Notes

IB Mathematics AA Linear correlation of bivariate data Study Notes

IB Mathematics AA Linear correlation of bivariate data Study Notes

IB Mathematics AA Linear correlation of bivariate data Study Notes

IB Mathematics AA Linear correlation of bivariate data Notes Offer a clear explanation of Linear correlation of bivariate data, including various formula, rules, exam style questions as example to explain the topics. Worked Out examples and common problem types provided here will be sufficient to cover for topic Linear correlation of bivariate data.

Linear Correlation of Bivariate Data

Introduction

Linear correlation of bivariate data explores the relationship between two variables to determine how one may change in response to the other. This topic includes understanding Pearson’s product-moment correlation coefficient (\( r \)), scatter diagrams, regression lines, and the implications of correlation and causation in real-world contexts.

Key Concepts

Correlation and Pearson’s Product-Moment Correlation Coefficient (\( r \))
  • Definition: Measures the strength and direction of a linear relationship between two variables. Values of \( r \) range from -1 to 1:
    • \( r = 1 \): Perfect positive correlation
    • \( r = -1 \): Perfect negative correlation
    • \( r = 0 \): No correlation
  • Calculation:

    The formula for \( r \) is:

    \( r = \frac{\sum (x_i – \bar{x})(y_i – \bar{y})}{\sqrt{\sum (x_i – \bar{x})^2 \sum (y_i – \bar{y})^2}} \)

    where:

    • \( x_i, y_i \): Individual data points
    • \( \bar{x}, \bar{y} \): Means of \( x \) and \( y \)

    Hand calculations enhance conceptual understanding, while technology simplifies computation for larger data sets.

  • Critical Values: Students should be familiar with critical values of \( r \) for significance testing when provided.
  • Important Note: Pearson’s \( r \) is only meaningful for linear relationships.
Solved Example 1: Calculating \( r \)

Problem: Given the following data, calculate the correlation coefficient (\( r \)):

\( x \)12345
\( y \)24545

Solution:

    1. Calculate the means: \( \bar{x} = 3 \), \( \bar{y} = 4 \).
    2. Compute \( (x_i – \bar{x}) \), \( (y_i – \bar{y}) \), and their products:
\( x_i \)\( y_i \)\( x_i – \bar{x} \)\( y_i – \bar{y} \)\( (x_i – \bar{x})(y_i – \bar{y}) \)
12-2-24
24-100
35010
44100
55212
  1. Calculate the sums: \( \sum (x_i – \bar{x})(y_i – \bar{y}) = 6 \), \( \sum (x_i – \bar{x})^2 = 10 \), \( \sum (y_i – \bar{y})^2 = 6 \).
  2. Substitute into the formula:

    \( r = \frac{6}{\sqrt{10 \cdot 6}} = \frac{6}{\sqrt{60}} \approx 0.77 \).

Interpretation: There is a strong positive correlation between \( x \) and \( y \).

Scatter Diagrams and Lines of Best Fit
  • Scatter Diagrams: Graphical representation of bivariate data to visualize relationships.
  • Lines of Best Fit:
    • Drawn by eye, passing through the mean point (\( \bar{x}, \bar{y} \)).
    • Helps in identifying trends and predicting values.
Solved Example 2: Line of Best Fit

Problem: Draw the line of best fit for the data:

\( x \)12345
\( y \)24545

Solution:

  1. Calculate \( \bar{x} = 3 \), \( \bar{y} = 4 \).
  2. Use technology or manual methods to determine the regression equation: \( y = 0.8x + 2.6 \).
  3. Plot the points on a scatter diagram and draw the line passing through \( (3, 4) \).

IB Mathematics AA SL Linear correlation of bivariate data Exam Style Worked Out Questions

Question

Observations on 12 pairs of values of the random variables X , Y yielded the following results.

Σx = 76.3 , Σx 2 = 563.7, Σy = 72.2, Σy 2 = 460.1, Σxy = 495.4

    1. (i) Calculate the value of r , the product moment correlation coefficient of the sample.

      (ii) Assuming that the distribution of X , Y is bivariate normal with product moment correlation coefficient ρ , calculate the p-value of your result when testing the hypotheses H0 : ρ = 0; H1 : ρ > 0.

  1.   (iii) State whether your p-value suggests that X and Y are independent. [7]
  2. b             Given a further value x = 5.2 from from the distribution of X , Y , predict the corresponding value of y . Give your answer to one decimal place. [3]
▶️Answer/Explanation

Ans:

(a)

(i) use of 

(ii)

t = 0.80856… \(\sqrt{\frac{10}{1-0.80856…}}\)

= 4.345…

p-value = 7.27 × 10-4 

(iii) this value indicates that X,Y are not independent

(b)

use of

putting x = 5.2 gives y = 5.5

Question

Jim is investigating the relationship between height and foot length in teenage boys.

A sample of 13 boys is taken and the height and foot length of each boy are measured.

The results are shown in the table.

You may assume that this is a random sample from a bivariate normal distribution.

Jim wishes to determine whether or not there is a positive association between height and foot length.

a.Calculate the product moment correlation coefficient.[2]

b.Find the \(p\)value.[2]

c.Interpret the \(p\)value in the context of the question.[1]

d.Find the equation of the regression line of \(y\) on \(x\).[2]

e.Estimate the foot length of a boy of height 170 cm.[2]

▶️Answer/Explanation

Markscheme

a.Note: In all parts accept answers which round to the correct 2sf answer.

\(r = 0.806\)     A2

b.

\(4.38 \times {10^{ – 4}}\)     A2

c.

\(p\)-value represents strong evidence to indicate a (positive) association between height and foot length     A1

Note: FT the \(p\)-value

d.

\(y = 0.103x + 12.3\)     A2

e.

attempted substitution of \(x = 170\)     (M1)

\(y = 29.7\)     A1

Note: Accept \(y = 29.8\)

 

More resources for IB Mathematics AA SL

Scroll to Top