Question
The random variables $X, Y$ follow a bivariate normal distribution with product moment correlation coefficient $\rho$.
A random sample of 11 observations on $X, Y$ was obtained and the value of the sample product moment correlation coefficient, $r$, was calculated to be $-0.708$
The covariance of the random variables $U, V$ is defined by
$$
\operatorname{Cov}(U, V)=E((U-E(U))(V-E(V)))
$$
a. State suitable hypotheses to investigate whether or not a negative linear association exists between $X$ and $Y$.
b.i. Determine the $p$-value.
b.ii.State your conclusion at the $1 \%$ significance level.
[1]
c.i. Show that $\operatorname{Cov}(U, V)=E(U V)-E(U) \mathrm{E}(V)$.
c.ii.Hence show that if $U, V$ are independent random variables then the population product moment correlation coefficient, $\rho$, is zero.
[3]
▶️Answer/Explanation
Markscheme
a. * This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.
$\mathrm{H}_0: \rho=0 ; \mathrm{H}_1: \rho<0 \quad \boldsymbol{A 1}$
[1 mark]
b.i. $t=-0.708 \sqrt{\frac{11-2}{1-(-0.708)^2}}=(-3.0075 \ldots) \quad$ (M1)
degrees of freedom $=9$
(A1)
$\mathrm{P}(T<-3.0075 \ldots)=0.00739 \quad A 1$
Note: Accept any answer that rounds to 0.0074 .
[3 marks]
b.iireject $\mathrm{H}_0$ or equivalent statement
R1
Note: Apply follow through on the candidate’s $p$-value.
[1 mark]
$$
\begin{aligned}
& \text { c.i. } \operatorname{Cov}(U, V)+\mathrm{E}((U-\mathrm{E}(U))(V-\mathrm{E}(V))) \\
&= \mathrm{E}(U V-\mathrm{E}(U) V-\mathrm{E}(V) U+\mathrm{E}(U) \mathrm{E}(V)) \quad \text { M1 } \\
&=\mathrm{E}(U V)-\mathrm{E}(\mathrm{E}(U) V)-\mathrm{E}(\mathrm{E}(V) U)+\mathrm{E}(\mathrm{E}(U) \mathrm{E}(V)) \quad \text { (A1 }) \\
&= \mathrm{E}(U V)-\mathrm{E}(U) \mathrm{E}(V)-\mathrm{E}(V) \mathrm{E}(U)+\mathrm{E}(U) \mathrm{E}(V) \quad \text { A1 } \\
& \operatorname{Cov}(U, V)=\mathrm{E}(U V)-\mathrm{E}(U) \mathrm{E}(V) \quad \text { AG } \\
& {[3 \text { marks] }} \\
& \text { c.ii.E}(U V)=\mathrm{E}(U) \mathrm{E}(V)(\text { independent random variables) } \quad \text { R1 } \\
& \Rightarrow \operatorname{Cov}(U, V)=\mathrm{E}(U) \mathrm{E}(V)-\mathrm{E}(U) \mathrm{E}(V)=0 \quad \text { A1 } \\
& \text { hence, } \rho=\frac{\operatorname{Cov}(U, V)}{\sqrt{\operatorname{Var}(U) \operatorname{Var}(V)}}=0 \quad \text { A1AG }
\end{aligned}
$$
Note: Accept the statement that $\operatorname{Cov}(U, V)$ is the numerator of the formula for $\rho$.
Note: Only award the first $\boldsymbol{A} 1$ if the $\boldsymbol{R} \mathbf{1}$ is awarded.
[3 marks]
Question
If \(X\) and \(Y\) are two random variables such that \({\text{E}}(X) = {\mu _X}\) and \({\text{E}}(Y) = {\mu _Y}\) then \({\text{Cov}}(X,{\text{ }}Y) = {\text{E}}\left( {(X – {\mu _X})(Y – {\mu _Y})} \right)\).
a.Prove that if \(X\) and \(Y\) are independent then \({\text{Cov}}(X,{\text{ }}Y) = 0\).[3]
b.In a particular company, it is claimed that the distance travelled by employees to work is independent of their salary. To test this, 20 randomly selected employees are asked about the distance they travel to work and the size of their salaries. It is found that the product moment correlation coefficient, \(r\), for the sample is \( – 0.35\).
You may assume that both salary and distance travelled to work follow normal distributions.
Perform a one-tailed test at the \(5\% \) significance level to test whether or not the distance travelled to work and the salaries of the employees are independent.[8]
▶️Answer/Explanation
Markscheme
METHOD 1
\({\text{Cov}}(X,{\text{ }}Y) = {\text{E}}\left( {(X – {\mu _X})(Y – {\mu _Y})} \right)\)
\( = {\text{E}}(XY – X{\mu _Y} – Y{\mu _X} + {\mu _X}{\mu _Y})\) (M1)
\( = {\text{E}}(XY) – {\mu _Y}{\text{E}}(X) – {\mu _X}{\text{E}}(Y) + {\mu _X}{\mu _Y}\)
\( = {\text{E}}(XY) – {\mu _X}{\mu _Y}\) A1
as \(X\) and \(Y\) are independent \({\text{E}}(XY) = {\mu _X}{\mu _Y}\) R1
\({\text{Cov}}(X,{\text{ }}Y) = 0\) AG
METHOD 2
\({\text{Cov}}(X,{\text{ }}Y) = {\text{E}}\left( {(X – {\mu _x})(Y – {\mu _y})} \right)\)
\( = {\text{E}}(X – {\mu _x}){\text{E}}(Y – {\mu _y})\) (M1)
since \(X,Y\) are independent R1
\( = ({\mu _x} – {\mu _x})({\mu _y} – {\mu _y})\) A1
\( = 0\) AG
[3 marks]
\({H_0}:\rho = 0\;\;\;{H_1}:\rho < 0\) A1
Note: The hypotheses must be expressed in terms of \(\rho \).
test statistic \({t_{test}} = – 0.35\sqrt {\frac{{20 – 2}}{{1 – {{( – 0.35)}^2}}}} \) (M1)(A1)
\( = – 1.585 \ldots \) (A1)
\({\text{degrees of freedom}} = 18\) (A1)
EITHER
\(p{\text{ – value}} = 0.0652\) A1
this is greater than \(0.05\) M1
OR
\({t_{5\% }}(18) = – 1.73\) A1
this is less than \( – {\text{1.59}}\) M1
THEN
hence accept \({H_0}\) or reject \({H_1}\) or equivalent or contextual equivalent R1
Note: Allow follow through for the final R1 mark.
[8 marks]
Total [11 marks]
Examiners report
Solutions to (a) were often disappointing with few candidates gaining full marks, a common error being failure to state that
\(E(XY) = E(X)E(Y)\) or \({\text{E}}\left( {(X – {\mu _x})(Y – {\mu _y})} \right) = {\text{E}}(X – {\mu _x}){\text{E}}(Y – {\mu _y})\) in the case of independence.
In (b), the hypotheses were sometimes given incorrectly. Some candidates gave \({H_1}\) as \(\rho \ne 0\), not seeing that a one-tailed test was required. A more serious error was giving the hypotheses as \({H_0}:r = 0,{\text{ }}{H_1}:r < 0\) which shows a complete misunderstanding of the situation. Subsequent parts of the question were well answered in general.
Question
The random variables X , Y follow a bivariate normal distribution with product moment correlation coefficient ρ.
A random sample of 11 observations on X, Y was obtained and the value of the sample product moment correlation coefficient, r, was calculated to be −0.708.
The covariance of the random variables U, V is defined by
Cov(U, V) = E((U − E(U))(V − E(V))).
a.State suitable hypotheses to investigate whether or not a negative linear association exists between X and Y.[1]
b.i.Determine the p-value.[3]
b.ii.State your conclusion at the 1 % significance level.[1]
c.i.Show that Cov(U, V) = E(UV) − E(U)E(V).[3]
c.ii.Hence show that if U, V are independent random variables then the population product moment correlation coefficient, ρ, is zero.[3]
▶️Answer/Explanation
Markscheme
H0 : ρ = 0; H1 : ρ < 0 A1
[1 mark]
\(t = – 0.708\sqrt {\frac{{11 – 2}}{{1 – {{\left( { – 0.708} \right)}^2}}}} \,\, = \,\,\left( { – 3.0075 \ldots } \right)\) (M1)
degrees of freedom = 9 (A1)
P(T < −3.0075…) = 0.00739 A1
Note: Accept any answer that rounds to 0.0074.
[3 marks]
reject H0 or equivalent statement R1
Note: Apply follow through on the candidate’s p-value.
[1 mark]
Cov(U, V) + E((U − E(U))(V − E(V)))
= E(UV − E(U)V − E(V)U + E(U)E(V)) M1
= E(UV) − E(E(U)V) − E(E(V)U) + E(E(U)E(V)) (A1)
= E(UV) − E(U)E(V) − E(V)E(U) + E(U)E(V) A1
Cov(U, V) = E(UV) − E(U)E(V) AG
[3 marks]
E(UV) = E(U)E(V) (independent random variables) R1
⇒Cov(U, V) = E(U)E(V) − E(U)E(V) = 0 A1
hence, ρ = \(\frac{{{\text{Cov}}\left( {U,\,V} \right)}}{{\sqrt {{\text{Var}}\left( U \right)\,{\text{Var}}\left( V \right)} }} = 0\) A1AG
Note: Accept the statement that Cov(U,V) is the numerator of the formula for ρ.
Note: Only award the first A1 if the R1 is awarded.
[3 marks]
Examiners report
[N/A]
[N/A]
[N/A]
[N/A]
[N/A]