If any of my solutions look wrong, please refer to the mark scheme. You can exit full-screen mode for the question paper and mark scheme by clicking the icon in the bottom-right corner or by pressing Esc on your keyboard.

A-Level Statistics 2019 (Edexcel Paper 31)

Pearson Edexcel GCE A-Level Statistics (Paper 31) – June 2019

📝 Exam Guide

  • Paper Reference: 9MA0/31
  • Topics: Probability, Box Plots, Normal Distribution, Correlation, Hypothesis Testing
  • Total Marks: 50
  • Calculators: Allowed and required

Question 1 (8 marks)

Three bags, \( A, B \) and \( C \), each contain 1 red marble and some green marbles.

  • Bag \( A \) contains 1 red marble and 9 green marbles only
  • Bag \( B \) contains 1 red marble and 4 green marbles only
  • Bag \( C \) contains 1 red marble and 2 green marbles only

Sasha selects at random one marble from bag \( A \).

If he selects a red marble, he stops selecting.

If the marble is green, he continues by selecting at random one marble from bag \( B \).

If he selects a red marble, he stops selecting.

If the marble is green, he continues by selecting at random one marble from bag \( C \).

(a) Draw a tree diagram to represent this information.

(b) Find the probability that Sasha selects 3 green marbles.

(c) Find the probability that Sasha selects at least 1 marble of each colour.

(d) Given that Sasha selects a red marble, find the probability that he selects it from bag \( B \).

Worked Solution

Step 1: Drawing the Tree Diagram (Part a)

Why we do this: A tree diagram helps visualize the sequential events. We start at Bag A. If we get Green, we move to Bag B. If we get Green again, we move to Bag C. If we get Red at any point, we stop.

How:

  • Bag A: Total 10 marbles (1 Red, 9 Green). \( P(R) = \frac{1}{10} \), \( P(G) = \frac{9}{10} \).
  • Bag B: Total 5 marbles (1 Red, 4 Green). \( P(R) = \frac{1}{5} \), \( P(G) = \frac{4}{5} \).
  • Bag C: Total 3 marbles (1 Red, 2 Green). \( P(R) = \frac{1}{3} \), \( P(G) = \frac{2}{3} \).
9/10 1/10 G R 4/5 1/5 G R 2/3 1/3 G R
Step 2: Probability of 3 Green Marbles (Part b)

Strategy: Follow the path G \( \to \) G \( \to \) G on the tree diagram.

\[ P(G, G, G) = \frac{9}{10} \times \frac{4}{5} \times \frac{2}{3} \] \[ = \frac{72}{150} \] \[ = \frac{12}{25} \quad \text{or} \quad 0.48 \]

(2 marks)

Step 3: Probability of at least 1 marble of each colour (Part c)

Why we do this: The question asks for “at least 1 marble of each colour”. Since we stop on Red, we only pick multiple marbles if we pick Greens first.

However, re-reading the question: “Three bags… Sasha selects from A…”.

Wait, if we select Red immediately, we have 1 Red, 0 Green. (Not 1 of each).

If we select Green then Red, we have 1 Green, 1 Red. (1 of each).

If we select Green then Green then Red, we have 2 Green, 1 Red. (1 of each).

If we select Green then Green then Green, we have 3 Green, 0 Red. (Not 1 of each).

So the valid outcomes are:

  • Green from A, Red from B (End). Outcome: {G, R}
  • Green from A, Green from B, Red from C (End). Outcome: {G, G, R}
\[ P(\text{1 of each}) = P(G, R) + P(G, G, R) \] \[ P(G, R) = \frac{9}{10} \times \frac{1}{5} = \frac{9}{50} \] \[ P(G, G, R) = \frac{9}{10} \times \frac{4}{5} \times \frac{1}{3} = \frac{36}{150} = \frac{6}{25} \]

Now sum them:

\[ \frac{9}{50} + \frac{6}{25} = \frac{9}{50} + \frac{12}{50} = \frac{21}{50} \] \[ = 0.42 \]

(2 marks)

Step 4: Conditional Probability (Part d)

Why we do this: We need \( P(\text{Red from B} \mid \text{Red selected}) \).

Formula: \( P(A \mid B) = \frac{P(A \cap B)}{P(B)} \).

Here: \( \frac{P(\text{Red from B})}{P(\text{Any Red selected})} \).

Calculate Denominator (Total Probability of Red):

  • Red from A: \( \frac{1}{10} \)
  • Red from B: \( \frac{9}{10} \times \frac{1}{5} = \frac{9}{50} \)
  • Red from C: \( \frac{9}{10} \times \frac{4}{5} \times \frac{1}{3} = \frac{6}{25} \) (Wait, calculated above as 36/150)

Sum of all probabilities involving Red:

\[ P(\text{Red}) = \frac{1}{10} + \frac{9}{50} + \frac{12}{50} \] \[ = \frac{5}{50} + \frac{9}{50} + \frac{12}{50} = \frac{26}{50} \]

Alternatively: \( P(\text{Red}) = 1 – P(\text{No Red}) = 1 – P(G,G,G) = 1 – \frac{12}{25} = \frac{13}{25} = \frac{26}{50} \). This matches.

Calculate Numerator (Red from B):

\[ P(\text{Red from B}) = \frac{9}{50} \]
\[ P(\text{Red from B} \mid \text{Red}) = \frac{9/50}{26/50} \] \[ = \frac{9}{26} \]

(2 marks)

Final Answers:

(b) \( 0.48 \) or \( \frac{12}{25} \)

(c) \( 0.42 \) or \( \frac{21}{50} \)

(d) \( \frac{9}{26} \)

✓ Total: 8 marks

↑ Back to Top

Question 2 (11 marks)

The partially completed box plot below shows the distribution of daily mean air temperatures using the data from the large data set for Beijing in 2015.

7 9 11 13 15 17 19 21 23 25 27 29 31 33 Temperature (°C)

An outlier is defined as a value more than \( 1.5 \times \text{IQR} \) below \( Q_1 \) or more than \( 1.5 \times \text{IQR} \) above \( Q_3 \).

The three lowest air temperatures in the data set are \( 7.6^\circ\text{C} \), \( 8.1^\circ\text{C} \) and \( 9.1^\circ\text{C} \).

The highest air temperature in the data set is \( 32.5^\circ\text{C} \).

(a) Complete the box plot above showing clearly any outliers.

(b) Using your knowledge of the large data set, suggest from which month the two outliers are likely to have come.

Using the data from the large data set, Simon produced the following summary statistics for the daily mean air temperature, \( x^\circ\text{C} \), for Beijing in 2015

\[ n = 184 \qquad \sum x = 4153.6 \qquad S_{xx} = 4952.906 \]

(c) Show that, to 3 significant figures, the standard deviation is \( 5.19^\circ\text{C} \).

Simon decides to model the air temperatures with the random variable

\[ T \sim N(22.6, 5.19^2) \]

(d) Using Simon’s model, calculate the 10th to 90th interpercentile range.

Simon wants to model another variable from the large data set for Beijing using a normal distribution.

(e) State two variables from the large data set for Beijing that are not suitable to be modelled by a normal distribution. Give a reason for each answer.

Worked Solution

Step 1: Completing the Box Plot (Part a)

Strategy: Calculate the Interquartile Range (IQR) and the outlier boundaries (fences). Determine which points are outliers and draw the whiskers to the last non-outlier values.

Values from graph/text:

  • \( Q_1 = 19.4 \) (Read from left of box)
  • \( Q_3 = 26.6 \) (Read from right of box)
  • \( \text{IQR} = 26.6 – 19.4 = 7.2 \)

Outlier Fences:

  • Lower Fence: \( Q_1 – 1.5 \times \text{IQR} = 19.4 – 1.5(7.2) = 19.4 – 10.8 = 8.6 \)
  • Upper Fence: \( Q_3 + 1.5 \times \text{IQR} = 26.6 + 1.5(7.2) = 26.6 + 10.8 = 37.4 \)

Analyzing Data Points:

  • Lowest values: 7.6, 8.1, 9.1.
  • Are they below 8.6? Yes: 7.6 and 8.1 are outliers. 9.1 is inside the fence.
  • Lower whisker goes to 9.1.
  • Highest value: 32.5.
  • Is it above 37.4? No.
  • Upper whisker goes to 32.5.
71217222732

(4 marks)

Step 2: Interpretation and Calculation (Parts b, c)

(b) Month of outliers: The outliers are very low temperatures (7.6, 8.1). In Beijing (Northern Hemisphere), the coldest months are in winter. The data covers May to October. The coldest month in this range would be October.

(c) Standard Deviation:

Use the formula \( \sigma = \sqrt{\frac{S_{xx}}{n}} \).

\[ \sigma = \sqrt{\frac{4952.906}{184}} \] \[ \sigma = \sqrt{26.9179…} \] \[ \sigma = 5.18825… \] \[ \sigma \approx 5.19 \text{ (to 3 s.f.)} \]

(b: 1 mark, c: 1 mark)

Step 3: Interpercentile Range (Part d)

Why we do this: We need the difference between the 90th percentile and the 10th percentile of the normal distribution \( N(22.6, 5.19^2) \).

How: Find the \( z \)-score for 90% (Area = 0.9) and 10% (Area = 0.1).

From tables or calculator, \( z \approx 1.2816 \) for 90%.

The 10th percentile is symmetrical, so \( z \approx -1.2816 \).

Range = \( 2 \times z \times \sigma \).

\[ P_{90} = \mu + 1.2816\sigma \] \[ P_{10} = \mu – 1.2816\sigma \] \[ \text{Range} = P_{90} – P_{10} = 2(1.2816)\sigma \] \[ = 2(1.2816)(5.19) \] \[ = 13.303… \] \[ \approx 13.3 \]

Alternatively calculate values:

\[ P_{90} = 22.6 + 1.2816(5.19) = 29.25 \] \[ P_{10} = 22.6 – 1.2816(5.19) = 15.95 \] \[ 29.25 – 15.95 = 13.3 \]

(3 marks)

Step 4: Critique of Normal Model (Part e)

We need variables from the Large Data Set (LDS) for Beijing that aren’t Normal.

Common Answers:

  • Rainfall: It is not symmetric (skewed). There are many days with 0 rainfall, so it’s not bell-shaped.
  • Wind Speed / Beaufort conversion: Wind speed is skewed. Beaufort scale is discrete/qualitative (ordinal), not continuous.

(2 marks)

Final Answers:

(a) Box plot drawn with whiskers to 9.1 and 32.5, outliers at 7.6 and 8.1.

(b) October

(c) Shown (\( 5.19 \))

(d) \( 13.3 \)

(e) Rainfall (skewed/zero values), Wind Speed (skewed), or Beaufort (discrete).

✓ Total: 11 marks

↑ Back to Top

Question 3 (9 marks)

Barbara is investigating the relationship between average income (GDP per capita), \( x \) US dollars, and average annual carbon dioxide (\( \text{CO}_2 \)) emissions, \( y \) tonnes, for different countries.

She takes a random sample of 24 countries and finds the product moment correlation coefficient between average annual \( \text{CO}_2 \) emissions and average income to be \( 0.446 \).

(a) Stating your hypotheses clearly, test, at the 5% level of significance, whether or not the product moment correlation coefficient for all countries is greater than zero.

Barbara believes that a non-linear model would be a better fit to the data.

She codes the data using the coding \( m = \log_{10} x \) and \( c = \log_{10} y \) and obtains the model \( c = -1.82 + 0.89m \).

The product moment correlation coefficient between \( c \) and \( m \) is found to be \( 0.882 \).

(b) Explain how this value supports Barbara’s belief.

(c) Show that the relationship between \( y \) and \( x \) can be written in the form \( y = ax^n \) where \( a \) and \( n \) are constants to be found.

Worked Solution

Step 1: Hypothesis Test for Correlation (Part a)

Why we do this: We are testing if the population correlation coefficient (\( \rho \)) is positive (“greater than zero”).

Hypotheses:

  • \( H_0: \rho = 0 \) (No correlation)
  • \( H_1: \rho > 0 \) (Positive correlation)

Critical Value: Sample size \( n = 24 \), Significance level 5% (0.05). One-tailed test.

From tables for PMCC with \( n=24 \): Critical Value \( r_{cv} = 0.3438 \).

Test Statistic \( r = 0.446 \).

\[ 0.446 > 0.3438 \]

Since the test statistic is in the critical region, we reject \( H_0 \).

Conclusion: There is sufficient evidence at the 5% level to suggest that the product moment correlation coefficient is greater than zero (positive correlation).

(3 marks)

Step 2: Interpreting Correlation (Part b)

The new correlation coefficient for the coded data is \( 0.882 \).

This value is closer to 1 than the original correlation of \( 0.446 \). This indicates a stronger linear relationship between \( \log x \) and \( \log y \), which implies the non-linear relationship (power law) fits the original data better than a linear one.

(1 mark)

Step 3: Deriving the Model (Part c)

Why we do this: We need to convert the linear log-log model back to the original variables \( y \) and \( x \).

Model: \( c = -1.82 + 0.89m \)

Substitutions: \( c = \log_{10} y \) and \( m = \log_{10} x \)

\[ \log_{10} y = -1.82 + 0.89 \log_{10} x \]

Use laws of logs: \( k \log x = \log (x^k) \)

\[ \log_{10} y = -1.82 + \log_{10} (x^{0.89}) \]

Remove logs by taking powers of 10:

\[ y = 10^{-1.82 + \log_{10} (x^{0.89})} \] \[ y = 10^{-1.82} \times 10^{\log_{10} (x^{0.89})} \] \[ y = 10^{-1.82} \times x^{0.89} \]

Calculate \( a = 10^{-1.82} \):

\[ a \approx 0.015135… \]

So the equation is:

\[ y = 0.015 x^{0.89} \]

Where \( a \approx 0.015 \) and \( n = 0.89 \).

(5 marks)

Final Answer:

\( y = 0.015 x^{0.89} \)

(Or \( a = 0.015, n = 0.89 \))

✓ Total: 9 marks

↑ Back to Top

Question 4 (9 marks)

Magali is studying the mean total cloud cover, in oktas, for Leuchars in 1987 using data from the large data set. The daily mean total cloud cover for all 184 days from the large data set is summarised in the table below.

Daily mean total cloud cover (oktas) 012345678
Frequency (number of days) 01471030525228

One of the 184 days is selected at random.

(a) Find the probability that it has a daily mean total cloud cover of 6 or greater.

Magali is investigating whether the daily mean total cloud cover can be modelled using a binomial distribution.

She uses the random variable \( X \) to denote the daily mean total cloud cover and believes that \( X \sim B(8, 0.76) \).

Using Magali’s model,

(b) (i) find \( P(X \ge 6) \)

(ii) find, to 1 decimal place, the expected number of days in a sample of 184 days with a daily mean total cloud cover of 7

(c) Explain whether or not your answers to part (b) support the use of Magali’s model.

There were 28 days that had a daily mean total cloud cover of 8.

For these 28 days the daily mean total cloud cover for the following day is shown in the table below.

Daily mean total cloud cover (oktas) 012345678
Frequency (number of days) 001121599

(d) Find the proportion of these days when the daily mean total cloud cover was 6 or greater.

(e) Comment on Magali’s model in light of your answer to part (d).

Worked Solution

Step 1: Probability from Data (Part a)

We need the proportion of days with cloud cover 6, 7, or 8.

Frequencies: 6 (52 days), 7 (52 days), 8 (28 days).

\[ P(\text{Cover} \ge 6) = \frac{52 + 52 + 28}{184} \] \[ = \frac{132}{184} \approx 0.717 \]

(1 mark)

Step 2: Binomial Model Calculations (Part b)

Model: \( X \sim B(8, 0.76) \)

(i) Find \( P(X \ge 6) \):

Using calculator (Cumulative Binomial CD) or formula.

\[ P(X \ge 6) = 1 – P(X \le 5) \]

Or sum individual probabilities \( P(X=6) + P(X=7) + P(X=8) \).

\[ P(X \ge 6) \approx 0.703 \]

(ii) Expected number of days with 7:

First find \( P(X = 7) \).

\[ P(X=7) = \binom{8}{7} (0.76)^7 (0.24)^1 \approx 0.2811 \]

Expected Number = \( n \times p = 184 \times 0.2811 \)

\[ E(7s) = 184 \times P(X=7) \approx 51.7 \]

(4 marks)

Step 3: Evaluating the Model (Part c)

Compare the data results with the model results.

  • Data \( P(\ge 6) \approx 0.717 \). Model \( P(\ge 6) \approx 0.703 \). (Close)
  • Data observed days with 7: 52. Model expected days: 51.7. (Very close)

The probabilities and expected frequencies are very similar to the observed data, so the model is supported.

(1 mark)

Step 4: Conditional Data Analysis (Part d & e)

(d) Proportion for following days:

Table shows days following an 8. Total = 28.

Days with 6, 7, or 8: \( 5 + 9 + 9 = 23 \).

\[ \text{Proportion} = \frac{23}{28} \approx 0.821 \]

(e) Comment on Model:

The overall probability of \( \ge 6 \) is \( 0.717 \). However, given the previous day was an 8, the probability rises to \( 0.821 \).

This suggests that the cloud cover on one day is dependent on the previous day (persistence). The Binomial model assumes independent trials. Therefore, the Binomial model may not be suitable because independence does not hold.

(3 marks)

Final Answers:

(a) \( 0.717 \)

(b)(i) \( 0.703 \)

(b)(ii) \( 51.7 \)

(d) \( 0.821 \)

✓ Total: 9 marks

↑ Back to Top

Question 5 (13 marks)

A machine puts liquid into bottles of perfume. The amount of liquid put into each bottle, \( D \) ml, follows a normal distribution with mean 25 ml.

Given that 15% of bottles contain less than 24.63 ml,

(a) find, to 2 decimal places, the value of \( k \) such that \( P(24.63 < D < k) = 0.45 \).

A random sample of 200 bottles is taken.

(b) Using a normal approximation, find the probability that fewer than half of these bottles contain between 24.63 ml and \( k \) ml.

The machine is adjusted so that the standard deviation of the liquid put in the bottles is now 0.16 ml.

Following the adjustments, Hannah believes that the mean amount of liquid put in each bottle is less than 25 ml.

She takes a random sample of 20 bottles and finds the mean amount of liquid to be 24.94 ml.

(c) Test Hannah’s belief at the 5% level of significance. You should state your hypotheses clearly.

Worked Solution

Step 1: Finding Standard Deviation and k (Part a)

Given: \( D \sim N(25, \sigma^2) \).

\( P(D < 24.63) = 0.15 \).

Find \( \sigma \): Standardize.

\[ Z = \frac{X – \mu}{\sigma} \]

From tables/calculator, \( P(Z < z) = 0.15 \Rightarrow z \approx -1.0364 \).

\[ \frac{24.63 – 25}{\sigma} = -1.0364 \] \[ \frac{-0.37}{\sigma} = -1.0364 \] \[ \sigma = \frac{0.37}{1.0364} \approx 0.357 \]

Find \( k \):

We are told \( P(24.63 < D < k) = 0.45 \).

We know \( P(D < 24.63) = 0.15 \).

So \( P(D < k) = P(D < 24.63) + 0.45 = 0.15 + 0.45 = 0.60 \).

Find \( z \) such that \( P(Z < z) = 0.60 \). From tables \( z \approx 0.2533 \).

\[ \frac{k – 25}{0.357} = 0.2533 \] \[ k = 25 + (0.357 \times 0.2533) \] \[ k \approx 25.09 \]

(5 marks)

Step 2: Normal Approximation to Binomial (Part b)

Define Variable: Let \( Y \) be the number of bottles between 24.63 and \( k \).

Probability \( p = 0.45 \) (from part a). Sample \( n = 200 \).

\( Y \sim B(200, 0.45) \).

Question: “Fewer than half”. So \( P(Y < 100) \).

Approximation: \( n \) is large, so approximate with Normal.

Mean \( \mu = np = 200 \times 0.45 = 90 \).

Variance \( \sigma^2 = np(1-p) = 90 \times 0.55 = 49.5 \).

So \( W \sim N(90, 49.5) \).

Continuity Correction: \( P(Y < 100) \) becomes \( P(W < 99.5) \).

\[ P(W < 99.5) = P\left(Z < \frac{99.5 - 90}{\sqrt{49.5}}\right) \] \[ = P(Z < 1.350...) \]

Using calculator: \( P(Z < 1.35) \approx 0.9115 \)

\[ = 0.912 \text{ (3 s.f.)} \]

(3 marks)

Step 3: Hypothesis Test (Part c)

Hypotheses:

  • \( H_0: \mu = 25 \)
  • \( H_1: \mu < 25 \) (Hannah's belief)

Distribution of Sample Mean: \( \bar{X} \sim N(\mu, \frac{\sigma^2}{n}) \).

\( \sigma = 0.16 \), \( n = 20 \).

Standard Error \( = \frac{0.16}{\sqrt{20}} \).

Test Statistic: Observed mean \( \bar{x} = 24.94 \).

\[ Z = \frac{24.94 – 25}{\frac{0.16}{\sqrt{20}}} \] \[ Z = \frac{-0.06}{0.03577…} \approx -1.677 \]

Critical Value: 5% level one-tailed. \( z_{crit} = -1.645 \).

Comparison: \( -1.677 < -1.645 \). The test statistic is in the critical region.

Alternatively find \( p \)-value: \( P(Z < -1.677) \approx 0.0468 \). Since \( 0.0468 < 0.05 \), reject \( H_0 \).

Conclusion: There is sufficient evidence to support Hannah’s belief that the mean amount is less than 25 ml.

(5 marks)

Final Answers:

(a) \( k = 25.09 \)

(b) \( 0.912 \)

(c) Reject \( H_0 \). Evidence supports belief.

✓ Total: 13 marks

↑ Back to Top