If any of my solutions look wrong, please refer to the mark scheme. You can exit full-screen mode for the question paper and mark scheme by clicking the icon in the bottom-right corner or by pressing Esc on your keyboard.

Edexcel A Level Statistics 2021 (Paper 31)

๐Ÿ’ก How to use this Interactive Exam

  • Try it first: Attempt each question on paper before checking the solution.
  • Three-Layer Solutions: We explain Why we do it, How we do it, and What the result means.
  • Navigation: Use the “Back to Top” button to return to the Table of Contents.
  • Calculator: This paper allows calculators. We show the setup and the result.

Question 1 (7 marks)

(a) State one disadvantage of using quota sampling compared with simple random sampling.

In a university 8% of students are members of the university dance club.

A random sample of 36 students is taken from the university.

The random variable \( X \) represents the number of these students who are members of the dance club.

(b) Using a suitable model for \( X \), find

(i) \( P(X = 4) \)

(ii) \( P(X \geqslant 7) \)

Only 40% of the university dance club members can dance the tango.

(c) Find the probability that a student is a member of the university dance club and can dance the tango.

A random sample of 50 students is taken from the university.

(d) Find the probability that fewer than 3 of these students are members of the university dance club and can dance the tango.

โ†‘ Back to Top

Question 2 (6 marks)

Marc took a random sample of 16 students from a school and for each student recorded:

  • the number of letters, \( x \), in their last name
  • the number of letters, \( y \), in their first name

His results are shown in the scatter diagram below.

Number of letters in last name (x) First name (y) 0 2 4 6 8 10 12 0 2 4 6 8 10

(a) Describe the correlation between \( x \) and \( y \).

Marc suggests that parents with long last names tend to give their children shorter first names.

(b) Using the scatter diagram comment on Marcโ€™s suggestion, giving a reason for your answer.

The results from Marcโ€™s random sample of 16 observations are given in the table below.

\( x \) 3 6 8 7 5 3 11 3 4 5 4 9 7 10 6 6
\( y \) 7 7 4 4 6 8 5 5 8 4 7 4 5 5 6 3

(c) Use your calculator to find the product moment correlation coefficient between \( x \) and \( y \) for these data.

(d) Test whether or not there is evidence of a negative correlation between the number of letters in the last name and the number of letters in the first name. You should:

  • state your hypotheses clearly
  • use a 5% level of significance
โ†‘ Back to Top

Question 3 (8 marks)

Stav is studying the large data set for September 2015.

He codes the variable Daily Mean Pressure, \( x \), using the formula \( y = x – 1010 \).

The data for all 30 days from Hurn are summarised by:

\[ \sum y = 214 \quad \quad \sum y^2 = 5912 \]

(a) State the units of the variable \( x \).

(b) Find the mean Daily Mean Pressure for these 30 days.

(c) Find the standard deviation of Daily Mean Pressure for these 30 days.

Stav knows that, in the UK, winds circulate:

  • in a clockwise direction around a region of high pressure
  • in an anticlockwise direction around a region of low pressure

The table gives the Daily Mean Pressure for 3 locations from the large data set on 26/09/2015:

Location Heathrow Hurn Leuchars
Daily Mean Pressure 1029 1028 1028
Cardinal Wind Direction

The Cardinal Wind Directions for these 3 locations on 26/09/2015 were, in random order:

W     NE     E

You may assume that these 3 locations were under a single region of pressure.

(d) Using your knowledge of the large data set, place each of these Cardinal Wind Directions in the correct location in the table. Give a reason for your answer.

โ†‘ Back to Top

Question 4 (11 marks)

A large college produces three magazines: Green issues (\(G\)), Equality (\(E\)), and Sports (\(S\)).

The Venn diagram below gives the probability for each subset, where \(p, q, r\) and \(t\) are probabilities.

G E S 0.08 0.09 0.36 0.05 p q r t

(a) Find the proportion of students in the college who read exactly one of these magazines.

No students read all three magazines and \( P(G) = 0.25 \).

(b) Find

(i) the value of \( p \)

(ii) the value of \( q \)

Given that \( P(S | E) = \frac{5}{12} \),

(c) find

(i) the value of \( r \)

(ii) the value of \( t \)

(d) Determine whether or not the events \( (S \cap E’) \) and \( G \) are independent. Show your working clearly.

โ†‘ Back to Top

Question 5 (11 marks)

The heights of females from a country are normally distributed with:

  • a mean of 166.5 cm
  • a standard deviation of 6.1 cm

Given that 1% of females from this country are shorter than \( k \) cm,

(a) find the value of \( k \).

(b) Find the proportion of females from this country with heights between 150 cm and 175 cm.

A female from this country is chosen at random from those with heights between 150 cm and 175 cm.

(c) Find the probability that her height is more than 160 cm.

The heights of females from a different country are normally distributed with a standard deviation of 7.4 cm.

Mia believes that the mean height of females from this country is less than 166.5 cm.

Mia takes a random sample of 50 females from this country and finds the mean of her sample is 164.6 cm.

(d) Carry out a suitable test to assess Miaโ€™s belief. You should:

  • state your hypotheses clearly
  • use a 5% level of significance
โ†‘ Back to Top

Question 6 (7 marks)

The discrete random variable \( X \) has the following probability distribution:

\( x \) \( a \) \( b \) \( c \)
\( P(X = x) \) \( \log_{36} a \) \( \log_{36} b \) \( \log_{36} c \)

where

  • \( a, b \) and \( c \) are distinct integers (\( a < b < c \))
  • all the probabilities are greater than zero

(a) Find

(i) the value of \( a \)

(ii) the value of \( b \)

(iii) the value of \( c \)

Show your working clearly.

The independent random variables \( X_1 \) and \( X_2 \) each have the same distribution as \( X \).

(b) Find \( P(X_1 = X_2) \)

โ†‘ Back to Top