Probing Questions: Median from a List of Data

Probing Questions

Median from a List of Data

Questions designed to stretch thinking, reveal misconceptions, and spark mathematical reasoning.

💬

Convince Me That…

Students must construct a mathematical argument for why each statement is true.

Convince me that the median of 7, 2, 9, 3, 5 is 5 and not 9

💡 Possible Argument

To find the median, the first step is always to arrange the values in order from smallest to largest: 2, 3, 5, 7, 9. With 5 values, the median is the value in position (5 + 1) ÷ 2 = 3rd, which is 5. A student who says the median is 9 has likely picked the 3rd value from the list as it was written down (7, 2, 9, 3, 5) without ordering first.

This reveals the “reading off the unordered list” misconception — students treat the list order as meaningful when, for the median, only the sorted order matters. You could show the same five numbers written in a different order (e.g. 5, 9, 3, 2, 7) and ask whether the median has changed — it hasn’t, because the values are the same.

Convince me that the median of 3, 5, 8, 12 is 6.5 even though 6.5 does not appear in the data

💡 Possible Argument

Ordering the values gives 3, 5, 8, 12. There are 4 values (an even count), so no single value sits exactly in the middle. The median is found by taking the mean of the 2nd and 3rd values: (5 + 8) ÷ 2 = 6.5. This is the correct median even though 6.5 is not one of the original data values.

Students often believe the “median must be a value in the data set” misconception. This is true when the data set has an odd number of values, but with an even count the median is the average of the two central values and can fall between them. This question also targets students who don’t know the procedure for even-count data sets — some pick just one of the two middle values (answering 5 or 8) rather than averaging them.

Convince me that a data set can have a mean of 20 but a median of 6

💡 Possible Argument

Consider the data set 1, 2, 6, 10, 81. The values in order are 1, 2, 6, 10, 81. The median (middle value of 5) is 6. The mean is (1 + 2 + 6 + 10 + 81) ÷ 5 = 100 ÷ 5 = 20. So the mean is 20 and the median is 6 — very different. The large outlier (81) pulls the mean upward but has no effect on which value sits in the middle position.

This targets the “mean and median are the same thing” misconception. Many students conflate the two averages or assume they must always be close. The key insight is that the mean is sensitive to extreme values while the median is not — one very large or very small value can drag the mean far from the median without affecting the median at all.

Convince me that the median of 3, 3, 3, 7, 10 is 3 and not 7

💡 Possible Argument

The values in order are: 3, 3, 3, 7, 10. With 5 values, the median is the 3rd value, which is 3. It doesn’t matter that 3 appears three times — every occurrence counts as a separate data point when determining position. A student who answers 7 may have collapsed the duplicates into the set {3, 7, 10} and found the middle of those three distinct values instead.

This reveals the “remove duplicates before finding the median” misconception. Students sometimes treat repeated values as a single entry, thinking “I’ve already counted 3.” But in statistics, every data point occupies its own position. The five values represent five observations, and all five must be included when finding the middle position. If a survey of 5 people finds that 3 of them scored 3, those are three separate results.

🎯

Give an Example Of…

Think carefully — the fourth box is a trap! Give a non-example that looks right but isn’t.

Give an example of a set of 5 numbers where the median is 8

An example

Another example

One no-one else will think of

A sneaky non-example

💡 Possible Answers

Example: {3, 6, 8, 11, 14} — ordered, the 3rd value is 8.

Another: {1, 4, 8, 8, 20} — the 3rd value is 8.

Creative: {8, 8, 8, 8, 8} — every value is 8, so the median is trivially 8. Or {−100, −5, 8, 999, 1000} — extreme surrounding values don’t affect the middle position.

Trap: {2, 4, 6, 10, 18} — a student might offer this thinking it works because the mean is (2 + 4 + 6 + 10 + 18) ÷ 5 = 40 ÷ 5 = 8. But the median is the 3rd value = 6, not 8. This exploits the “confusing mean with median” misconception.

Give an example of a set of 6 numbers where the median is 10

An example

Another example

One no-one else will think of

A sneaky non-example

💡 Possible Answers

Example: {3, 7, 9, 11, 14, 20} — the 3rd and 4th values are 9 and 11. Median = (9 + 11) ÷ 2 = 10.

Another: {1, 5, 10, 10, 15, 25} — the 3rd and 4th values are both 10. Median = (10 + 10) ÷ 2 = 10.

Creative: {−50, −1, 8, 12, 100, 200} — median = (8 + 12) ÷ 2 = 10. The wild surrounding values are irrelevant.

Trap: {2, 5, 10, 13, 17, 19} — a student might choose this because 10 appears in the data. But the 3rd and 4th values are 10 and 13, so the median = (10 + 13) ÷ 2 = 11.5, not 10. This exploits the “the median is just a value I can see in the list” misconception — for even-count data, you must average the two middle values, not just pick one that looks central.

Give an example of a set of 5 numbers where the mean and the median are different

An example

Another example

One no-one else will think of

A sneaky non-example

💡 Possible Answers

Example: {1, 2, 3, 4, 20} — median = 3, mean = 30 ÷ 5 = 6. Different.

Another: {1, 1, 2, 10, 11} — median = 2, mean = 25 ÷ 5 = 5. Different.

Creative: {1, 1, 1, 1, 96} — median = 1, mean = 100 ÷ 5 = 20. A single outlier creates a huge gap between mean and median.

Trap: {2, 4, 6, 8, 10} — a student might offer any “nice” set of numbers, but for this symmetric set: median = 6, mean = 30 ÷ 5 = 6. They are the same! This exploits the “any set of different numbers will have different mean and median” misconception — symmetric or evenly spaced data sets always have mean = median.

4 ✦

Give an example of a set of 5 numbers where the median equals the smallest value

An example

Another example

One no-one else will think of

A sneaky non-example

💡 Possible Answers

Example: {4, 4, 4, 7, 10} — ordered: 4, 4, 4, 7, 10. Median = 3rd value = 4. Smallest = 4.

Another: {1, 1, 1, 5, 8} — median = 1 = smallest.

Creative: {0, 0, 0, 0, 1000} — median = 0 = smallest. The first three values being equal forces the median down to the minimum, no matter how large the remaining values are.

Trap: {3, 3, 5, 7, 9} — a student might offer this because 3 is the smallest and appears twice, so it “feels” dominant. But the median is the 3rd value = 5, not 3. Another excellent trap is {5, 5, 2, 7, 9}. A student might think “5 appears the most and is the median, so the lowest distinct number is 2… wait, 2 is the smallest.” This exploits the “confusing frequency (mode) with position (median)” misconception and forces them to reconcile the unordered list trap with the smallest value trap simultaneously.

⚖️

Always, Sometimes, Never

Is the statement always true, sometimes true, or never true? Students should justify their decision with examples.

The median of a data set changes if you rearrange the order the values are written in.

NEVER

The median depends only on the values in the data set, not on the order they happen to be written. Rearranging {4, 1, 7} into {7, 4, 1} or {1, 7, 4} does not change the sorted order (1, 4, 7) and therefore does not change the median (4). This is because finding the median always starts with sorting — the presentation order is discarded.

Students who believe this is sometimes or always true are revealing the “the list order matters” misconception — the same error that leads students to pick the middle item from an unordered list. Establishing that the median is a property of the set of values (not their arrangement) is foundational.

The median of a data set is one of the values in the data set.

SOMETIMES

True case: {1, 3, 5, 7, 9} has 5 values. The median is the 3rd value = 5, which is in the data set. This is always the case for data sets with an odd number of values — the median is literally one of the values.

False case: {2, 3, 8, 11} has 4 values. The median = (3 + 8) ÷ 2 = 5.5, which does not appear in the data set. When the data has an even number of values and the two middle values differ, the median falls between them and is not in the data set. This targets the “median must be in the data” misconception.

In a data set with 7 values, the median is the 4th value when the data is ordered.

ALWAYS

With 7 values, the correct median position is found using the formula (n + 1) ÷ 2. Here, (7 + 1) ÷ 2 = 4th. This means 3 values sit below the median and 3 sit above it, confirming it is the true middle. This is always the case regardless of what the 7 values are.

Students who get this wrong often use the “wrong position formula” misconception, such as n ÷ 2 = 3.5 (leading them to arbitrarily pick the 3rd or 4th value with no clear rule), or simply counting to “roughly the middle” without a systematic method. Establishing the (n + 1) ÷ 2 formula and verifying it with small cases builds reliable technique.

The median of a set of whole numbers is a whole number.

SOMETIMES

True case: {1, 3, 5} — median = 3, a whole number.

False case: {1, 2, 5, 8} — median = (2 + 5) ÷ 2 = 3.5, not a whole number. This happens whenever the data set has an even number of values and the two middle values add to an odd total. Students who say “always” are revealing the “whole-number inputs always give whole-number outputs” misconception — they forget that the averaging step can produce a decimal.

If you add an extra value to a data set, the median will change.

SOMETIMES

True case: For the data set {1, 2, 3}, the median is 2. If we add the value 4, the set becomes {1, 2, 3, 4} and the median changes to 2.5.

False case: For the data set {1, 5, 5, 5, 9}, the median is 5. If we add another 5, the set becomes {1, 5, 5, 5, 5, 9} and the median remains 5. This tests a student’s structural understanding of position and how dynamic data affects central tendency.

🔴

Odd One Out

Which is the odd one out? Can you make a case for each one? There’s no single right answer!

Which is the odd one out?

{1, 3, 5, 7, 9}

{1, 3, 7, 9}

{1, 5, 5, 5, 9}

💡 A Case for Each

{1, 3, 5, 7, 9} is the odd one out — it’s the only set where the median is a unique, unrepeated value in the data set (Median = 5).

{1, 3, 7, 9} is the odd one out — it’s the only set where the median is not a value in the original data set (Median = 5).

{1, 5, 5, 5, 9} is the odd one out — it’s the only set where the median is also the mode (Median = 5, Mode = 5).

Which is the odd one out?

{1, 2, 3, 4, 100}

{3, 3, 7, 9, 13}

{10, 20, 30, 40, 50}

💡 A Case for Each

{1, 2, 3, 4, 100} is the odd one out — it’s the only set where the mean and the median are not equal (mean = 22, median = 3).

{3, 3, 7, 9, 13} is the odd one out — it’s the only set that contains a repeated value (3 appears twice).

{10, 20, 30, 40, 50} is the odd one out — it’s the only set where the values are evenly spaced (they increase by 10 each time).

Which is the odd one out?

{2, 4, 6, 8, 10}

{2, 4, 6, 8, 100}

{2, 4, 10, 12, 100}

💡 A Case for Each

{2, 4, 6, 8, 10} is the odd one out — it’s the only set where the mean and median are exactly the same (both are 6).

{2, 4, 6, 8, 100} is the odd one out — it’s the only set where the median is completely unaffected by an extreme outlier (the median is 6, identical to the first set despite the 100).

{2, 4, 10, 12, 100} is the odd one out — it’s the only set where removing the largest value changes the median (it drops from 10 to 7).

🔍

Explain the Mistake

Each example contains a deliberate error targeting a common misconception. Can you find where and why the reasoning goes wrong?

Find the median of 12, 4, 8, 1, 6

A student writes:

Answer: 8

Reasoning: “There are 5 numbers so the median is the 3rd one. I counted along and the 3rd number is 8.”

🔍 The Mistake

The student has applied the “pick the middle from the unordered list” misconception. They correctly identified that the median of 5 values is the 3rd, but they counted from the list as it was written (12, 4, 8, 1, 6) without sorting first.

The correct approach is to order the values: 1, 4, 6, 8, 12. The 3rd value in the ordered list is 6, not 8. Finding the median always requires sorting the data first — the order the numbers were originally written in is irrelevant.

Find the median of 6, 1, 4, 9, 2

A student writes:

Answer: 4 ✓

Reasoning: “The median is the middle number. There are 5 numbers so I pick the 3rd one, which is 4.”

🔍 The Mistake

The student got the correct answer (4) but through the “pick the middle from the unordered list” misconception. They picked the 3rd value from the list as written (6, 1, 4, 9, 2) without ordering. The correct method is to sort first: 1, 2, 4, 6, 9 — the 3rd value happens to also be 4, so the answer is coincidentally correct.

This reasoning would fail on many other lists — for example, applied to the list 9, 1, 3, 7, 5, the student would answer 3, but the correct median is 5 (after ordering: 1, 3, 5, 7, 9). Getting the right answer by luck doesn’t mean the method is sound.

Find the median of 2, 5, 8, 11

A student writes:

Answer: 5

Reasoning: “I put them in order: 2, 5, 8, 11. There are 4 numbers. To find the middle I did 4 ÷ 2 = 2, so the median is the 2nd value, which is 5.”

🔍 The Mistake

The student has used the “wrong position formula for even-count data” misconception. They divided the count by 2 and used the result as a position (2nd value = 5), rather than averaging the two central values.

The student’s formula (n ÷ 2) only identifies one of the two middle positions. The correct formula to find the median position is (n + 1) ÷ 2. For even-count data, this will give a decimal result (like 2.5), which reminds you to average the values at the 2nd and 3rd positions. Here that gives values 5 and 8, so the true median is 6.5.

Find the median of 5, 5, 5, 8, 12

A student writes:

Answer: 8

Reasoning: “I crossed out the repeated 5s so the list becomes 5, 8, 12. The middle of those three is 8.”

🔍 The Mistake

The student has applied the “remove duplicates before finding the median” misconception. They reduced {5, 5, 5, 8, 12} to the set of distinct values {5, 8, 12} and then found the median of that reduced set.

But in statistics, every data point counts — the original data set has 5 values, not 3. Keeping all five values in order: 5, 5, 5, 8, 12. The median is the 3rd value = 5. Each occurrence of 5 represents a separate observation and must be included. If three students scored 5 on a test, those are three separate results, not one.

Find the median of −4, 2, 0, −1, 5

A student writes:

Answer: 0.5

Reasoning: “I ordered them as −4, −1, 2, 5. I ignored the 0 because it means nothing. The middle is (−1 + 2) ÷ 2 = 0.5.”

🔍 The Mistake

The student has applied the “zero is nothing” misconception. They treated 0 as an absence of data rather than a valid data point occupying a position in the list.

Zero must be included when finding the median. The correct ordered list of all 5 values is −4, −1, 0, 2, 5. The median is the 3rd value, which is 0.

Find the median of 2, 4, 6, 8, 10

A student writes:

Answer: 6 ✓

Reasoning: “I added them all up to get 30, then divided by 5 because there are 5 numbers. 30 ÷ 5 = 6.”

🔍 The Mistake

The student got the correct answer (6) but through the “confusing mean with median” misconception. They calculated the mean by summing the values and dividing by the count.

Because this specific data set is perfectly symmetrical, the mean and median happen to be exactly the same. However, this method will fail on almost any other data set. This highlights the danger of arriving at the right answer using the wrong mathematical method.