Question 24·Hard·One-Variable Data Distributions; Measures of Center and Spread
Data set R contains 80 real numbers. Data set S is obtained by first adding 15 to every number in R and then doubling each of the resulting values.
Which of the following measures must be greater for data set S than for data set R?
I. The mean
II. The median
III. The interquartile range
For questions about how data transformations affect statistics, first rewrite the transformation as a formula (like ) and recall the general rules: adding a constant shifts center but not spread, while multiplying by a positive constant scales both center and spread. Then translate what happens to the mean, median, and IQR algebraically or conceptually, and pay close attention to words like “must” or “always”—use simple or extreme counterexamples (such as all values being equal or all very negative) to quickly test whether each statement is guaranteed for every possible data set.
Hints
Write the transformation as a formula
Let a number in R be . After you add 15 and then double, what is the new value in terms of ?
Think about how adding and multiplying affect center and spread
What happens to the mean and median if you add the same number to every data point? What happens if you then multiply every point by the same positive number? What about the IQR, which measures spread?
Use simple or extreme examples to test “must”
Try a very simple data set R, like when all 80 numbers are the same, or when they are all very negative. After transforming to S, compare the mean, median, and IQR to those of R. Do they always become larger?
Remember what “must” means
For a statement to be true under a “must” condition, it has to hold for every possible data set R, not just for some or most. A single counterexample is enough to show it is not guaranteed.
Desmos Guide
Define an example data set R in Desmos
In Desmos, enter a sample list for R, such as R = [-40, -40, -40, -40, -40] (you can use 5 numbers instead of 80; the behavior of the statistics is the same).
Create the transformed data set S
Define S = 2*(R + 15) in Desmos. This applies the transformation to every element of R.
Compare the means and medians
Use Desmos’s list statistics: type mean(R), mean(S), median(R), and median(S) in separate lines and observe the numerical values. Check whether the mean and median of S are always larger than those of R for this and other test lists you try.
Compare the IQRs
Compute the interquartile ranges with iqr(R) and iqr(S) (or by using quartile3 - quartile1 if needed). Try a data set where all values in R are the same and another where they are different, and observe whether the IQR in S is strictly greater in each case.
Step-by-step Explanation
Express the transformation from R to S
Each number in data set R is changed to a new number in S by:
So going from R to S means: add 15 to every value, then multiply every result by 2 (a shift, then a stretch).
Understand how mean and median change under this linear transformation
For any data set, when you:
-
Add a constant to every value:
- The mean increases by .
- The median increases by .
- The spread measures like IQR stay the same.
-
Multiply every value by a positive constant :
- The mean is multiplied by .
- The median is multiplied by .
- The IQR is multiplied by .
Here, going from R to S is the combined effect: first add 15, then multiply by 2. If the mean of R is and the median of R is , then for S:
- Mean of S .
- Median of S .
Check whether the mean and median are guaranteed to be greater
To see if the mean of S must be greater than the mean of R, compare and .
We want to know when
Subtract from both sides:
So the mean of S is greater than the mean of R only when the mean of R is greater than .
But the problem does not restrict the numbers in R; the 80 real numbers could easily have a mean less than or equal to . For example, if all 80 numbers in R are , then:
- Mean of R .
- Each value in S is , so mean of S .
Here the mean in S is actually smaller, so statement I (mean must be greater) is not always true.
The exact same algebra and example work for the median. If the median of R is , then median of S is , which is greater than only when . If all values are , the median in R is and in S is , so the median does not have to be greater either. So statement II is also not always true.
Analyze how the interquartile range (IQR) changes
Let the interquartile range (IQR) of R be .
- Adding 15 to every value does not change distances between values, so IQR stays .
- Then multiplying every value by 2 multiplies all distances by 2, so IQR becomes .
So the IQR of S is . To ask whether it must be greater than the IQR of R is to ask if we must have .
- If (the data are not all the same), then .
- But if (all 80 numbers in R are identical), then , so IQR of S equals IQR of R, not greater.
Using the same example as before (all 80 values equal ):
- In R, all values are the same, so IQR of R .
- In S, all values are also the same (), so IQR of S .
Thus statement III (IQR must be greater) is not always true either.
Choose the correct option based on the “must” condition
The question asks which measures must be greater for every possible data set R.
- I (mean) is not guaranteed to be greater.
- II (median) is not guaranteed to be greater.
- III (IQR) is not guaranteed to be greater.
Since none of I, II, or III must always be greater, the correct answer choice is D) None of the above.