Question 45·Hard·One-Variable Data Distributions; Measures of Center and Spread
A data set of 48 different numbers has a mean of 100 and a median of 100. A new data set is created by adding 15 to each of the 24 numbers that are greater than the median and subtracting 15 from each of the 24 numbers that are less than the median. Which of the following measures does NOT have the same value in both the original and the new data sets?
For questions about how modifying data affects statistics, first identify whether each option measures center (like mean or median), total (sum), or spread (like interquartile range). Then analyze how the transformation changes the total sum and the relative positions of values: symmetric increases and decreases that cancel leave the sum and center unchanged, but pushing the lower and upper halves away from each other changes measures of spread. Go through each choice one by one, using this reasoning, instead of trying to reason about all measures at once.
Hints
Track what happens to the total
Think about how much the total sum changes when you add 15 to 24 numbers and subtract 15 from another 24 numbers. Do those changes cancel or accumulate?
Think about position-based vs value-based measures
Some measures depend mainly on the position of values in the ordered list (like the middle), while others depend on the actual distances between values. Which type is more likely to change when one side is pushed up and the other side is pushed down?
Consider each choice separately
For each option (median, mean, sum, interquartile range), ask: if I move all low values down by 15 and all high values up by 15, does that measure have to stay the same, or can it change?
Focus on the middle 50%
One of the answer choices is based on the spread of the middle half of the data, not just a single central value. How does stretching the lower and upper halves in opposite directions affect that spread?
Desmos Guide
Create a simple example data set
In Desmos (calculator or table), pick a small, symmetric example list to mimic the situation. For instance, in a table column or as a list, enter something like , which has a clear center.
Compute statistics for the original list
In the expression list, type:
median(L1)mean(L1)total(L1)quartile3(L1) - quartile1(L1)These give you the median, mean, sum, and interquartile range of the original data.
Model the transformation and compare
Create a new list where the lower half is decreased by 15 and the upper half is increased by 15. For example, if has 8 values, define:
L2 = [L1[1]-15, L1[2]-15, L1[3]-15, L1[4]-15, L1[5]+15, L1[6]+15, L1[7]+15, L1[8]+15]. Then compute:median(L2)mean(L2)total(L2)quartile3(L2) - quartile1(L2)Compare each of these four outputs to the corresponding ones from and note which statistic is different between and .
Step-by-step Explanation
Understand how the data are changed
There are 48 numbers: 24 are less than the median and 24 are greater than the median.
In the new data set:
- Each of the 24 smaller numbers is decreased by 15.
- Each of the 24 larger numbers is increased by 15.
So the lower half is shifted down by 15, and the upper half is shifted up by 15.
Check the effect on the sum and the mean
Each of the 24 large numbers is increased by 15, which adds to the total.
Each of the 24 small numbers is decreased by 15, which subtracts from the total.
These changes cancel out:
So the sum of all numbers stays the same. Since the number of data points (48) does not change, and the sum stays the same, the mean also stays the same.
Check the effect on the median
With 48 numbers, the median is the average of the 24th and 25th values when the data are ordered.
Call the original 24th value and the 25th value . We are told the median is 100, so
In the transformation:
- is in the lower half, so it becomes .
- is in the upper half, so it becomes .
The new median is
So the median stays the same.
Check the effect on the spread (interquartile range)
The interquartile range measures how far apart the middle half of the data are: it is , where is the first quartile and is the third quartile.
In this problem:
- All values in the lower half (including those around ) were decreased by 15, so the new is .
- All values in the upper half (including those around ) were increased by 15, so the new is .
The new interquartile range is
which is larger than before. Therefore, the one measure that does not have the same value in both data sets is the interquartile range.