/*! This file is auto-generated */ .wp-block-button__link{color:#fff;background-color:#32373c;border-radius:9999px;box-shadow:none;text-decoration:none;padding:calc(.667em + 2px) calc(1.333em + 2px);font-size:1.125em}.wp-block-file__button{background:#32373c;color:#fff;text-decoration:none} Problem 34 The three measures of center int... [FREE SOLUTION] | 91Ó°ÊÓ

91Ó°ÊÓ

The three measures of center introduced in this chapter are the mean, median, and trimmed mean. Two additional measures of center that are occasionally used are the midrange, which is the average of the smallest and largest observations, and the midfourth, which is the average of the two fourths. Which of these five measures of center are resistant to the effects of outliers and which are not? Explain your reasoning.

Short Answer

Expert verified
Resistant: Median, Trimmed Mean, and Midfourth. Not resistant: Mean and Midrange.

Step by step solution

01

Understanding Resistance to Outliers

A measure of center is resistant if it is not heavily influenced by extreme values (outliers). We will evaluate each measure to determine its resistance.
02

Evaluate Mean

The mean is calculated as the sum of all values divided by the number of values. This measure is not resistant to outliers, as a single extreme value can significantly affect it.
03

Evaluate Median

The median is the middle value of a data set when ordered from least to greatest. This measure is resistant to outliers because it relies on the positional value rather than magnitude.
04

Evaluate Trimmed Mean

A trimmed mean involves removing a certain percentage of the smallest and largest data points before averaging the remaining points. This method increases resistance to outliers, especially with a sufficient trim.
05

Evaluate Midrange

The midrange is calculated as the average of the smallest and largest observations. It is not resistant to outliers, as these extreme values directly determine the midrange.
06

Evaluate Midfourth

The midfourth is the average of the two "fourths" or quartiles in a data set. Since these points are typically not at the extremes of the data set, the midfourth is somewhat resistant to outliers.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with 91Ó°ÊÓ!

Key Concepts

These are the key concepts you need to understand to accurately answer the question.

Mean
The mean is one of the most commonly used measures of central tendency. It is calculated by adding up all the values in a data set and then dividing by the number of values. This gives you the average. For instance, if you have data points like 2, 4, 6, 8, and 10, the mean would be \( \frac{2 + 4 + 6 + 8 + 10}{5} = 6 \).

A key feature of the mean is that it uses all the data points in its computation. However, this means that it is sensitive to outliers. An outlier is a value that "lies outside" most of the other values in your data set. For example, if your data set is 2, 4, 6, 8, and 50, the mean becomes \( \frac{2 + 4 + 6 + 8 + 50}{5} = 14 \).

As you can see, the presence of 50, which is an outlier, significantly increases the mean, indicating that mean is not resistant to outliers.
Median
The median is another important measure of central tendency, which represents the middle value when a data set is arranged in ascending order. If the number of observations is even, the median is the average of the two middle numbers. For instance, in the data set 3, 5, 7, the median is 5. For an even set, such as 3, 5, 7, 9, the median would be \( \frac{5 + 7}{2} = 6 \).

Unlike the mean, the median is a positional average meaning it does not consider the magnitude of the values. As a result, it is not affected by outliers. If the data set changes to 3, 5, 50, the median remains 5. Therefore, the median is considered a resistant measure of center.
Resistance to Outliers
Resistance to outliers is a valuable property for a measure of central tendency, as it prevents the measure from being unduly affected by extreme outliers. Outliers can distort statistical analyses by pulling the measure of center towards them.

  • The mean is not resistant as it incorporates all values, including outliers, in its calculation.
  • The median is resistant because its calculation is based on the position of data points, not their value.
  • The trimmed mean, by discarding a percentage of the highest and lowest data points, gains resistance to outliers.
  • The midrange, on the other hand, relies directly on extremes, thus it is not resistant.

Understanding resistance helps in selecting the best measure when summarizing data, especially in the presence of outliers.
Trimmed Mean
The trimmed mean offers a balance between the mean and the median. This measure is calculated by removing a certain percentage of the smallest and largest values in the data set and then determining the mean of the remaining data.

For example, in a data set of 1, 3, 5, 7, and 100, a trimmed mean with 20% trimming would remove 1 and 100, calculating the mean as \( \frac{3 + 5 + 7}{3} = 5 \).

By eliminating the extremes, the trimmed mean reduces the potential distortion from outliers, thus increasing its resistance. It provides a compromise between using all data points and completely ignoring outliers, like when using the median.
Midrange
The midrange is a measure of central tendency that considers only the smallest and largest observations in a data set. It is calculated by averaging these two extremes. For example, in the data set 2, 3, 10, the midrange would be \( \frac{2 + 10}{2} = 6 \).

Due to its reliance on the most extreme values, the midrange is highly sensitive to outliers. If an outlier is present, it can significantly skew the midrange. For example, if the data set changes to 2, 3, 100, the midrange becomes \( \frac{2 + 100}{2} = 51 \), demonstrating a substantial shift.

This sensitivity makes the midrange a less reliable measure in the presence of outliers, compared to more resistant measures like the median or trimmed mean.

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

How does the speed of a runner vary over the course of a marathon (a distance of \(42.195 \mathrm{~km}\) )? Consider determining both the time to run the first \(5 \mathrm{~km}\) and the time to run between the \(35-\mathrm{km}\) and \(40-\mathrm{km}\) points, and then subtracting the former time from the latter time. A positive value of this difference corresponds to a runner slowing down toward the end of the race. The accompanying histogram is based on times of runners who participated in several different Japanese marathons ("Factors Affecting Runners' Marathon Performance," Chance, Fall, 1993: 24-30). What are some interesting features of this histogram? What is a typical difference value? Roughly what proportion of the runners ran the late distance more quickly than the early distance?

In a famous experiment carried out in 1882 , Michelson and Newcomb obtained 66 observations on the time it took for light to travel between two locations in Washington, D.C. A few of the measurements (coded in a certain manner) were \(31,23,32,36,-2,26,27\), and 31 . a. Why are these measurements not identical? b. Is this an enumerative study? Why or why not?

In a study of author productivity ("Lotka's Test," Collection \(M g m t\)., 1982: 111-118), a large number of authors were classified according to the number of articles they had published during a certain period. The results were presented in the accompanying frequency distribution: Number \(\begin{array}{lrrrrrrrr}\text { of papers } & 1 & 2 & 3 & 4 & 5 & 6 & 7 & 8 \\\ \text { Frequency } & 784 & 204 & 127 & 50 & 33 & 28 & 19 & 19\end{array}\) Number \(\begin{array}{lrrrrrrrrr}\text { of papers } & 9 & 10 & 11 & 12 & 13 & 14 & 15 & 16 & 17 \\ \text { Frequency } & 6 & 7 & 6 & 7 & 4 & 4 & 5 & 3 & 3\end{array}\) a. Construct a histogram corresponding to this frequency distribution. What is the most interesting feature of the shape of the distribution? b. What proportion of these authors published at least five papers? At least ten papers? More than ten papers? c. Suppose the five \(15 \mathrm{~s}\), three \(16 \mathrm{~s}\), and three \(17 \mathrm{~s}\) had been lumped into a single category displayed as " \(\geq 15\)." Would you be able to draw a histogram? Explain. d. Suppose that instead of the values 15,16 , and 17 being listed separately, they had been combined into a \(15-17\) category with frequency 11. Would you be able to draw a histogram? Explain.

The article "A Thin-Film Oxygen Uptake Test for the Evaluation of Automotive Crankcase Lubricants" (Lubric. Engr., 1984: 75-83) reported the following data on oxidation-induction time (min) for various commercial oils: \(\begin{array}{rrrrrrrrrrr}87 & 103 & 130 & 160 & 180 & 195 & 132 & 145 & 211 & 105 & 145 \\ 153 & 152 & 138 & 87 & 99 & 93 & 119 & 129 & & & \end{array}\) \(\begin{array}{lllllllll}153 & 152 & 138 & 87 & 99 & 93 & 119 & 129\end{array}\) a. Calculate the sample variance and standard deviation. b. If the observations were reexpressed in hours, what would be the resulting values of the sample variance and sample standard deviation? Answer without actually performing the reexpression.

A sample of 77 individuals working at a particular office was selected and the noise level (dBA) experienced by each individual was determined, yielding the following data ("Acceptable Noise Levels for Construction Site Offices," Building Serv. Engr. Research and Technology, 2009: 87-94). \(\begin{array}{lllllllllllll}55.3 & 55.3 & 55.3 & 55.9 & 55.9 & 55.9 & 55.9 & 56.1 & 56.1 & 56.1 & 56.1 \\ 56.1 & 56.1 & 56.8 & 56.8 & 57.0 & 57.0 & 57.0 & 57.8 & 57.8 & 57.8 & 57.9 \\ 57.9 & 57.9 & 58.8 & 58.8 & 58.8 & 59.8 & 59.8 & 59.8 & 62.2 & 62.2 & 63.8 \\ 63.8 & 63.8 & 63.9 & 63.9 & 63.9 & 64.7 & 64.7 & 64.7 & 65.1 & 65.1 & 65.1 \\ 65.3 & 65.3 & 65.3 & 65.3 & 67.4 & 67.4 & 67.4 & 67.4 & 68.7 & 68.7 & 68.7 \\ 68.7 & 69.0 & 70.4 & 70.4 & 71.2 & 71.2 & 71.2 & 73.0 & 73.0 & 73.1 & 73.1 \\ 74.6 & 74.6 & 74.6 & 74.6 & 79.3 & 79.3 & 79.3 & 79.3 & 83.0 & 83.0 & 83.0\end{array}\) Use various techniques discussed in this chapter to organize, summarize, and describe the data.

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.