/*! This file is auto-generated */ .wp-block-button__link{color:#fff;background-color:#32373c;border-radius:9999px;box-shadow:none;text-decoration:none;padding:calc(.667em + 2px) calc(1.333em + 2px);font-size:1.125em}.wp-block-file__button{background:#32373c;color:#fff;text-decoration:none} Problem 33 The three measures of center int... [FREE SOLUTION] | 91Ó°ÊÓ

91Ó°ÊÓ

The three measures of center introduced in this chapter are the mean, median, and trimmed mean. Two additional measures of center that are occasionally used are the midrange, which is the average of the smallest and largest observations, and the midfourth, which is the average of the two fourths. Which of these five measures of center are resistant to the effects of outliers and which are not? Explain your reasoning.

Short Answer

Expert verified
Median and trimmed mean are resistant to outliers; mean, midrange, and midfourth are not.

Step by step solution

01

Understand the Given Measures of Center

The exercise discusses five measures of center: the mean, median, trimmed mean, midrange, and midfourth. The task is to determine which of these measures are resistant to outliers.
02

Define 'Resistant to Outliers'

A measure of center is resistant to outliers if it is not significantly affected by extreme values. In other words, the presence of a few unusually small or large observations will not drastically change the result of the measure.
03

Evaluate the Mean

The mean is the arithmetic average of all observations. It is not resistant to outliers because a single extremely large or small value can significantly affect the mean's value.
04

Evaluate the Median

The median is the middle value when the data is ordered. It is resistant to outliers because it depends on the order of the data, not the magnitude of extreme values.
05

Evaluate the Trimmed Mean

The trimmed mean is calculated by removing a certain percentage of the smallest and largest values before computing the mean. It is resistant to outliers because the outlier values are removed before calculation.
06

Evaluate the Midrange

The midrange is the average of the smallest and largest observations. It is not resistant to outliers because it directly uses potentially extreme values in its calculation.
07

Evaluate the Midfourth

The midfourth is the average of the two "fourths," which usually means an average of some central portion of data (e.g., using quartiles). However, without removing extremes, this measure can still be affected by outliers, so it's generally not resistant.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with 91Ó°ÊÓ!

Key Concepts

These are the key concepts you need to understand to accurately answer the question.

Understanding the Mean
The mean, often referred to as the average, is a widely used measure of center. To find the mean, you simply add up all the numbers in your dataset and then divide by the number of items in the dataset. For example, if you have the numbers 1, 2, and 3, the mean would be \( \frac{1+2+3}{3} = 2 \). While the mean is easy to calculate and understand, it has a major downside: it is highly sensitive to outliers. Outliers are extreme values that differ significantly from the rest of the data. For instance, if you have 1, 2, 3, and 100, the mean becomes \( \frac{1+2+3+100}{4} = 26.5 \). This shows how just one outlier, the number 100, can drastically shift the mean, making it less representative of the majority of the data.
Introduction to the Median
The median is another important measure of center. It represents the middle value in a dataset when the numbers are arranged in order. This is particularly helpful in datasets with outliers. Unlike the mean, the median does not consider the magnitude of numbers, only their position. To find the median, arrange your data in order and locate the middle number. For an odd number of values, it's simple: it's the middle number. For even numbers, take the average of the two middle values. For example, in the dataset 1, 2, 3, the median is 2, and for 1, 2, 3, 4, it is \( \frac{2+3}{2} = 2.5 \). The median's resilience to outliers makes it a reliable measure of the dataset's center when extremes might distort the analysis.
Exploring the Trimmed Mean
A trimmed mean is a refined version of the mean, designed to reduce the influence of outliers. In this approach, you cut off a specified small percentage of the largest and smallest values before finding the mean. For instance, if you decide on a 10% trim for a dataset, you would ignore the top 10% and bottom 10% of the data. This method leads to a mean that is generally more accurate for representing the "center" of the dataset, particularly when outliers are present. By removing these extreme values, the trimmed mean becomes less sensitive to those potential anomalies, providing a more stable measure of the data's core tendencies.
The Concept of Midrange
The midrange might be an unfamiliar term for many, but it's straightforward once understood. It's simply the average of the smallest and largest numbers in a dataset. This seems intuitive, like you're finding the middle ground between two extremes. To calculate the midrange, add the smallest and largest numbers, then divide by 2. For example, with the data 1, 2, 3, and 10, the midrange is \( \frac{1+10}{2} = 5.5 \). Even though the midrange is easy to compute, it is highly sensitive to outliers. Any extreme value will directly influence the midrange, thus making it not ideal for datasets with outliers.
Understanding Outliers
Outliers are values in your dataset that do not fit within the usual pattern of the data. They could be exceptionally high or low compared to the rest. In any statistical analysis, identifying outliers is crucial since they can skew results, especially when relying on specific measures of center like the mean or midrange. Handling outliers effectively is key to ensuring an accurate representation of your data. Measures like the median and trimmed mean are less influenced by outliers, making them more reliable choices in some cases. While outliers might result from errors, they could also indicate variability or unique phenomena in the data. Understanding their impact on various statistical measures allows for better decision-making and analysis.

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

The article "A Thin-Film Oxygen Uptake Test for the Evaluation of Automotive Crankease Lubricants" (Lubric. Engr., 1984: 75-83) reported the following data on oxidation-induction time (min) for various commercial oils: \(\begin{array}{lllllllllll}87 & 103 & 130 & 160 & 180 & 195 & 132 & 145 & 211 & 105 & 145\end{array}\) \(\begin{array}{llllllll}153 & 152 & 138 & 87 & 99 & 93 & 119 & 129\end{array}\) a. Calculate the sample variance and standard deviation. b. If the observations were reexpressed in hours, what would be the resulting values of the sample variance and sample standard deviation? Answer without actually performing the reexpression.

The accompanying data set consists of observations on shower-flow rate (L/min) for a sample of \(n=129\) houses in Perth, Australia ("An Application of Bayes Methodology to the Analysis of Diary Records in a Water Use Study," J. Amer. Stat. Assoc., 1987: 705-711): \(\begin{array}{rrrrrrrrrr}4.6 & 12.3 & 7.1 & 7.0 & 4.0 & 9.2 & 6.7 & 6.9 & 11.5 & 5.1 \\ 11.2 & 10.5 & 14.3 & 8.0 & 8.8 & 6.4 & 5.1 & 5.6 & 9.6 & 7.5 \\\ 7.5 & 6.2 & 5.8 & 2.3 & 3.4 & 10.4 & 9.8 & 6.6 & 3.7 & 6.4 \\ 8.3 & 6.5 & 7.6 & 9.3 & 9.2 & 7.3 & 5.0 & 6.3 & 13.8 & 6.2 \\ 5.4 & 4.8 & 7.5 & 6.0 & 6.9 & 10.8 & 7.5 & 6.6 & 5.0 & 3.3 \\ 7.6 & 3.9 & 11.9 & 2.2 & 15.0 & 7.2 & 6.1 & 15.3 & 18.9 & 7.2 \\ 5.4 & 5.5 & 4.3 & 9.0 & 12.7 & 11.3 & 7.4 & 5.0 & 3.5 & 8.2 \\ 8.4 & 7.3 & 10.3 & 11.9 & 6.0 & 5.6 & 9.5 & 9.3 & 10.4 & 9.7 \\ 5.1 & 6.7 & 10.2 & 6.2 & 8.4 & 7.0 & 4.8 & 5.6 & 10.5 & 14.6 \\ 10.8 & 15.5 & 7.5 & 6.4 & 3.4 & 5.5 & 6.6 & 5.9 & 15.0 & 9.6 \\ 7.8 & 7.0 & 6.9 & 4.1 & 3.6 & 11.9 & 3.7 & 5.7 & 6.8 & 11.3 \\ 9.3 & 9.6 & 10.4 & 9.3 & 6.9 & 9.8 & 9.1 & 10.6 & 4.5 & 6.2 \\ 8.3 & 3.2 & 4.9 & 5.0 & 6.0 & 8.2 & 6.3 & 3.8 & 6.0 & \end{array}\) a. Construct a stem-and-leaf display of the data. b. What is a typical, or representative, flow rate? c. Does the display appear to be highly concentrated or spread out? d. Does the distribution of values appear to be reasonably symmetric? If not, how would you describe the departure from symmetry? e. Would you describe any observation as being far from the rest of the data (an outlier)?

The following data on distilled alcohol content (\%) for a sample of 35 port wines was extracted from the article "A Method for the Estimation of Alcohol in Fortified Wines Using Hydrometer Baumé and Refractometer Brix" (Amer. J. Enol. Vitic., 2006: \(486-490\) ). Each value is an average of two duplicate measurements. $$ \begin{array}{lllllllll} 16.35 & 18.85 & 16.20 & 17.75 & 19.58 & 17.73 & 22.75 & 23.78 & 23.25 \\ 19.08 & 19.62 & 19.20 & 20.05 & 17.85 & 19.17 & 19.48 & 20.00 & 19.97 \\ 17.48 & 17.15 & 19.07 & 19.90 & 18.68 & 18.82 & 19.03 & 19.45 & 19.37 \\ 19.20 & 18.00 & 19.60 & 19.33 & 21.22 & 19.50 & 15.30 & 22.25 & \end{array} $$ Use methods from this chapter, including a boxplot that shows outliers, to describe and summarize the data.

Give one possible sample of size 4 from each of the following populations: a. All daily newspapers published in the United States b. All companies listed on the New York Stock Exchange c. All students at your college or university d. All grade point averages of students at your college or university

Automated electron backscattered diffraction is now being used in the study of fracture phenomena. The following information on misorientation angle (degrees) was extracted from the article "Hbservations on the Faceted Initiation Site in the Dwell-Fatigue Tested Ti-6242 Alloy: Crystallographic Orientation and Size Effects" (Metallurgical and Materials Trans., 2006: 1507-1518). \(\begin{array}{lcccc}\text { Class: } & 0-<5 & 5-<10 & 10-<15 & 15-<20 \\\ \text { Rel freq: } & .177 & .166 & .175 & .136 \\ \text { Class: } & 20-<30 & 30-<40 & 40-<60 & 60-<90 \\ \text { Rel freq: } & .194 & .078 & .044 & .030\end{array}\) a. Is it true that more than \(50 \%\) of the sampled angles are smaller than \(15^{\circ}\), as asserted in the paper? b. What proportion of the sampled angles are at least \(30^{\circ} ?\) c. Roughly what proportion of angles are between \(10^{\circ}\) and \(25^{\circ} ?\) d. Construct a histogram and comment on any interesting features.

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.