/*! This file is auto-generated */ .wp-block-button__link{color:#fff;background-color:#32373c;border-radius:9999px;box-shadow:none;text-decoration:none;padding:calc(.667em + 2px) calc(1.333em + 2px);font-size:1.125em}.wp-block-file__button{background:#32373c;color:#fff;text-decoration:none} Problem 30 Median versus mean The mean and ... [FREE SOLUTION] | 91Ó°ÊÓ

91Ó°ÊÓ

Median versus mean The mean and median describe the center. a. Why is the median sometimes preferred? Give an example. b. Why is the mean sometimes preferred? Give an example.

Short Answer

Expert verified
The median is preferred when there are outliers; the mean is useful for symmetric distributions.

Step by step solution

01

Understanding the Median's Preference

The median is sometimes preferred because it is less affected by extreme values (outliers) in a dataset. For example, consider the incomes of a group of people: $30,000, $32,000, $35,000, $36,000, and $1,000,000. The mean income is $226,600, but the median income is $35,000. The median better represents the typical income because the mean is skewed by the outlier ($1,000,000).
02

Understanding the Mean's Preference

The mean is sometimes preferred because it takes into account all values in the dataset and provides the arithmetic average. For example, in a symmetrically distributed dataset like $50, $52, $54, $56, and $58, the mean and median are both $54. The mean gives a complete picture of the dataset's total distribution.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with 91Ó°ÊÓ!

Key Concepts

These are the key concepts you need to understand to accurately answer the question.

Median
The median is a valuable measure in statistics. It identifies the middle point of a dataset. More specifically, it's the number that separates the lower half from the upper half in a sorted list. What makes the median particularly useful is its resistance to outliers or extreme values. If a few numbers in a dataset are much higher or lower than most others, these outliers can severely disrupt the calculations of other statistical measures. This is especially common in datasets involving financial values or rare occurrences. For instance, if you have a set of incomes \([30,000, 32,000, 35,000, 36,000, 1,000,000]\), the median is \(35,000. It is not impacted by the extremely high income of \)1,000,000, which significantly affects the mean calculation. Hence, the median is often a better indicator of the "typical" value in skewed distributions.
Mean
The mean, also known as the average, is calculated by adding up all the numbers in a dataset and dividing by the number of entries. It's a commonly used measure to find the central tendency as it includes all the data points and gives an arithmetic average. The mean is particularly helpful when dealing with symmetric distributions. Here every value is equally weighted, and there aren't any distortions from outliers. Consider a symmetrical dataset like \([50, 52, 54, 56, 58]\). Both the mean and median would be 54, showing the balance and providing a comprehensive picture of the dataset's spread. As every entry affects the mean, it considers the entire dataset's value spread, making it useful in fields such as economics or when every data point contributes meaningfully to the overall outcome.
Outliers
Outliers are data points that deviate significantly from other observations in a dataset. They can occur due to variability in measurements or experimental errors. Outliers are crucial because they can drastically affect statistical results, especially the mean. For instance, in the dataset \([30,000, 32,000, 35,000, 36,000, 1,000,000]\), the high value \(1,000,000 is an outlier. It inflates the average to \)226,600, which doesn't accurately represent the majority's income. While outliers can indicate errors, they can also uncover important insights, such as novel phenomena or data diversity. Identifying and understanding outliers is important for accurate data analysis.
Symmetrical Distribution
A symmetrical distribution occurs when values are evenly spread around the center, creating a mirror image on either side. In such distributions, the mean, median, and mode are all equal, often simplifying statistical analysis as trends can be observed under the assumption of data balance. This type of distribution is frequently seen when dealing with normally distributed datasets, such as human height or standardized test scores. For example, in the dataset \([50, 52, 54, 56, 58]\), not only are the mean and median equal at 54, but they also highlight the symmetry. Symmetry is crucial in statistics as it allows predictions and conclusions based on the shape and spread of data, validating the use of statistical tools assuming normality.

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Net worth by degree The Statistical Abstract of the United States reported that in 2004 for those with a college education, the median net worth was \(\$ 226,100\) and the mean net worth was \(\$ 851,300 .\) For those with a high school diploma only, the values were \(\$ 68,700\) and \(\$ 196,800\). a. Explain how the mean and median could be so different for each group. b. Which measure do you think gives a more realistic measure of a typical net worth, the mean or the median. Why?

Golfers' gains During the 2010 Professional Golfers Association (PGA) season, 90 golfers earned at least \(\$ 1\) million in tournament prize money. Of those, 5 earned at least \(\$ 4\) million, 11 earned between \(\$ 3\) million and \(\$ 4\) million, 21 earned between \(\$ 2\) million and \(\$ 3\) million, and 53 earned between \(\$ 1\) million and \(\$ 2\) million. a. Would the data for all 90 golfers be symmetric, skewed to the left, or skewed to the right? b. Two measures of central tendency of the golfers' winnings were \(\$ 2,090,012\) and \(\$ 1,646,853 .\) Which do you think is the mean and which is the median?

Shape of cigarette taxes A recent summary for the distribution of cigarette taxes (in cents) among the 50 states and Washington, D.C. in the United States reported \(\bar{x}=73\) and \(s=48 .\) Based on these values, do you think that this distribution is bell shaped? If so, why? If not, why not, and what shape would you expect?

Sick leave Exercise 2.47 showed data for a company that investigated the annual number of days of sick leave taken by its employees. The data are \(\begin{array}{llllllll}0 & 0 & 4 & 0 & 0 & 0 & 6 & 0\end{array}\) a. The standard deviation is \(2.4 .\) Find and interpret the range. b. The quartiles are \(\mathrm{Q} 1=0,\) median \(=0, \mathrm{Q} 3=2 .\) Find the interquartile range. c. Suppose the 6 was incorrectly recorded and is supposed to be \(60 .\) The standard deviation is then 21.1 but the quartiles do not change. Redo parts \(\mathrm{a}-\mathrm{c}\) with the correct data and describe the effect of this outlier. Which measure of variability, the range, IQR, or standard deviation, is least affected by the outlier? Why?

Female strength The High School Female Athletes data file on the text CD has data for 57 high school female athletes on the maximum number of pounds they were able to bench press, which is a measure of strength. For these $$ \text { data, } \bar{x}=79.9, \mathrm{Q} 1=70, \text { median }=80, \mathrm{Q} 3=90 . $$ a. Interpret the quartiles. b. Would you guess that the distribution is skewed or roughly symmetric? Why?

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.