/*! This file is auto-generated */ .wp-block-button__link{color:#fff;background-color:#32373c;border-radius:9999px;box-shadow:none;text-decoration:none;padding:calc(.667em + 2px) calc(1.333em + 2px);font-size:1.125em}.wp-block-file__button{background:#32373c;color:#fff;text-decoration:none} Problem 2 Briefly explain the meaning of a... [FREE SOLUTION] | 91Ó°ÊÓ

91Ó°ÊÓ

Briefly explain the meaning of an outlier. Is the mean or the median a better measure of central tendency for a data set that contains outliers? Illustrate with the help of an example.

Short Answer

Expert verified
An outlier is a data point that significantly varies from other observations. It affects the mean, pulling it towards the outlier, hence not accurately representing the central tendency in a dataset with outliers. The median, however, remains unaffected by the outliers, making it a better measure of central tendency for datasets with outliers. An example: for the dataset \(2, 3, 4, 4, 9, 40\), the mean is \(10.33\) and median is \(4.5.\)

Step by step solution

01

Define an Outlier

An outlier is a data point that differs significantly from other observations in a dataset. It can occur due to variability in the data or due to experimental errors. Outliers can distort the interpretation and analyses of the data.
02

Mean and Outliers

The mean is the average of all numbers in the dataset. Its value can be heavily influenced by outliers. If there are extremely high or low outliers, they can drag the mean towards them, leading to an inaccurate representation of the central tendency.
03

Median and Outliers

The median is the middle number of a sorted dataset. Whether outliers are extremely high or low, they do not affect the value of the median. It provides a better measure of central tendency when the dataset contains outliers.
04

Illustrate with Example

Consider a dataset: \(2, 3, 4, 4, 9, 40\). The mean is \(10.33\), pulled upwards by the outlier '40'. On the other hand, the median value \(4.5\) less distorted by the outlier and gives a better representation of the central tendency.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with 91Ó°ÊÓ!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Seven airline passengers in economy class on the same flight paid an average of \(\$ 361\) per ticket. Because the tickets were purchased at different times and from different sources, the prices varied. The first five passengers paid \(\$ 420, \$ 210, \$ 333, \$ 695\), and \(\$ 485\). The sixth and seventh tickets were purchased by a couple who paid identical fares. What price did each of them pay?

The following data give the revenues (in millions of dollars) for the last available fiscal year for a sample of six charitable organizations for serious diseases (Charity Navigator, 2009). The values are, listed in order, for the Alzheimer's Association, the American Cancer Society, the American Diabetes Association, the American Heart Association, the American Lung Association, and the Cystic Fibrosis Foundation. \(\begin{array}{llllll}952 & 1129 & 231 & 668 & 49 & 149\end{array}\) Compute the mean and median. Do these data have a mode? Why or why not?

The following data set lists the number of women from each of 10 different countries who were on the Rolex Women's World Golf Rankings Top 25 list as of March 31,2009 . The data, entered in that order, are for the following countries: Australia, Brazil, England, Japan, Korea, Mexico, Norway, Sweden, Taiwan, and United States. \(\begin{array}{lllllllll}2 & 1 & 1 & 2 & 9 & 1 & 1 & 2 & 2 & 4\end{array}\) a. Calculate the mean and median for these data. b. Identify the outlier in this data set. Drop the outlier and recalculate the mean and median. Which of these two summary measures changes by a larger amount when you drop the outlier? c. Which is the better summary measure for these data, the mean or the median? Explain.

The following data give the recent estimates of crude oil reserves (in billions of barrels) of Saudi Arabia, Iraq, Kuwait, Iran, United Arab Emirates, Venezuela, Russia, Libya, Nigeria, China, Mexico, and the United States. The reserves for these countries are listed in that order. \(\begin{array}{rrrrrr}261.7 & 112.0 & 97.7 & 94.4 & 80.3 & 64.0 \\ 51.2 & 29.8 & 27.0 & 26.8 & 25.0 & 22.5\end{array}\) Prepare a box-and-whisker plot. Are the data symmetric or skewed?

The following data give the speeds of 13 cars (in mph) measured by radar, traveling on I-84. \(\begin{array}{lllllll}73 & 75 & 69 & 68 & 78 & 69 & 74 \\\ 76 & 72 & 79 & 68 & 77 & 71 & \end{array}\) a. Find the values of the three quartiles and the interquartile range. b. Calculate the (approximate) value of the 35 th percentile. c. Compute the percentile rank of 71 .

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.