/*! This file is auto-generated */ .wp-block-button__link{color:#fff;background-color:#32373c;border-radius:9999px;box-shadow:none;text-decoration:none;padding:calc(.667em + 2px) calc(1.333em + 2px);font-size:1.125em}.wp-block-file__button{background:#32373c;color:#fff;text-decoration:none} Problem 145 Using MAD to measure variability... [FREE SOLUTION] | 91Ó°ÊÓ

91Ó°ÊÓ

Using MAD to measure variability The standard deviation is the most popular measure of variability from the mean. It uses squared deviations, since the ordinary deviations sum to zero. An alternative measure is the mean absolute deviation, \(\sum|x-\bar{x}| / n\). a. Explain why greater variability tends to result in larger values of this measure. b. Would the MAD be more, or less, resistant than the standard deviation? Explain.

Short Answer

Expert verified
Greater variability increases MAD as deviations from the mean grow. MAD is more resistant to outliers than standard deviation.

Step by step solution

01

Understanding MAD

The Mean Absolute Deviation (MAD) measures the average absolute deviations of data points from their mean. Thus, it provides a way to express the average distance of each data point from the mean without taking direction into account (i.e., whether it's above or below the mean).
02

Explain Part a: Greater Variability

In a dataset with greater variability, the distances of data points from the mean tend to be larger, consequently increasing the absolute deviations \( |x - \bar{x}| \). When these larger deviations are averaged, the result is a larger MAD value. This is because MAD effectively captures how spread out the data points are from the mean.
03

Explain Part b: Resistance of MAD

The Mean Absolute Deviation is more resistant to outliers as it does not square the deviations, meaning it does not disproportionately weigh extreme values. In contrast, the standard deviation squares differences from the mean, which amplifies the contribution of outliers and makes it less resistant than MAD.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with 91Ó°ÊÓ!

Key Concepts

These are the key concepts you need to understand to accurately answer the question.

Standard Deviation
When we discuss the spread of a dataset, the standard deviation often comes up as a key measure. It provides a numerical value that tells us how far the individual data points tend to be from the mean of the dataset. In essence, it's a way to quantify the amount of variation within a set of values.

Standard deviation is calculated by finding the square root of the average of the squared deviations from the mean. Here's the basic formula:
  • First, find the mean of the data.
  • Subtract the mean from each data point to get the deviations.
  • Square each deviation to ensure they are positive.
  • Find the mean of these squared deviations.
  • Take the square root of this mean to get the standard deviation.
This method of squaring deviations can give a better sense of data point spread because it penalizes extreme values more heavily. This is why the standard deviation is very sensitive to outliers and changes a lot if the dataset contains such extreme values.
Variability
When analyzing data, we often talk about how spread out the values are, which is what we call variability. Variability tells us the degree to which data points differ from each other and from the mean. It's an important aspect of data analysis because it affects how we interpret the data and make predictions based on it.

There are several measures of variability, including:
  • Range: The simplest measure, calculated by subtracting the smallest value from the largest.
  • Variance: The average of the squared differences from the mean, without taking the square root like in the standard deviation.
  • Mean Absolute Deviation (MAD): The average of the absolute differences from the mean, offering an alternative to both range and standard deviation.
Variability is crucial in statistics because it shows how scattered the data is, impacting the reliability and validity of statistical conclusions. More variability might indicate that fewer conclusions can be safely drawn from the data.
Resistant to Outliers
Outliers are data points that differ significantly from other observations. They can occur by chance or may indicate a measurement error or a different population. When measuring variability, it's important to consider how these outliers affect the analysis.

Some measures of variability are more resistant to outliers than others:
  • Mean Absolute Deviation (MAD): It does not overly emphasize extreme values because it uses the absolute differences, reflecting the average distance of all data points from the mean, equally.
  • Standard Deviation: As it squares the deviations, it amplifies the effect of outliers, making it less resistant. Major outliers can drastically increase the standard deviation.
Using a measure like MAD can be advantageous in datasets where outliers are present. Since MAD does not increase disproportionately with extreme values, it provides a clearer picture of the true spread of the majority of data points. Understanding and choosing the right measure of variability based on the presence of outliers is essential for accurate data analysis.

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Shape of the histogram For each of the following variables, indicate whether you would expect its histogram to be symmetric, skewed to the right, or skewed to the left. Explain why. a. Assessed value of houses in a large city (Hint: Would the relatively few homes with extremely high assessed value result in a long right tail or a long left tail?) b. Number of times checking account overdrawn in the past year for the faculty in your school c. IQ for the general population d. The height of female college students

Sick leave A company decides to investigate the amount of sick leave taken by its employees. A sample of eight employees yields the following numbers of days of sick leave taken in the past year: \(\begin{array}{llll}0 & 0 & 6 & 0\end{array}\) \(\begin{array}{llll}0 & 0 & 4 & 0\end{array}\) a. Find and interpret the range. b. Find and interpret the standard deviation \(s\). c. Suppose the 6 was incorrectly recorded and is supposed to be 60 . Redo parts a and \(b\) with the correct data and describe the effect of this outlier.

Life expectancy The Human Development Report 2006, published by the United Nations, showed life expectancies by country. For Western Europe, the values reported were Denmark \(77,\) Portugal 77 , Netherlands 78 , Finland 78 , Greece 78 , Ireland 78 , UK 78 , Belgium 79 , France 79 , Germany \(79,\) Norway \(79,\) Italy \(80,\) Spain \(80,\) Sweden 80 , Switzerland 80 . For Africa, the values reported (many of which were substantially lower than five years earlier because of the prevalence of AIDS) were Botswana \(37,\) Zambia \(37,\) Zimbabwe \(37,\) Malawi 40 , Angola 41 , Nigeria \(43,\) Rwanda \(44,\) Uganda 47 , Kenya \(47,\) Mali \(48,\) South Africa \(49,\) Congo \(52,\) Madagascar 55 , Senegal 56, Sudan 56, Ghana 57 . a. Which group (Western Europe or Africa) of life expectancies do you think has the larger standard deviation? Why? b. Find the standard deviation for each group. Compare them to illustrate that \(s\) is larger for the group that shows more variability from the mean.

Knowing homicide victims The table summarizes responses of 4383 subjects in a recent General Social Survey to the question, "Within the past month, how many people have you known personally that were victims of homicide?" a. To find the mean, it is necessary to give a score to the " 4 or more" category. Find it, using the score 4.5 . (In practice, you might try a few different scores, such as \(4,4.5,5,6,\) to make sure the resulting mean is not highly sensitive to that choice.) b. Find the median. Note that the "4 or more" category is not problematic for it. c. If 1744 observations shift from 0 to 4 or more, how do the mean and median change? d. Why is the median the same for parts \(\mathrm{b}\) and \(\mathrm{c},\) even though the data are so different?

Categorical/quantitative difference a. Explain the difference between categorical and quantitative variables. b. Give an example of each.

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.