/*! This file is auto-generated */ .wp-block-button__link{color:#fff;background-color:#32373c;border-radius:9999px;box-shadow:none;text-decoration:none;padding:calc(.667em + 2px) calc(1.333em + 2px);font-size:1.125em}.wp-block-file__button{background:#32373c;color:#fff;text-decoration:none} Problem 62 Fiber in the Diet The number of ... [FREE SOLUTION] | 91Ó°ÊÓ

91Ó°ÊÓ

Fiber in the Diet The number of grams of fiber eaten in one day for a sample of ten people are \(\begin{array}{ll}10 & 11\end{array}\) \(\begin{array}{ll}11 & 14\end{array}\) \(\begin{array}{llllll}15 & 17 & 21 & 24 & 28 & 115\end{array}\) (a) Find the mean and the median for these data. (b) The value of 115 appears to be an obvious outlier. Compute the mean and the median for the nine numbers with the outlier excluded. (c) Comment on the effect of the outlier on the mean and on the median.

Short Answer

Expert verified
With the outlier, the mean is 25.6 and the median is 16. Without the outlier, the mean is 15.67 and the median is 15. This shows that the mean is more affected by the outlier than the median, making the median a more resistant measure of central tendency in this case.

Step by step solution

01

Calculate the Mean (with outlier)

First, find the sum of all the data points: \(10 + 11 + 11 + 14 + 15 + 17 + 21 + 24 + 28 + 115 = 256\). Then divide this by the total number of data points (10 in this case) to get the mean: \(256 / 10 = 25.6\)
02

Calculate the Median (with outlier)

First, list the numbers in ascending order: \(10, 11, 11, 14, 15, 17, 21, 24, 28, 115\). Since there are 10 data points, the median is the average of the 5th and 6th data points, which is \( (15 + 17) / 2 = 16\).
03

Calculate the Mean (without outlier)

Now, exclude the outlier (115) and repeat the process for finding the mean: \(10 + 11 + 11 + 14 + 15 + 17 + 21 + 24 + 28 = 141\). Then divide this by the number of data points minus the outlier (9 in this case) to get the new mean: \(141 / 9 = 15.67\)
04

Calculate the Median (without outlier)

Again, exclude the outlier and list the numbers in ascending order: \(10, 11, 11, 14, 15, 17, 21, 24, 28\). There are now 9 data points and the median is the 5th, which is 15.
05

Discuss the Effect of the Outlier

Notice how the mean got smaller when the outlier was excluded, from 25.6 to 15.67. This shows that the mean is sensitive to extreme values. However, the median only decreased slightly, from 16 to 15. This demonstrates that the median is resistant to outliers, and hence it is a better representation of the central value when outliers are present.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with 91Ó°ÊÓ!

Key Concepts

These are the key concepts you need to understand to accurately answer the question.

Understanding Mean
The mean, commonly known as the average, is a measure of central tendency that summarizes a set of data points by taking their sum and dividing by the number of data points. For example, in our exercise where people consumed fiber, the mean gives us an idea of the "average" amount of fiber consumed.
To find the mean:
  • Add up all the numbers.
  • Then divide the total by the number of data points.
The calculation can be sensitive to extreme values, also known as outliers. In our case, when all numbers are included, the mean becomes 25.6, influenced significantly by the highest number, 115. This shows how a single outlier can skew the mean upwards. After excluding the outlier, the mean dropped to 15.67, more indicative of the typical data points. Thus, mean is useful but can be misleading if there's an outlier in the dataset.
Exploring Median
The median is another measure of central tendency, which might often be a better indicator of the "middle" of a dataset, especially when outliers are present. It's found by arranging all data points in order and then selecting the middle value. If there is an even number of observations, the median is the average of the two middle values.
In our dataset:
  • With the outlier, the median is calculated as the average of 15 and 17, yielding 16.
  • Without the outlier, the new middle value is 15.
This stability is why the median is preferred over the mean when dealing with skewed data or outliers. The median remained relatively unchanged despite the outlier's presence, emphasizing its robustness as a measure of central tendency.
Recognizing Outliers
Outliers are those data points that significantly differ from other observations. They appear like they don't belong, often much higher or lower than the rest of the data. In our fiber intake dataset, 115 is an obvious outlier as it is much larger than all other entries.
Recognizing outliers is crucial because they can:
  • Distort the mean, giving a false sense of the typical data value.
  • Have less impact on the median, but still suggest further investigation regarding why such data point exists.
Dealing with outliers involves assessing their impact on your analysis. In many situations, as seen in the exercise, excluding them can provide better insights into data trends. For serious analysis, consider why the outlier is there before deciding to exclude it, as it might reveal important insights or errors in data collection.

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Create Your Own: Be Creative!! Create your own data visualization, and describe it. Be creative!!

Use the \(95 \%\) rule and the fact that the summary statistics come from a distribution that is symmetric and bell-shaped to find an interval that is expected to contain about \(95 \%\) of the data values. A bell-shaped distribution with mean 1500 and standard deviation 300.

Each describe a sample. The information given includes the five number summary, the sample size, and the largest and smallest data values in the tails of the distribution. In each case: (a) Clearly identify any outliers, using the IQR method. (b) Draw a boxplot. Five number summary: (42,72,78,80,99)\(;\) \(n=120 .\) Tails: 42, 63, \(65,67,68, \ldots, 88,89,95,96,99\).

Each describe a sample. The information given includes the five number summary, the sample size, and the largest and smallest data values in the tails of the distribution. In each case: (a) Clearly identify any outliers, using the IQR method. (b) Draw a boxplot. Five number summary: (15,42,52,56,71)\(;\) \(n=120 .\) Tails: \(15,20,28,30,31, \ldots, 64,65,65,66,71\)

Deal with an experiment to study the effects of financial incentives to quit smoking. 19 Smokers at a company were invited to participate in a smoking cessation program and randomly assigned to one of two groups. Those in the Reward group would get a cash award if they stopped smoking for six months. Those in the Deposit group were asked to deposit some money which they would get back along with a substantial bonus if they stopped smoking. After six months, 156 of the 914 smokers who accepted the invitation to be in the reward-only program stopped smoking, while 78 of the 146 smokers who paid a deposit quit. Set up a two-way table and compare the success rates between participants who entered the two programs.

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.