/*! This file is auto-generated */ .wp-block-button__link{color:#fff;background-color:#32373c;border-radius:9999px;box-shadow:none;text-decoration:none;padding:calc(.667em + 2px) calc(1.333em + 2px);font-size:1.125em}.wp-block-file__button{background:#32373c;color:#fff;text-decoration:none} Q 3.54. Outliers and trimmed means. Some... [FREE SOLUTION] | 91Ó°ÊÓ

91Ó°ÊÓ

Outliers and trimmed means. Some data sets contain outliers, observation that fall well outside the overall pattern of the data(We discuss outliers in more detail in section 3.4) Suppose, for instance that you are interested in the ability o high school algebra student to compute square roots. You decide to give a square root exam to 10 of these students Unfortunately, one of the student had a fight with his girlfriend and cannot concentrate he gets a 0. The 10 score are displayed in increasing order in the following table. The score of 0 is an outlier.

Statisticians have a systematic method for avoiding extreme observation and outliers when they calculated means. They compute trimmed means, in which high and low observation are deleted or "trimmed off" before the mean is calculated . For instance, to compute the 10% trimmed mean of the test score data. we first delete both the bottom 10% and the top 10% of the ordered data. that is,0 and 80. Then we calculated the mean of the remaining data. Thus the10% trimmed mean of the test score data is

The following table displays a set of score for a 40 question algebra final

Part (a) Do any the score look like outliers?

Part (b) Compute the usual mean of the data.

Part (c) Compute the 5% trimmed mean of the data.

Part (d) Compute the 10% trimmed mean of the data.
Part (e) Compare the means you obtained in parts (b) (d) which of the three provides the best measure of center for the data?

Short Answer

Expert verified

Part (a) 2 and 4 are outliers.

Part (b) 19.25

Part (c) 19.72

Part (d) 20.25

Part (e) 10% trimmed value is better measure of center

Step by step solution

01

Part (a) Step 1. Given information.  

The provided data set is:

5,15,16,16,19,21,21,25,26,27,4,15,16,17,20,21,24,25,27,28

02

Part (a) Step2. If any of the scores appear to be outliers.

Outliers are observations that deviate significantly from the overall pattern of the data.

The sorted data is written as,

(2,,4,15,15,16,16,16,17,19,20,21,21,21,24,25,25,26,27,27,28)

Because the other scores are all two digits, and scores 2 and 4 are single digits. As a result, they act as an outlier.

As a result, the scores that appear to be outliers are 2 and 4.

03

Part (b) Step 1. Given information.  

The provided data set is:

5,15,16,16,19,21,21,25,26,27,4,15,16,17,20,21,24,25,27,28

04

Part (b) Step2.  The given data's mean. 

Mean is expressed as follows:

x¯=∑xn

The sorted data is written as,

(2,,4,15,15,16,16,16,17,19,20,21,21,21,24,25,25,26,27,27,28)

The mean of the given data set is computed as follows:

x¯=(2+4+15+15+16+16+16+17+19+20+21+21+21+2++25+25+26+27+27+2820x¯=38520x¯=19.25

05

Part (c) Step 1. Given information.  

The provided data set is:

5,15,16,16,19,21,21,25,26,27,4,15,16,17,20,21,24,25,27,28

06

Part (c) Step 2. The trimmed mean of the given data by 5%.

For the 5% trimmed mean, the 5% beginning and 5% ending data are removed.

5% trimmed data = 4,15,15,,16,16,16,17,,19,20,21,21,,21,24,25,26,27,27

Mean is expressed as follows:

x¯=∑xn

The mean of the given data set is computed as follows:

x¯=4+15+15+16+16+16+17+19+20+21+21+21+2++25+25+26+27+2718x¯=35518x¯=19.7222x¯≈19.72

07

Part (d) Step 1. Given information.  

The provided data set is:

5,15,16,16,19,21,21,25,26,27,4,15,16,17,20,21,24,25,27,28

08

Part (d) Step 2. The trimmed mean of the given data by 10%.

For the 10% trimmed mean, the 10% beginning and 10% ending data are removed.

10% trimmed data = 15,15,,16,16,16,17,,19,20,21,21,,21,24,25,26,27

Mean is expressed as follows:

x¯=∑xn

The mean of the given data set is computed as follows:

x¯=15+15+16+16+16+17+19+20+21+21+21+2++25+25+26+2716x¯=32416x¯=20.25

09

Part (e) Step 1. Given information.  

The provided data set is:

5,15,16,16,19,21,21,25,26,27,4,15,16,17,20,21,24,25,27,28

10

Part (d) Step 2.  The mean of parts (b) to (d) that provides the best measure of center.

The true mean is 19.25.

19.72 is the trimmed mean.

20.25 is the trimmed mean.

As a result, 10% trimmed mean is greater than 5% trimmed mean, which is greater than the actual mean.

The mean with a 10% trimmed value is a better measure of center because both outlier values have been removed.

As a result, the 10% trimmed mean is greater than the 5% trimmed mean, which is greater than the actual mean, and the 10% trimmed value is a better measure of centre because both outlier values have been removed.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with 91Ó°ÊÓ!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

a) Determine the mode of the data.
b) Decide whether it would be appropriate to use either the mean or median as a measure of center. Explain your answer.

Medical School Faculty. The Association of American Medical College (AAMC) in a compiles data on medical school facility and publishes the results in AAMC Faculty Roster. The following table presents a frequency distribution of rank for medical school faculty during one year.

Each of the following smooth curves represents the shape of a data set. In each case, decide whether application of the empirical rule to the data set is appropriate. Explain your answers.

In each Exercises 3.82-3.90, use the technology of your choice to determine and interpret the range and sample standard deviation for those data sets to which those concepts apply. If those concepts don't apply, explain why. Note: If an exercise contains more than one data set, perform the aforementioned tasks for each data set.

Ballot Preferences. In Issue 338 of the Amstat News, then president of the American Statistical Association, F. Scheuren reported the results of a survey on how members would prefer to receive ballots in annual elections. On the WeissStats site, you will find data for preference and highest degree obtained for the 566 respondents.

Obesity. Researchers in obesity wanted to compare the effectiveness of dieting with exercise against dieting without exercise. Seventy-three patients were randomly divided into two groups. Group 1. composed of 37patients, was put on a program of dieting with exercise. Group 2. composed of 36patients, dieted only. The results for weight loss, in pounds, after 2months are summarized in the following boxplots. The top boxplot is for Group 1. and the bottom boxplot is for Group 2. Use the boxplots to compare the weight losses for the two groups, paying special attention to center and variation.

Fill in the blanks: Roughly, when arranged in increasing order, the uppermost 25%of the dataset are greater than or equal to _______ .

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.