/*! This file is auto-generated */ .wp-block-button__link{color:#fff;background-color:#32373c;border-radius:9999px;box-shadow:none;text-decoration:none;padding:calc(.667em + 2px) calc(1.333em + 2px);font-size:1.125em}.wp-block-file__button{background:#32373c;color:#fff;text-decoration:none} Q117E Sanitation inspection of cruise ... [FREE SOLUTION] | 91Ó°ÊÓ

91Ó°ÊÓ

Sanitation inspection of cruise ships. Refer to Exercise 2.81 (p. 111) and the data on the sanitation levels of passenger cruise ships.

  1. Use the box plot method to detect any outliers in the data set.
  2. Use the z-score method to detect any outliers in the data set.
  3. Do the two methods agree? If not, explain why

Short Answer

Expert verified
  1. 105.5

  2. This approach considers any point with a z-score more than 2 or less than -2 an outlier.

  3. As a result, no outliers are discovered using the z-score approach.

Step by step solution

01

(a) Box plot method

We find the 5-point summary, as given below, and afterward calculate the IQR as well as outlier boundaries.

The five-point summary:

min=691stquartile=93median=963rdquartile=98max=100

skewness=3mean-medianσ=394.4409-965.3352=0.8766869

IQR=Q3-Q1

=98-93

=5

To find the outlier:

lowerlimit=Q1-1.5IQR=93-(1.5×5)=85.5Higherlimit=Q1+1.5IQR=93+(1.5×5)=105.5

Thus, any worth less than 85.5 or more than 105.5 is an outlier.

As we can see, some points are less than 85.5 and thus are considered outliers.

02

(b) z-score method

As illustrated, we compute the z-score for every observation.

This approach considers any point with a z-score more than 2 or less than -2 an outlier.

However, no outlier is discovered using this approach because no point has a z-score more than or even less than 2.

03

(c) Two methods

In the box plot approach, they only examine the centre 50% of the information to calculate the IQR, which is used as a measurement of the information's dispersion. When we plot the data as a histogram, we see that it is skewed to the left. As a result, employing the median as a central tendency measurement is more suitable in this circumstance.

One utilized the standard deviation and mean from the complete dataset in the z-score approach. Both are heavily influenced by outlier estimates and thus tend to distort the standard deviation and mean readings. As a result, no outliers are discovered using the z-score approach.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with 91Ó°ÊÓ!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Made-to-order delivery times.Refer to the data on delivery times for a made-to-order product, Exercise 2.34 (p. 87). The delivery times (in days) for a sample of 25 orders are repeated in the accompanying table. (Times marked by an asterisk are associated with customers who subsequently placed additional orders with the company.) Identify any unusual observations (outliers) in the data set, and then use the results to comment on the claim that repeat customers tend to have shorter delivery times than one-time customers.

50* 64* 56* 43* 64* 82* 65* 49* 32* 63* 44* 71 54* 51* 102 49* 73* 50* 39* 86 33* 95 59* 51* 68

Question: Hourly road accidents in India.An analysis of road accident data in India was undertaken, and the results were published in the Journal of Big Data(Vol. 2, 2015). For a particular cluster of roads, the hourly numbers of accidents totaled over a recent 5-year period are listed in the next table. (These results are adapted from a figure in the journal article.) Create a scatterplot of the data, with a number of accidents on the vertical axis and hours on the horizontal axis. What type of trend (if any) do you detect in the data?

Hour

Number

1

2

3

4

5

6

7

8

9

10

11

12

125

148

159

270

281

295

302

216

225

353

400

327

Question: State SAT scores.Refer to Exercise 2.27 (p. 84) and the data on state SAT scores. Construct a scatterplot for the data, with the 2010 Math SAT score on the horizontal axis and the 2014 Math SAT score on the vertical axis. What type of trend do you detect?

State

2010 Math SAT

2014 Math SAT

Alabama

538

550

Alaska

503

513

Arizona

527

524

Arkansa

571

564

California

510

516

Wisconsin

608

603

Wyoming

599

565

U.S. business bankruptcies.The American Bankruptcy Institute and the National Bankruptcy Research Center monitor the number of business bankruptcy filings each quarter. The table below lists the number of business bankruptcy filings for each quarter of a recent 2-year period. FILING

a.Explain why the data in the table represent time series data.

b.Construct a time series plot for the quarterly number of bankruptcy filings.

c.Do you detect a trend in the time series plot? Explain.

Year

Quarter

Number of

Bankruptcies

2011

1

2

3

4

12,376

12,304

11,705

11,149

2012

1

2

3

4

10,998

10,374

9,248

9,231

Consider the following three measurements: 0, 4, and 12. Find the z-score for each measurement if they are from a population with the following mean and standard deviation equal to

a.µ = 2 and σ = 1

b.µ = 4 and σ = 2

c.µ = 8 and σ = 2

d.µ = 8 and σ = 8

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.