/*! This file is auto-generated */ .wp-block-button__link{color:#fff;background-color:#32373c;border-radius:9999px;box-shadow:none;text-decoration:none;padding:calc(.667em + 2px) calc(1.333em + 2px);font-size:1.125em}.wp-block-file__button{background:#32373c;color:#fff;text-decoration:none} Q48E Exercise 34 presented the follow... [FREE SOLUTION] | 91影视

91影视

Exercise 34 presented the following data on endotoxin concentration in settled dust both for a sample of urban homes and for a sample of farm homes:

U: 6.0 5.0 11.0 33.0 4.0 5.0 80.0 18.0 35.0 17.0 23.0

F: 4.0 14.0 11.0 9.0 9.0 8.0 4.0 20.0 5.0 8.9 21.0

9.2 3.0 2.0 0.3

a. Determine the value of the sample standard deviation for each sample, interpret these values, and then contrast variability in the two samples. (Hint:\(\sum {{x_i}} \)= 237.0 for the urban sample and =128.4 for the farm sample, and\(\sum {x_i^2} \)= 10,079 for the urban sample and 1617.94 for the farm sample.)

b. Compute the fourth spread for each sample and compare. Do the fourth spreads convey the same message about variability that the standard deviations do? Explain.

c. Construct a comparative boxplot (as did the cited paper)and compare and contrast the four samples.

Short Answer

Expert verified

a.

The sample standard deviation for urban homes is 22.3.

The sample standard deviation for the farm is 6.088.

b.

The fourth spread of x is 33.

The fourth spread of y is 7.

c. A comparative boxplot is represented as,

Step by step solution

01

Given information

The data on endotoxin concentration in settled dust both for a sample of urban homes and for a sample of farm homes are provided.

02

Computing the sample standard deviation

a.

Let x represents the concentration (EU/mg) in settled dust for one sample of urban homes.

Let y represent the concentration (EU/mg) in settled dust for another farm homes.

Referring to part a of Exercise 34,

The sample mean for one sample of urban homes is 21.5.

The sample mean for another sample is 8.6.

The sample standard deviation is computed as,

For x,

\(\begin{array}{c}{s_x} &=& \sqrt {\frac{{\sum {{{\left( {{x_i} - \bar x} \right)}^2}} }}{{{n_x} - 1}}} \\ &=& \sqrt {\frac{{{{\left( {6 - 21.5} \right)}^2} + {{\left( {5 - 21.5} \right)}^2} + ... + {{\left( {23 - 21.5} \right)}^2}}}{{11 - 1}}} \\ &=& 22.3\end{array}\)

For y,

\(\begin{array}{c}{s_y} &=& \sqrt {\frac{{\sum {{{\left( {{y_i} - \bar y} \right)}^2}} }}{{{n_y} - 1}}} \\ &=& \sqrt {\frac{{{{\left( {4 - 6.088} \right)}^2} + {{\left( {14 - 6.088} \right)}^2} + ... + {{\left( {0.3 - 6.088} \right)}^2}}}{{15 - 1}}} \\ &=& 6.088\end{array}\)

Therefore, the sample standard deviation for x and y are 22.3 and 6.088 respectively.

03

Computing the fourth spread

b.

The fourth spread is computed as,

\({f_s} = {\rm{Upper}}\,{\rm{fourth}} - {\rm{Lower}}\;{\rm{fourth}}\)

Where, the upper fourth is the median value of the largest half and the lower fourth is the median value of the smallest half.

The number of observations for x is 11 (odd).

The number of observations for y is 15 (odd).

Arranging the data in ascending order,

x

y

4

0.3

5

2

5

3

6

4

11

4

17

5

18

8

23

8.9

33

9

35

9

80

9.2


11


14


20


21

For x,

\(\begin{array}{l}{n_{smallest}} &=& 5\\{n_{{\rm{largest}}}} &=& 5\end{array}\)

The lower fourth is computed as,

\(\begin{array}{c}\frac{{{n_{smallest}} + 1}}{2} &=& \frac{{5 + 1}}{2}\\ &=& {3^{rd}}value\end{array}\)

The 3rd value of the smallest half is 5.

This implies that the lower fourth is 5.

The upper fourth is computed as,

\(\begin{array}{c}\frac{{{n_{{\rm{largest}}}} + 1}}{2} &=& \frac{{5 + 1}}{2}\\ &=& {3^{rd}}value\end{array}\)

The 3rd value of the largest half is 33.

This implies that the upper fourth is 33.

The fourth spread for the chamber cooler is computed as,

\(\begin{array}{c}{f_s} &=& {\rm{Upper}}\,{\rm{fourth}} - {\rm{Lower}}\;{\rm{fourth}}\\ &=& 33 - 5\\ &=& 28\end{array}\)

Thus, the fourth spread of x is 28.

For y,

\(\begin{array}{l}{n_{smallest}} &=& 7\\{n_{{\rm{largest}}}} &=& 7\end{array}\)

The lower fourth is computed as,

\(\begin{array}{c}\frac{{{n_{smallest}} + 1}}{2} &=& \frac{{7 + 1}}{2}\\ &=& {4^{th}}value\end{array}\)

The 4th value of the smallest half is 4.

This implies that the lower fourth is 4.

The upper fourth is computed as,

\(\begin{array}{c}\frac{{{n_{{\rm{largest}}}} + 1}}{2} &=& \frac{{7 + 1}}{2}\\ &=& {4^{th}}value\end{array}\)

The 4th value of the largest half is 11.

This implies that the upper fourth is 11.

The fourth spread for y is computed as,

\(\begin{array}{c}{f_s} &=& {\rm{Upper}}\,{\rm{fourth}} - {\rm{Lower}}\;{\rm{fourth}}\\ &=& 11 - 4\\ &=& 7\end{array}\)

Thus, the fourth spread of y is 7.

04

Construction a comparative boxplot and compare the four samples

c.

Let p represents the concentration (EU/mg) in dust bag dust for one sample of urban homes.

Let q represent the concentration (EU/mg) in dust bag dust for another farm homes.

The data arranged in ascending order can be represented as,

x

y

p

q

4

0.3

1

0.2

5

2

13

2

5

3

24

5

6

4

24

6

11

4

33

10

17

5

34

11

18

8

34

13

23

8.9

35

13

33

9

38

17

35

9

40

17

80

9.2

49

23


11

104

27


14


28


20


35


21


64

Referring to part b of Exercise 48, and part b of Exercise 34.

For x,

The lower fourth is 5.

The upper fourth is 33.

The smallest value is 4.

The largest value is 80.

The sample median is 17.

For y,

The lower fourth is 4.

The upper fourth is 11.

The smallest value is 0.3.

The largest value is 21.

The sample median is 8.9.

For p,

The smallest value is 1.

The largest value is 104.

For the even number of observations, the median value is computed as,

\(\begin{array}{c}\tilde p &=& average\;of\;{\left( {\frac{n}{2}} \right)^{th}}and\;{\left( {\frac{n}{2} + 1} \right)^{th}}\;ordered\;values\\ &=& average\;of\;{\left( {\frac{{12}}{2}} \right)^{th}}and\;{\left( {\frac{{12}}{2} + 1} \right)^{th}}\;ordered\;values\\ &=& average\;of\;{\left( 6 \right)^{th}}and\;{\left( 7 \right)^{th}}\;ordered\;values\\ &=& \frac{{34 + 34}}{2}\\ &=& 34\end{array}\)

Thus, the median value is 34.

For p,

\(\begin{array}{l}{n_{smallest}} &=& 3\\{n_{{\rm{largest}}}} &=& 3\end{array}\)

The lower fourth is 3rdvalue of the smallest half; that is, 24.

This implies that the lower fourth is 24.

The upper fourth is 3rd value of the largest half is 38.

This implies that the upper fourth is 38.

For q,

The smallest value is 0.2.

The largest value is 64.

For the odd number of observations, the median value is computed as,

\(\begin{array}{c}\tilde q &=& {\left( {\frac{{n + 1}}{2}} \right)^{th}}ordered\;value\\ &=& {\left( {\frac{{15 + 1}}{2}} \right)^{th}}ordered\;value\\ &=& {\left( 8 \right)^{th}}ordered\;value\end{array}\)

Thus, the median value of q is 13.

For q,

\(\begin{array}{l}{n_{smallest}} &=& 7\\{n_{{\rm{largest}}}} &=& 7\end{array}\)

The lower fourth is computed as,

\(\begin{array}{c}\frac{{{n_{smallest}} + 1}}{2} &=& \frac{{7 + 1}}{2}\\ &=& {4^{th}}value\end{array}\)

The 4th value of the smallest half is 6.

This implies that the lower fourth is 6.

The upper fourth is computed as,

\(\begin{array}{c}\frac{{{n_{{\rm{largest}}}} + 1}}{2} &=& \frac{{7 + 1}}{2}\\ &=& {4^{th}}value\end{array}\)

The 4th value of the largest half is 27.

This implies that the upper fourth is 27.

The steps to construct a boxplot are as follows,

1) Compute the values of the five-number summary (smallest value, lower fourth, median, upper fourth, largest value).

2) Construct a line segment from the smallest value to the largest value of the dataset.

3) Construct a rectangular box from the lower fourth to the upper fourth and draw a line in the box at the median value.

The boxplot is represented as,

Where blue boxplot represents x, pink represents y, dark blue represents p and green represents q data.

The features that can be observed are,

1) The outliers can be observed insettled dust for one sample of urban homes, in dust bag dust for one sample of urban homes, and in dust bag dust for another farm homes.

2) For x, the distribution is approximately negatively skewed.

For y, the distribution is positively skewed.

For p, the distribution is positively skewed.

For q, the distribution is approximately negatively skewed

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with 91影视!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Each customer making a particular Internet purchase must pay with one of three types of credit cards (think Visa, MasterCard, AmEx). Let \({{\rm{A}}_{\rm{i}}}{\rm{(i = 1,2,3)}}\) be the event that a type \({\rm{i}}\)credit card is used, with\({\rm{P}}\left( {{{\rm{A}}_{\rm{1}}}} \right){\rm{ = }}{\rm{.5}}\), \({\rm{P}}\left( {{{\rm{A}}_{\rm{2}}}} \right){\rm{ = }}{\rm{.3}}\), and\({\rm{P}}\left( {{{\rm{A}}_{\rm{3}}}} \right){\rm{ = }}{\rm{.2}}\). Suppose that the number of customers who make such a purchase on a given day is a Poisson \({\rm{rv}}\) with parameter\({\rm{\lambda }}\). Define \({\rm{rv}}\) 's \({{\rm{X}}_{{\rm{1,}}}}{{\rm{X}}_{\rm{2}}}{\rm{,}}{{\rm{X}}_{\rm{3}}}\)by \({{\rm{X}}_{\rm{i}}}{\rm{ = }}\)the number among the \({\rm{N}}\)customers who use a type \({\rm{i}}\)card\({\rm{(i = 1,2,3)}}\). Show that these three \({\rm{rv}}\) 's are independent with Poisson distributions having parameters\({\rm{.5\lambda ,}}{\rm{.3\lambda }}\), and\({\rm{.2\lambda }}\), respectively. (Hint: For non-negative integers\({{\rm{x}}_{\rm{1}}}{\rm{,}}{{\rm{x}}_{\rm{2}}}{\rm{,}}{{\rm{x}}_{\rm{3}}}\), let\({\rm{n = }}{{\rm{x}}_{\rm{1}}}{\rm{ + }}{{\rm{x}}_{\rm{2}}}{\rm{ + }}{{\rm{x}}_{\rm{3}}}\). Then\({\rm{P}}\left( {{{\rm{X}}_{\rm{1}}}{\rm{ = }}{{\rm{x}}_{\rm{1}}}} \right.\), \(\left. {{{\rm{X}}_{\rm{2}}}{\rm{ = }}{{\rm{x}}_{\rm{2}}}{\rm{,}}{{\rm{X}}_{\rm{3}}}{\rm{ = }}{{\rm{x}}_{\rm{3}}}} \right){\rm{ = P}}\left( {{{\rm{X}}_{\rm{1}}}{\rm{ = }}{{\rm{x}}_{\rm{1}}}{\rm{,}}{{\rm{X}}_{\rm{2}}}{\rm{ = }}{{\rm{x}}_{\rm{2}}}{\rm{,}}{{\rm{X}}_{\rm{3}}}{\rm{ = }}{{\rm{x}}_{\rm{3}}}{\rm{,N = n}}} \right)\) (why is this?). Now condition on\({\rm{N = n}}\), in which case the three \({\rm{Xi}}\) 's have a trinomial distribution (multinomial with three categories) with category probabilities\({\rm{.5,}}{\rm{.3}}\), and . \({\rm{2}}\).)

A sample of n=10 automobiles was selected, and eachwas subjected to a 5-mph crash test. Denoting a car withno visible damage by S (for success) and a car with suchdamage by F, results were as follows:

S S F S SS F F S S

  1. What is the value of the sample proportion of successes x/n?
  2. Replace each S with a 1 and each F with a 0. Then calculate for this numerically coded sample. How does compare to x/n?
  3. Suppose it is decided to include 15 more cars in the experiment. How many of these would have to be S鈥檚 to give x/n = .80 for the entire sample of 25 cars?

Zinfandel is a popular red wine varietal produced almostexclusively in California. It is rather controversialamong wine connoisseurs because its alcohol contentvaries quite substantially from one producer to another.In May 2013, the author went to the website klwines.com, randomly selected 10 zinfandels from among the 325 available, and obtained the following values of alcoholcontent (%):

14.8 14.5 16.1 14.2 15.9

13.7 16.2 14.6 13.8 15.0

  1. Calculate and interpret several measures of center.
  2. Calculate the sample variance using the defining formula.
  3. Calculate the sample variance using the short cut formula after subtracting 13 from each observation

Grip is applied to produce normal surface forces that compress the object being gripped. Examples include two people shaking hands, or a nurse squeezing a patient鈥檚 forearm to stop bleeding. The article 鈥淚nvestigation of Grip Force, Normal Force, Contact Area, Hand Size, and Handle Size for Cylindrical Handles鈥 (Human Factors, 2008: 734鈥744) included the following data on grip strength (N) for a sample of 42

individuals:

16 18 18 26 33 41 54 56 66 68 87 91 95

98 106 109 111 118 127 127 135 145 147 149 151 168

172 183 189 190 200 210 220 229 230 233 238 244 259

294 329 403

a. Construct a stem-and-leaf display based on repeating each stem value twice, and comment on interesting features.

b. Determine the values of the fourths and the fourth spread.

c. Construct a boxplot based on the five-number summary, and comment on its features.

d. How large or small does an observation have to be to qualify as an outlier? An extreme outlier? Are there any outliers?

e. By how much could the observation 403, currently the largest, be decreased without affecting\({f_s}\)?

Lengths of bus routes for any particular transit system will typically vary from one route to another. The article 鈥淧lanning of City Bus Routes鈥 (J. of the Institution of Engineers, 1995: 211鈥215) gives the following information on lengths (km) for one particular system:

Length6-<8 8-<10 10-<12 12-<14 14-<16

Frequency6 23 30 35 32

Length16-<18 18-<20 20-<22 22-<24 24-<26

Frequency48 42 40 28 27

Length26-<28 28-<30 30-<35 35-<40 40-<45

Frequency26 14 27 11 2

a. Draw a histogram corresponding to these frequencies.

b. What proportion of these route lengths are less than 20? What proportion of these routes have lengths of at least 30?

c. Roughly what is the value of the 90th percentile of the route length distribution?

d. Roughly what is the median route length?

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.