/*! This file is auto-generated */ .wp-block-button__link{color:#fff;background-color:#32373c;border-radius:9999px;box-shadow:none;text-decoration:none;padding:calc(.667em + 2px) calc(1.333em + 2px);font-size:1.125em}.wp-block-file__button{background:#32373c;color:#fff;text-decoration:none} Q84SE Consider a sample \({x_1},{x_2},... [FREE SOLUTION] | 91影视

91影视

Consider a sample \({x_1},{x_2},...,{x_n}\) with neven. Let \({\bar x_L}\) and \({\bar x_U}\) denote the average of the smallest n/2 and the largest n/2 observations, respectively. Show that the mean absolute deviation from the median for this sample satisfies

\(\sum {\left| {{x_i} - \tilde x} \right|} /n = \left( {{{\bar x}_U} - {{\bar x}_L}} \right)/2\)

Then show that if n is odd and the two averages are calculated after excluding the median from each half, replacing non the left with n-1 gives the correct result.(Hint: Break the sum into two parts, the first involving observations less than or equal to the median and the second involving observations greater than or equal to the median.)

Short Answer

Expert verified

\(\frac{{\sum\limits_{i = 1}^n {\left| {{X_i} - \tilde X} \right|} }}{n} = \frac{{\left( {{{\bar X}_U} - {{\bar X}_L}} \right)}}{2}\)

Step by step solution

01

Given information

A sample of size n is \({x_1},{x_2},...,{x_n}\)with n even.

\({\bar x_L}\) and \({\bar x_U}\) denote the average of the smallest n/2 and the largest n/2 observations, respectively.

02

Derivation of the proof

Let\(n = 2d\left( {even} \right)\).

The ascending of n observations is,

\({X_1},{X_2},...,{X_d},{X_{d + 1}},...,{X_{2d}}\).

The sample mean is given as,

\(\begin{aligned}{{\bar X}_L} &= \frac{1}{d}\sum\limits_{i = 1}^d {{X_i}} \\{{\bar X}_U} &= \frac{1}{d}\sum\limits_{i = 1}^d {{X_{d + i}}} \end{aligned}\)

The sample median is given as,

\(\tilde X = {X_{d + 1}}\)

Further calculations are computed as,

\(\begin{aligned}\frac{{n\left( {{{\bar X}_U} - {{\bar X}_L}} \right)}}{2} &= d\left( {{{\bar X}_U} - {{\bar X}_L}} \right)\\ &= \sum\limits_{i = 1}^d {{X_{d + i}}} - \sum\limits_{i = 1}^d {{X_i}} \\ &= \sum\limits_{i = 1}^d {\left( {{X_{d + i}} - {X_i}} \right)} \\ &= \sum\limits_{i = 1}^d {\left( {{X_{d + i}} - \tilde X + \tilde X - {X_i}} \right)} \,\;\;\;\;\;\left( {Adding\;and\;substracting\;median} \right)\\ &= \sum\limits_{i = 1}^d {\left| {{X_{d + i}} - \tilde X} \right|} + \sum\limits_{i = 1}^d {\left| {\tilde X - {X_i}} \right|} \\ &= \sum\limits_{i = 1}^{2d} {\left| {{X_{d + i}} - \tilde X} \right|} + \sum\limits_{i = 1}^d {\left| {{X_i} - \tilde X} \right|} \\ &= \sum\limits_{i = 1}^{2d} {\left| {{X_i} - \tilde X} \right|} \end{aligned}\)

Therefore,

\(\begin{aligned}\frac{{n\left( {{{\bar X}_U} - {{\bar X}_L}} \right)}}{2} = \sum\limits_{i = 1}^{2d} {\left| {{X_i} - \tilde X} \right|} \\ = \sum\limits_{i = 1}^n {\left| {{X_i} - \tilde X} \right|} \end{aligned}\)

Thus,

\(\frac{{\sum\limits_{i = 1}^n {\left| {{X_i} - \tilde X} \right|} }}{n} = \frac{{\left( {{{\bar X}_U} - {{\bar X}_L}} \right)}}{2}\)

For odd number of observations, \(n = 2d + 1\).

The ascending of n observations is,

\({X_1},{X_2},...,{X_d},{X_{d + 1}},...,{X_{2d + 1}}\).

The sample mean is given as,

\(\begin{aligned}{{\bar X}_L} &= \frac{1}{d}\sum\limits_{i = 1}^d {{X_i}} \\{{\bar X}_U} &= \frac{1}{d}\sum\limits_{i = 1}^d {{X_{d + 1 + i}}} \end{aligned}\)

The sample median is given as,

\(\tilde X = {X_{d + 1}}\)

The calculations are as follows,

\(\begin{aligned}\frac{{\left( {n - 1} \right) * \left( {{{\bar X}_U} - {{\bar X}_L}} \right)}}{2} &= d\left( {{{\bar X}_U} - {{\bar X}_L}} \right)\\ &= \sum\limits_{i = 1}^d {{X_{d + 1 + i}}} - \sum\limits_{i = 1}^d {{X_i}} \\ &= \sum\limits_{i = 1}^d {\left( {{X_{d + 1 + i}} - {X_i}} \right)} \\ &= \sum\limits_{i = 1}^d {\left( {{X_{d + 1 + i}} - \tilde X + \tilde X - {X_i}} \right)} \\ &= \sum\limits_{i = 1}^d {\left| {{X_{d + 1 + i}} - \tilde X} \right| + \sum\limits_{I = 1}^d {\left| {\tilde X - {X_i}} \right|} } \\ &= \sum\limits_{i = d + 2}^{2d + 1} {\left| {{X_i} - \tilde X} \right| + \sum\limits_{I = 1}^d {\left| {{X_i} - \tilde X} \right|} } \\ &= \sum\limits_{i = 1}^d {\left| {{X_i} - \tilde X} \right| + \left| {\tilde X - \tilde X} \right|} + \sum\limits_{i = d + 2}^{2d + 1} {\left| {{X_i} - \tilde X} \right|} \\ &= \sum\limits_{i = 1}^{2d + 1} {\left| {{X_i} - \tilde X} \right|} \end{aligned}\)

Therefore,

\(\begin{aligned}\frac{{\left( {n - 1} \right) * \left( {{{\bar X}_U} - {{\bar X}_L}} \right)}}{2} &= \sum\limits_{i = 1}^{2d + 1} {\left| {{X_i} - \tilde X} \right|} \\ &= \sum\limits_{i = 1}^n {\left| {{X_i} - \tilde X} \right|} \\ &= \frac{{\sum\limits_{i = 1}^n {\left| {{X_i} - \tilde X} \right|} }}{{n - 1}}\\ &= \frac{{{{\bar X}_U} - {{\bar X}_L}}}{2}\end{aligned}\)

Hence Proved.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with 91影视!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

The article 鈥淩eliability-Based Service-Life Assessment of Aging Concrete Structures鈥 (J. Structural Engr.,\({\rm{1993: 1600 - 1621}}\)) suggests that a Poisson process can be used to represent the occurrence of structural loads over time. Suppose the mean time between occurrences of loads is\({\rm{.5}}\)year. a. How many loads can be expected to occur during a\({\rm{2}}\)-year period? b. What is the probability that more than five loads occur during a\({\rm{2}}\)-year period? c. How long must a time period be so that the probability of no loads occurring during that period is at most\({\rm{.1}}\)?

The article cited in Exercise 20 also gave the following values of the variables y=number of culs-de-sac and z=number of intersections:

y

1

0

1

0

0

2

0

1

1

1

2

1

0

0

1

1

0

1

1

z

1

8

6

1

1

5

3

0

0

4

4

0

0

1

2

1

4

0

4

y

1

1

0

0

0

1

1

2

0

1

2

2

1

1

0

2

1

1

0

z

0

3

0

1

1

0

1

3

2

4

6

6

0

1

1

8

3

3

5

y

1

5

0

3

0

1

1

0

0

z

0

5

2

3

1

0

0

0

3

a. Construct a histogram for the ydata. What proportion of these subdivisions had no culs-de-sac? At least one cul-de-sac?

A Pareto diagram is a variation of a histogram forcategorical data resulting from a quality control study.Each category represents a different type of product non-conformity or production problem. The categories areordered so that the one with the largest frequencyappears on the far left, then the category with the secondlargest frequency, and so on. Suppose the following information on nonconformities in circuit packs isobtained: failed component, 126; incorrect component,210; insufficient solder, 67; excess solder, 54; missingcomponent, 131. Construct a Pareto diagram.

The three measures of center introduced in this chapter are the mean, median, and trimmed mean. Two additional measures of center that are occasionally used are the midrange,which is the average of the smallest and largest observations, and the midfourth,which is the average of the two fourths. Which of these five measures of center are resistant to the effects of outliers and which are not? Explain your reasoning.

The accompanying data set consists of observations,on shower-flow rate (L/min) for a sample of n=129,houses in Perth, Australia (鈥淎n Application of Bayes,Methodology to the Analysis of Diary Records in a

Water Use Study,鈥 J. Amer. Stat. Assoc., 1987: 705鈥711):

4.6 12.3 7.1 7.0 4.0 9.2 6.7 6.9 11.5 5.1

11.2 10.5 14.3 8.0 8.8 6.4 5.1 5.6 9.6 7.5

7.5 6.2 5.8 2.3 3.4 10.4 9.8 6.6 3.7 6.4

8.3 6.5 7.6 9.3 9.2 7.3 5.0 6.3 13.8 6.2

5.4 4.8 7.5 6.0 6.9 10.8 7.5 6.6 5.0 3.3

7.6 3.9 11.9 2.2 15.0 7.2 6.1 15.3 18.9 7.2

5.4 5.5 4.3 9.0 12.7 11.3 7.4 5.0 3.5 8.2

8.4 7.3 10.3 11.9 6.0 5.6 9.5 9.3 10.4 9.7

5.1 6.7 10.2 6.2 8.4 7.0 4.8 5.6 10.5 14.6

10.8 15.5 7.5 6.4 3.4 5.5 6.6 5.9 15.0 9.6

7.8 7.0 6.9 4.1 3.6 11.9 3.7 5.7 6.8 11.3

9.3 9.6 10.4 9.3 6.9 9.8 9.1 10.6 4.5 6.2

8.3 3.2 4.9 5.0 6.0 8.2 6.3 3.8 6.0

  1. Construct a stem-and-leaf display of the data.
  2. What is a typical, or representative, flow rate?
  3. Does the display appear to be highly concentrated or spread out?
  4. Does the distribution of values appear to be reasonably symmetric? If not, how would you describe the departure from symmetry?
  5. Would you describe any observation as being far from the rest of the data (an outlier)?
See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.