/*! This file is auto-generated */ .wp-block-button__link{color:#fff;background-color:#32373c;border-radius:9999px;box-shadow:none;text-decoration:none;padding:calc(.667em + 2px) calc(1.333em + 2px);font-size:1.125em}.wp-block-file__button{background:#32373c;color:#fff;text-decoration:none} Problem 31 In Superbowl XXXVII, Michael Pit... [FREE SOLUTION] | 91Ó°ÊÓ

91Ó°ÊÓ

In Superbowl XXXVII, Michael Pittman of Tampa Bay rushed (ran with the football) 17 times on first down, and the results were the following gains in yards: \(\begin{array}{lllllllll}23 & 1 & 4 & 1 & 6 & 5 & 9 & 6 & 2 \\ -1 & 3 & 2 & 0 & 2 & 24 & 1 & 1 & \end{array}\) a. Determine the value of the sample mean. b. Determine the value of the sample median. Why is it so different from the mean? c. Calculate a trimmed mean by deleting the smallest and largest observations. What is the corresponding trimming percentage? How does the value of this \(\bar{x}_{r r}\) compare to the mean and median?

Short Answer

Expert verified
a. Mean: 5.18 b. Median: 2 (Mean is higher due to extreme values) c. Trimmed Mean: 4.4; 11.76% trimming

Step by step solution

01

Calculate the Sample Mean

To find the sample mean, sum all the data points and then divide by the number of data points. The data points are:\(23, 1, 4, 1, 6, 5, 9, 6, 2, -1, 3, 2, 0, 2, 24, 1, 1\).Firstly, sum these values: \(23 + 1 + 4 + 1 + 6 + 5 + 9 + 6 + 2 - 1 + 3 + 2 + 0 + 2 + 24 + 1 + 1 = 88\).Next, divide the total by the number of observations (17):\(\text{Mean} = \frac{88}{17} \approx 5.18\).
02

Find the Sample Median

To find the median, first list the numbers in ascending order:\(-1, 0, 1, 1, 1, 1, 2, 2, 2, 3, 4, 5, 6, 6, 9, 23, 24\).With 17 observations, the median is the 9th value:Thus, the Median is 2.
03

Discuss Difference Between Mean and Median

The mean (\(5.18\)) is larger than the median (2) because the mean is affected by extreme values, such as 24 and 23, whereas the median is not. The sizeable values raise the mean, while the median remains centrally located among sorted values.
04

Calculate Trimmed Mean

To find the trimmed mean, remove the smallest and largest data points: -1 and 24.Remaining observations: \(0, 1, 1, 1, 1, 2, 2, 2, 3, 4, 5, 6, 6, 9, 23\).Sum these:\(0 + 1 + 1 + 1 + 1 + 2 + 2 + 2 + 3 + 4 + 5 + 6 + 6 + 9 + 23 = 66\).Total observations now: 15 (since 2 are removed).Trimmed mean: \(\text{Trimmed Mean} = \frac{66}{15} = 4.4\).The trimming percentage is \(\frac{2}{17} \times 100\% \approx 11.76\%\).The trimmed mean (4.4) is less than the mean and closer to the median due to reduced influence of extreme values.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with 91Ó°ÊÓ!

Key Concepts

These are the key concepts you need to understand to accurately answer the question.

Sample Mean
The sample mean, often referred to as the arithmetic mean, is a measure that sums up all the data values and divides by the count of values. This gives us the average value in the data set. In this exercise, Michael Pittman's 17 rushing attempts resulted in yard gains that we summed (88 yards) and divided by the number of attempts (17). Thus, the sample mean is calculated as: \[\text{Sample Mean} = \frac{88}{17} \approx 5.18\]
  • The mean provides a central value that represents the data.
  • It is sensitive to all data points, especially outliers.
The sample mean of 5.18 yards suggests that, on average, Pittman gained approximately 5.18 yards per rush on first down during the game.
Sample Median
The sample median is the middle value of a data set arranged in ascending order. It divides the data into two equal halves, offering a measure of central tendency that is less affected by outliers. In Pittman’s data, when arranged from lowest to highest value, the 9th value in the sequence is 2. Thus, the median is:
  • Median: 2
  • It is less influenced by extreme values compared to the mean.
  • Represents a central point that is not skewed by anomalies.
This difference with the sample mean demonstrates how one outstanding or low value can shift the mean, while the median stays firmly representative of the majority of the data.
Trimmed Mean
The trimmed mean is a method to calculate an average that reduces the influence of extreme values or outliers by excluding a fixed percentage of the smallest and largest data points. By removing these outliers, we can achieve a more robust mean that reflects the central tendency without the skewing effect of extremes.In Pittman's case:
  • Remove the smallest (-1) and largest (24) values.
  • The remaining values were summed to get 66, with 15 observations left.
  • Trimmed Mean: \( \frac{66}{15} = 4.4 \)
This 11.76% trimming yields a mean (\(4.4\)) that lies closer to the median and less than the untrimmed sample mean, highlighting its immunity to outliers.
Outliers Impact
Outliers are data points significantly distant from others in the dataset. They can skew certain measures of central tendency like the mean but have negligible effect on the median. Examining Pittman's rushing yards, gains like -1 and 24 clearly stand out as outliers.
  • Outliers inflate or deflate the sample mean, making it an unreliable measure for central tendency if present.
  • The presence of 23 and 24 increased the calculated mean from potential values around the median.
  • Modern statistical analysis often involves methods like trimming or using the median to manage outlier impacts.
Understanding outliers is crucial in descriptive statistics to ensure comprehensive data analysis and interpretation.
Descriptive Statistics
Descriptive statistics aim to summarize or describe characteristics of a dataset. They offer insights into the dataset's main features through numerical or graphical representations. Key tools include:
  • Mean: Provides a measure of central tendency, affected by outliers.
  • Median: Centre point of the dataset, resilient against outliers.
  • Range: Difference between the maximum and minimum values, highlighting data spread.
  • Variance and Standard Deviation: Measures of data dispersion or variability.
Using descriptive statistics, such as sample mean and median, one can better understand patterns and tendencies in datasets like Pittman's rushing attempts, assisting in identifying strategies or performance evaluations.

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

The accompanying data set consists of observations on shear strength (lb) of ultrasonic spot welds made on a type of alclad sheet. Construct a relative frequency histogram based on ten equalwidth classes with boundaries \(4000,4200, \ldots .\) [The histogram will agree with the one in "Comparison of Properties of Joints Prepared by Ultrasonic Welding and Other Means" (J. Aircraft, 1983: 552-556).] Comment on its features. \(\begin{array}{lllllll}5434 & 4948 & 4521 & 4570 & 4990 & 5702 & 5241 \\ 5112 & 5015 & 4659 & 4806 & 4637 & 5670 & 4381 \\ 4820 & 5043 & 4886 & 4599 & 5288 & 5299 & 4848 \\ 5378 & 5260 & 5055 & 5828 & 5218 & 4859 & 4780 \\ 5027 & 5008 & 4609 & 4772 & 5133 & 5095 & 4618 \\ 4848 & 5089 & 5518 & 5333 & 5164 & 5342 & 5069 \\ 4755 & 4925 & 5001 & 4803 & 4951 & 5679 & 5256 \\ 5207 & 5621 & 4918 & 5138 & 4786 & 4500 & 5461 \\ 5049 & 4974 & 4592 & 4173 & 5296 & 4965 & 5170 \\ 4740 & 5173 & 4568 & 5653 & 5078 & 4900 & 4968 \\ 5248 & 5245 & 4723 & 5275 & 5419 & 5205 & 4452 \\ 5227 & 5555 & 5388 & 5498 & 4681 & 5076 & 4774 \\ 4931 & 4493 & 5309 & 5582 & 4308 & 4823 & 4417 \\ 5364 & 5640 & 5069 & 5188 & 5764 & 5273 & 5042 \\ 5189 & 4986 & & & & & \end{array}\)

The accompanying data set consists of observations on shower-flow rate (L/min) for a sample of \(n=129\) houses in Perth, Australia ("An Application of Bayes Methodology to the Analysis of Diary Records in a Water Use Study," J. Amer. Statist. Assoc., 1987: 705-711): \(\begin{array}{rrrrrrrrrr}4.6 & 12.3 & 7.1 & 7.0 & 4.0 & 9.2 & 6.7 & 6.9 & 11.5 & 5.1 \\ 11.2 & 10.5 & 14.3 & 8.0 & 8.8 & 6.4 & 5.1 & 5.6 & 9.6 & 7.5 \\\ 7.5 & 6.2 & 5.8 & 2.3 & 3.4 & 10.4 & 9.8 & 6.6 & 3.7 & 6.4 \\ 8.3 & 6.5 & 7.6 & 9.3 & 9.2 & 7.3 & 5.0 & 6.3 & 13.8 & 6.2 \\ 5.4 & 4.8 & 7.5 & 6.0 & 6.9 & 10.8 & 7.5 & 6.6 & 5.0 & 3.3 \\ 7.6 & 3.9 & 11.9 & 2.2 & 15.0 & 7.2 & 6.1 & 15.3 & 18.9 & 7.2 \\ 5.4 & 5.5 & 4.3 & 9.0 & 12.7 & 11.3 & 7.4 & 5.0 & 3.5 & 8.2 \\ 8.4 & 7.3 & 10.3 & 11.9 & 6.0 & 5.6 & 9.5 & 9.3 & 10.4 & 9.7 \\ 5.1 & 6.7 & 10.2 & 6.2 & 8.4 & 7.0 & 4.8 & 5.6 & 10.5 & 14.6 \\ 10.8 & 15.5 & 7.5 & 6.4 & 3.4 & 5.5 & 6.6 & 5.9 & 15.0 & 9.6 \\ 7.8 & 7.0 & 6.9 & 4.1 & 3.6 & 11.9 & 3.7 & 5.7 & 6.8 & 11.3 \\ 9.3 & 9.6 & 10.4 & 9.3 & 6.9 & 9.8 & 9.1 & 10.6 & 4.5 & 6.2 \\ 8.3 & 3.2 & 4.9 & 5.0 & 6.0 & 8.2 & 6.3 & 3.8 & 6.0 & \end{array}\) a. Construct a stem-and-leaf display of the data. b. What is a typical, or representative, flow rate? c. Does the display appear to be highly concentrated or spread out? d. Does the distribution of values appear to be reasonably symmetric? If not, how would you describe the departure from symmetry? e. Would you describe any observation as being far from the rest of the data (an outlier)?

Lengths of bus routes for any particular transit system will typically vary from one route to another. The article "Planning of City Bus Routes" (J. Institut. Engrs., 1995: 211-215) gives the following information on lengths \((\mathrm{km})\) for one particular system: \(\begin{array}{lccccc}\text { Length } & 6-8 & 8-10 & 10-12 & 12-14 & 14-16 \\\ \text { Freq. } & 6 & 23 & 30 & 35 & 32 \\ \text { Length } & 16-18 & 18-20 & 20-22 & 22-24 & 24-26 \\ \text { Freq. } & 48 & 42 & 40 & 28 & 27 \\\ \text { Length } & 26-28 & 28-30 & 30-35 & 35-40 & 40-45 \\ \text { Freq. } & 26 & 14 & 27 & 11 & 2\end{array}\) a. Draw a histogram corresponding to these frequencies. b. What proportion of these route lengths are less than \(20 ?\) What proportion of these routes have lengths of at least 30 ? c. Roughly what is the value of the 90 th percentile of the route length distribution? d. Roughly what is the median route length?

Consider a sample \(x_{1}, x_{2}, \ldots, x_{n}\) and suppose that the values of \(\vec{x}, s^{2}\), and \(s\) have been calculated. a. Let \(y_{i}=x_{i}-\bar{x}\) for \(i=1, \ldots, n\). How do the values of \(s^{2}\) and \(s\) for the \(y_{i}\) 's compare to the corresponding values for the \(x_{i}\) 's? Explain. b. Let \(z_{i}=\left(x_{i}-\bar{x}\right) / s\) for \(i=1, \ldots, n\). What are the values of the sample variance and sample standard deviation for the \(z_{i}\) 's?

The amount of flow through a solenoid valve in an automobile's pollution- control system is an important characteristic. An experiment was carried out to study how flow rate depended on three factors: armature length, spring load, and bobbin depth. Two different levels (low and high) of each factor were chosen, and a single observation on flow was made for each combination of levels. a. The resulting data set consisted of how many observations? b. Does this study involve sampling an existing population or a conceptual population?

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.