/*! This file is auto-generated */ .wp-block-button__link{color:#fff;background-color:#32373c;border-radius:9999px;box-shadow:none;text-decoration:none;padding:calc(.667em + 2px) calc(1.333em + 2px);font-size:1.125em}.wp-block-file__button{background:#32373c;color:#fff;text-decoration:none} Problem 6 The accompanying data on number ... [FREE SOLUTION] | 91Ó°ÊÓ

91Ó°ÊÓ

The accompanying data on number of minutes used for cell phone calls in one month was generated to be consistent with summary statistics published in a report of a marketing study of San Diego residents (TeleTruth, March 2009): $$ \begin{array}{rrrrrrrrrr} 189 & 0 & 189 & 177 & 106 & 201 & 0 & 212 & 0 & 306 \\ 0 & 0 & 59 & 224 & 0 & 189 & 142 & 83 & 71 & 165 \\ 236 & 0 & 142 & 236 & 130 & & & & & \end{array} $$ a. Would you recommend the mean or the median as a measure of center for this data set? Give a brief explanation of your choice. (Hint: It may help to look at a graphical display of the data.) b. Compute a trimmed mean by deleting the three smallest observations and the three largest observations in the data set and then averaging the remaining 19 observations. What is the trimming percentage for this trimmed mean? c. What trimming percentage would you need to use in order to delete all of the 0 minute values from the data set? Would you recommend a trimmed mean with this trimming percentage? Explain why or why not.

Short Answer

Expert verified
The median would be a more appropriate measure of center for this data set due to presence of outliers. The trimmed mean calculated by removing smallest and largest three observations from the dataset is found by taking average of remaining 19 values. The trimming percentage for this calculation would be 24%. On the other hand, if a trimmed mean that excludes all zero values is desired, the trimming percentage will be 28%. The decision to recommend such trimming percentage will depend on the balance of excluding extremes and retaining sufficient data.

Step by step solution

01

Determine Measure of Center

To determine which measure of center is more appropriate, i.e., the mean or median, one must examine the distribution of the data. However, from the data provided, it can be observed that there are a significant number of outliers (0 values) which would skew the mean towards zero. Therefore, the median would be a more accurate measure of center for this data set.
02

Compute a Trimmed Mean

Trimmed mean can be calculated by deleting the three smallest observations and the three largest observations in the data set and then averaging the remaining 19 observations. When sorted in increasing order, the smallest observations are {0, 0, 0} and largest are {236, 236, 306}. Deleting these, the remaining data set becomes: \[ \{59, 71, 83, 106, 130, 142, 142, 165, 177, 189, 189, 189, 201, 212, 224\} \] The trimmed mean then becomes the average of these remaining values which can be calculated as the sum of these values divided by number of these values.
03

Calculate trimming percentage

The trimming percentage represents the proportion of observations removed from each end. For the trimmed mean calculated in step 2, we have removed 3 from each end out of total 25 observations. So, the trimming percentage would be \((3+3)/25*100 = 24\%\).
04

Trimmed mean by excluding 0 values

To calculate a trimmed mean by excluding all of the 0-minute values from the data set, one would need to count the number of 0 values present and consider them for trimming. This leads to 7 zero values in the data set of 25, the trimming percentage becomes \((7/25) * 100 = 28\%\). However, the decision to recommend this trimming percentage will depend on the context and desired balance between excluding extremes (0 minute values in this case) and retaining enough data to have a meaningful result.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with 91Ó°ÊÓ!

Key Concepts

These are the key concepts you need to understand to accurately answer the question.

Measure of Center
In descriptive statistics, the measure of center is a value that is used to indicate the central point of a data set. It helps us understand what a typical data value looks like. The two most popular measures of center are the mean and the median.
The **mean** is calculated by adding up all the values and dividing by the number of values. It represents an average, but can be heavily influenced by extremely high or low numbers.
The **median**, on the other hand, is the middle value when the data is sorted from smallest to largest. It divides the data set into two equal halves and is less affected by outliers or skewed data.
In the presence of outliers or non-normal data distribution, the median can provide a more accurate depiction of the data's center than the mean.
Trimmed Mean
The trimmed mean is a method used to calculate the mean while minimizing the effect of outliers. This process involves removing a certain percentage of the lowest and highest values before calculating the mean. It is especially useful when dealing with skewed data or outliers that might distort the average.
To calculate the trimmed mean, follow these steps:
  • Sort the data from smallest to largest.
  • Decide on the number of observations to remove from each end.
  • Remove these observations and calculate the mean of the remaining data.
The trimming percentage is determined by the proportion of data points removed from the dataset. For example, if you remove three observations from each end of a 25-point data set, the trimming percentage is 24%.
Outliers
Outliers are data points that significantly differ from the majority of a data set. They can be much higher or lower than other entries and often result from variability in the data, measurement errors, or unusual occurrences.
Outliers can skew your data and give misleading interpretations when calculating measures like the mean. It’s essential to identify outliers so that you can decide whether they should be included in your analysis or possibly treated separately from the rest of the data.
The median or trimmed mean are often preferred measures of center in datasets with outliers because they are less sensitive to extreme values.
Data Distribution
Data distribution is the way in which data values are spread across a range. It provides an overall picture of the pattern formed by your data.
A few common types of distributions include:
  • **Normal Distribution**: Data that forms a symmetric, bell-shaped curve.
  • **Skewed Distribution**: Data is asymmetrical and can be skewed left or right, often due to outliers.
  • **Uniform Distribution**: Data is evenly spread across the range, with no visible peaks.
Recognizing the type of data distribution is crucial as it affects the choice of statistical methods. For instance, the mean works well with normal distributions, while the median or trimmed mean is often better for skewed distributions with outliers.

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

The article "Rethink Diversification to Raise Retums, Cut Risk" (San Luis Obispo Tribune, January 21, 2006 ) included the following paragraph: In their research, Mulvey and Reilly compared the results of two hypothetical portfolios and used actual data from 1994 to 2004 to see what returns they would achieve. The first portfolio invested in Treasury bonds, domestic stocks, international stocks, and cash. Its 10 -year average annual return was \(9.85 \%\) and its volatilitymeasured as the standard deviation of annual returns-was \(9.26 \%\). When Mulvey and Reilly shifted some assets in the portfolio to include funds that invest in real estate, commodities, and options, the 10 -year return rose to \(10.55 \%\) while the standard deviation fell to \(7.97 \%\). In short, the more diversified portfolio had a slightly better return and much less risk. Explain why the standard deviation is a reasonable measure of volatility and why it is reasonable to interpret a smaller standard deviation as meaning less risk.

Because some homes have selling prices that are much higher than most, the median price is usually used to describe a "typical" home price for a given location. The three accompanying quotes are all from the San Luis Obispo Tribune, but each gives a different interpretation of the median price of a home in San Luis Obispo County. Comment on each of these statements. (Look carefully. At least one of the statements is incorrect.) a. "So we have gone from \(23 \%\) to \(27 \%\) of county residents who can afford the median priced home at \(\$ 278,380\) in SLO County. That means that half of the homes in this county cost less than \(\$ 278,380\) and half cost more." (October 11,2001\()\) b. "The county's median price rose to \(\$ 285,170\) in the fourth quarter, a \(9.6 \%\) increase from the same period a year ago, the report said. (The median represents the midpoint of a range.)" (February 13,2002 ) c. "Your median is going to creep up above \(\$ 300,000\) if there is nothing available below \(\$ 300,000\),' Walker said." (February 26, 2002)

The accompanying data on annual maximum wind speed (in meters per second) in Hong Kong for each year in a 45 -year period were given in an article that appeared in the journal Renewable Energy (March 2007). Use the annual maximum wind speed data to construct a boxplot. Is the boxplot approximately symmetric? \(\begin{array}{lllllllll}30.3 & 39.0 & 33.9 & 38.6 & 44.6 & 31.4 & 26.7 & 51.9 & 31.9 \\ 27.2 & 52.9 & 45.8 & 63.3 & 36.0 & 64.0 & 31.4 & 42.2 & 41.1 \\\ 37.0 & 34.4 & 35.5 & 62.2 & 30.3 & 40.0 & 36.0 & 39.4 & 34.4 \\ 28.3 & 39.1 & 55.0 & 35.0 & 28.8 & 25.7 & 62.7 & 32.4 & 31.9 \\ 37.5 & 31.5 & 32.0 & 35.5 & 37.5 & 41.0 & 37.5 & 48.6 & 28.1\end{array}\)

Age at diagnosis for each of 20 patients under treatment for meningitis was given in the paper "Penidlin in the Treatment of Meningitis" (journal of the American Medical Association [1984]: \(1870-1874\) ). The ages (in years) were as follows: $$ \begin{array}{lllllllllll} 8 & 25 & 19 & 23 & 20 & 69 & 18 & 21 & 18 & 20 & 18 \\ 0 & 18 & 19 & 28 & 17 & 18 & 18 & & & & \end{array} $$ \(\begin{array}{ll}18 & 18 \\ 18 & 20\end{array}\) a. Calculate the values of the sample mean and the standard deviation. b. Calculate the \(10 \%\) trimmed mean. How does the value of the trimmed mean compare to that of the sample mean? Which would you recommend as a measure of center? Explain. c. Compute the upper quartile, the lower quartile, and the interquartile range. d. Are there any mild or extreme outliers present in this data set? e. Construct the boxplot for this data set.

The report "Who Moves? Who Stays Put? Where's Home?" (Pew Social and Demographic Trends, December 17,2008 ) gave the accompanying data for the 50 U.S. states on the percentage of the population that was born in the state and is still living there. The data values have been arranged in order from largest to smallest. \(\begin{array}{lllllllllll}75.8 & 71.4 & 69.6 & 69.0 & 68.6 & 67.5 & 66.7 & 66.3 & 66.1 & 66.0 & 66.0\end{array}\) \(\begin{array}{lllllllllll}65.1 & 64.4 & 64.3 & 63.8 & 63.7 & 62.8 & 62.6 & 61.9 & 61.9 & 61.5 & 61.1\end{array}\) \(\begin{array}{lllllllllll}59.2 & 59.0 & 58.7 & 57.3 & 57.1 & 55.6 & 55.6 & 55.5 & 55.3 & 54.9 & 54.7\end{array}\) \(\begin{array}{lllllllllll}54.5 & 54.0 & 54.0 & 53.9 & 53.5 & 52.8 & 52.5 & 50.2 & 50.2 & 48.9 & 48.7\end{array}\) \(\begin{array}{llllll}48.6 & 47.1 & 43.4 & 40.4 & 35.7 & 28.2\end{array}\) a. Find the values of the median, the lower quartile, and the upper quartile. b. The two smallest values in the data set are \(28.2\) (Alaska) and \(35.7\) (Wyoming). Are these two states outliers? c. Construct a boxplot for this data set and comment on the interesting features of the plot.

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.