/*! This file is auto-generated */ .wp-block-button__link{color:#fff;background-color:#32373c;border-radius:9999px;box-shadow:none;text-decoration:none;padding:calc(.667em + 2px) calc(1.333em + 2px);font-size:1.125em}.wp-block-file__button{background:#32373c;color:#fff;text-decoration:none} Problem 59 Suppose that the distribution of... [FREE SOLUTION] | 91Ó°ÊÓ

91Ó°ÊÓ

Suppose that the distribution of weekly water usage for single-family homes in a particular city is mound shaped and approximately symmetric. The mean is 1400 gallons, and the standard deviation is 300 gallons. a. What is the approximate value of the 16 th percentile? b. What is the approximate value of the median? c. What is the approximate value of the 84 th percentile?

Short Answer

Expert verified
a) The approximate value of the 16th percentile is 1100 gallons. b) The approximate value of the median is 1400 gallons. c) The approximate value of the 84th percentile is 1700 gallons.

Step by step solution

01

Pick the closest percentage from the empirical rule

As 16 is closest to 15.85 (which is found by subtracting 34 from 50), we know that the 16th percentile is somewhere between the mean minus one standard deviation (\(mean - \sigma\)) and the mean.
02

Estimate the 16th percentile

Since the 16th percentile is close to the mean minus one standard deviation (\(mean - \sigma\)), we calculate the range: \(1400 - 300 = 1100\) The approximate value of the 16th percentile is 1100 gallons. b) Finding the median:
03

Use symmetric property

Since the given distribution is approximately symmetric, the median and the mean are the same. The approximate value of the median in this case is 1400 gallons. c) Finding the 84th percentile: To find the 84th percentile, we again use the closest percentage from the empirical rule to estimate the value.
04

Pick the closest percentage from the empirical rule

As 84 is closest to 84.15 (which is found by adding 34 to 50), we know that the 84th percentile is somewhere between the mean and the mean plus one standard deviation (\(mean + \sigma\)).
05

Estimate the 84th percentile

Since the 84th percentile is close to the mean plus one standard deviation (\(mean + \sigma\)), we calculate the range: \(1400 + 300 = 1700\) The approximate value of the 84th percentile is 1700 gallons.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with 91Ó°ÊÓ!

Key Concepts

These are the key concepts you need to understand to accurately answer the question.

Percentile Calculation
Percentile calculation is an essential concept to understand when dealing with data distributions. It helps determine where a particular score or value lies in relation to the entire dataset. In simple terms, the percentile of a value is the percentage of data points in a dataset that are below that particular value. To calculate percentiles, especially in a symmetrical distribution, it is helpful to understand the empirical rule (also known as the 68-95-99.7 rule). This rule provides a quick way to estimate where certain percentiles lie based on the properties of the normal distribution. In the problem, for instance, the 16th percentile can be approximated using the mean minus one standard deviation:
  • The mean is 1400 gallons.
  • The standard deviation is 300 gallons.
  • Using the empirical rule, the 16th percentile is approximately one standard deviation below the mean.
Calculating this, we find the 16th percentile is approximately at 1100 gallons, making it lower compared to most data points.
Symmetric Distribution
A symmetric distribution is one where data is evenly distributed on both sides of the mean. Think of it like a mirror image; both halves of the distribution are identical. This characteristic is crucial because it greatly influences how we view concepts like the median and percentiles. In a symmetric distribution, like the one described in the problem, the mean and the median are equal. This is because exactly half of the data lies on each side of the mean. So if you were to divide the data down the middle, you would find the median right at the mean, which is 1400 gallons in this scenario. This symmetry property simplifies calculations significantly, as evidenced by our ability to deduce the value of the median without further computation, once we know the mean. Understanding symmetric distributions can also help in visualizing the spread of data and how it clusters around the central point.
Standard Deviation
Standard deviation is a measure of how spread out the data points in a distribution are. It tells us the typical distance of each data point from the mean. A smaller standard deviation indicates that the data points are close to the mean, whereas a larger standard deviation signifies that they are spread out over a wider range.In the given problem, the standard deviation is 300 gallons. This mark gives us a sense of the variability in water usage among different homes. When we utilize the empirical rule for normal distributions:
  • About 68% of the data falls within one standard deviation (\(mean \pm \sigma\))
  • 95% falls within two standard deviations
  • 99.7% lies within three standard deviations
Analyzing the standard deviation alongside the mean enables us to make assessments about percentiles and tendencies within a dataset. For example, the 84th percentile can be quickly approximated by adding one standard deviation to the mean, resulting in 1700 gallons in this problem, indicating a relatively high water usage compared to other homes.

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Increasing joint extension is one goal of athletic trainers. In a study to investigate the effect of therapy that uses ultrasound and stretching (Trae Tashiro, Masters Thesis, University of Virginia, 2004 ), passive knee extension was measured after treatment. Passive knee extension (in degrees) is given for each of 10 study participants. $$ \begin{array}{llllllllll} 59 & 46 & 64 & 49 & 56 & 70 & 45 & 52 & 63 & 52 \end{array} $$ Which would you choose to describe center and variability the mean and standard deviation or the median and interquartile range? Justify your choice.

The accompanying data are a subset of data read from a graph in the paper "Ladies First? A Field Study of Discrimination in Coffee Shops" (Applied Economics [April, 2008] ). The data are the waiting time (in seconds) between ordering and receiving coffee for 19 male customers at a Boston coffee shop. $$ \begin{array}{rrrrrrrrrr} 40 & 60 & 70 & 80 & 85 & 90 & 100 & 100 & 110 & 120 \\ 125 & 125 & 140 & 140 & 160 & 160 & 170 & 180 & 200 & \end{array} $$ Use these data to construct a boxplot. Write a few sentences describing the important characteristics of the boxplot.

Data on tipping percent for 20 restaurant tables, consistent with summary statistics given in the paper "Beauty and the Labor Market: Evidence from Restaurant Servers" (unpublished manuscript by Matt Parrett, 2007 ), are: $$ \begin{array}{rrrrrrrr} 0.0 & 5.0 & 45.0 & 32.8 & 13.9 & 10.4 & 55.2 & 50.0 \\ 10.0 & 14.6 & 38.4 & 23.0 & 27.9 & 27.9 & 105.0 & 19.0 \\ 10.0 & 32.1 & 11.1 & 15.0 & & & & \end{array} $$

The report titled "State of the News Media \(2013^{\text {" }}\) (Pew Research Center, May 7,2013 ) included the weekday circulation numbers for the top 20 newspapers in the country. Here are the data for the 6 months ending September 2012: $$ \begin{array}{rrrrr} 2,293798 & 1,713,833 & 1,613,866 & 641,369 & 535,875 \\ 529,999 & 522,868 & 462,228 & 432,455 & 412,669 \\ 411,960 & 410,130 & 392,989 & 325,814 & 313,003 \\ 311,504 & 300,277 & 296,427 & 293,139 & 285,088 \end{array} $$ a. Calculate and interpret the value of the median of this data set. b. Explain why the median is preferable to the mean for describing center for this data set. c. Explain why it would be unreasonable to generalize from this sample of 20 newspapers to the population of all daily newspapers in the United States.

Acrylamide, a possible cancer-causing substance, forms in high-carbohydrate foods cooked at high temperatures. Acrylamide levels can vary widely even within the same type of food. An article appearing in the journal Food Chemistry (March 2014, 204-211) included the following acrylamide content (in nanograms/gram) for five brands of bisquits: $$ \begin{array}{lllll} 345 & 292 & 334 & 276 & 248 \end{array} $$ a. Calculate the mean acrylamide level. For each data value, calculate the deviation from the mean. b. Verify that, except for the effect of rounding, the sum of the five deviations from the mean is equal to 0 for this data set. (If you rounded the sample mean or the deviations, your sum may not be exactly zero, but it should still be close to zero.) c. Use the deviations from Part (a) to calculate the variance and standard deviation for this data set.

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.