/*! This file is auto-generated */ .wp-block-button__link{color:#fff;background-color:#32373c;border-radius:9999px;box-shadow:none;text-decoration:none;padding:calc(.667em + 2px) calc(1.333em + 2px);font-size:1.125em}.wp-block-file__button{background:#32373c;color:#fff;text-decoration:none} Problem 3 Consider the following sample of... [FREE SOLUTION] | 91Ó°ÊÓ

91Ó°ÊÓ

Consider the following sample of observations on coating thickness for low- viscosity paint ("Achieving a Target Value for a Manufacturing Process: A Case Study," J. of Quality Technology, 1992: 22-26): $$ \begin{array}{rrrrrrrr} .83 & .88 & .88 & 1.04 & 1.09 & 1.12 & 1.29 & 1.31 \\ 1.48 & 1.49 & 1.59 & 1.62 & 1.65 & 1.71 & 1.76 & 1.83 \end{array} $$ Assume that the distribution of coating thickness is normal (a normal probability plot strongly supports this assumption). a. Calculate a point estimate of the mean value of coating thickness, and state which estimator you used. b. Calculate a point estimate of the median of the coating thickness distribution, and state which estimator you used. c. Calculate a point estimate of the value that separates the largest \(10 \%\) of all values in the thickness distribution from the remaining \(90 \%\), and state which estimator you used. [Hint: Express what you are trying to estimate in terms of \(\mu\) and \(\sigma\).] d. Estimate \(P(X<1.5)\), i,e, the proportion of all thickness values less than 1.5. [Hint: If you knew the values of \(\mu\) and \(\sigma\), you could calculate this probability. These values are not available, but they can be estimated.] e. What is the estimated standard error of the estimator that you used in part (b)?

Short Answer

Expert verified
a. Sample mean: 1.389. b. Median: 1.395. c. 90th percentile: 1.775. d. \( P(X<1.5) \approx 0.644 \). e. Standard error: 0.073.

Step by step solution

01

Calculate Sample Mean for Part (a)

To estimate the mean value of the coating thickness, use the sample mean formula, \( \bar{x} = \frac{1}{n} \sum_{i=1}^{n} x_i \), where \( n \) is the number of observations and \( x_i \) represents each observation of coating thickness. Here, \( n = 16 \) and the observations are: 0.83, 0.88, 0.88, 1.04, 1.09, 1.12, 1.29, 1.31, 1.48, 1.49, 1.59, 1.62, 1.65, 1.71, 1.76, 1.83. The mean is calculated as follows: \( \bar{x} = \frac{0.83 + 0.88 + 0.88 + 1.04 + 1.09 + 1.12 + 1.29 + 1.31 + 1.48 + 1.49 + 1.59 + 1.62 + 1.65 + 1.71 + 1.76 + 1.83}{16} = 1.389 \).
02

Median Calculation for Part (b)

To find the median, order the observations and find the middle value(s). Since there are 16 observations, the median is the average of the 8th and 9th values. First, reorder the values if necessary: 0.83, 0.88, 0.88, 1.04, 1.09, 1.12, 1.29, 1.31, 1.48, 1.49, 1.59, 1.62, 1.65, 1.71, 1.76, 1.83. The 8th and 9th values are 1.31 and 1.48, so the median is \( \text{Median} = \frac{1.31 + 1.48}{2} = 1.395 \).
03

Determine 90th Percentile for Part (c)

To find the 90th percentile, or the point where the largest 10% of values begin, express it in terms of the mean \( \mu \) and standard deviation \( \sigma \). Thus, calculate the z-score corresponding to the 90th percentile, \( z = 1.2816 \). The formula for a percentile in a normal distribution is \( x = \mu + z \sigma \). Use the sample mean (\( 1.389 \)) and estimate the sample standard deviation \( s \) with \( s = \sqrt{\frac{1}{n-1} \sum_{i=1}^{n}(x_i-\bar{x})^2} = 0.301 \). The 90th percentile: \( x = 1.389 + 1.2816 \times 0.301 = 1.775 \).
04

Estimate Probability for Part (d)

To estimate \( P(X<1.5) \), calculate the z-score for 1.5 using \( z = \frac{1.5 - \bar{x}}{s} = \frac{1.5 - 1.389}{0.301} \approx 0.369 \). Using a standard normal distribution table or calculator, find that \( P(Z < 0.369) \approx 0.644 \).
05

Standard Error for Part (e)

The standard error of the median for a normal distribution can be approximated using \( SE = \frac{\sigma}{\sqrt{n}} \sqrt{\frac{\pi}{2}} \approx \frac{0.301}{\sqrt{16}} \sqrt{\frac{\pi}{2}} \approx 0.073 \).

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with 91Ó°ÊÓ!

Key Concepts

These are the key concepts you need to understand to accurately answer the question.

Point Estimation
Point estimation is a technique used to calculate a single value from sample data to serve as a best guess for an unknown population parameter. For example, in the given exercise, we perform point estimation to find both the mean and median of the coating thickness distribution. In part (a) of the exercise, the sample mean serves as a point estimate for the population mean. The sample mean is calculated using the formula:
  • \( \bar{x} = \frac{1}{n} \sum_{i=1}^{n} x_i \)
where \(n\) is the total number of observations, and \(x_i\) represents each observation. This method provides an efficient point estimate for the average coating thickness in this sample set.
In general, point estimation allows us to make informed guesses about population parameters using sample data, making it a crucial part of statistical analysis.
Standard Deviation
Standard deviation is a measure of the spread or dispersion of a set of data points. It shows us how much individual data points deviate from the mean of the dataset. In the exercise, the sample standard deviation \(s\) is calculated to understand the variability in coating thickness. The formula used for finding sample standard deviation is as follows:
  • \( s = \sqrt{\frac{1}{n-1} \sum_{i=1}^{n}(x_i-\bar{x})^2} \)
Here, \(x_i\) is each data point, \(\bar{x}\) is the sample mean, and \(n\) is the number of observations.
Standard deviation is crucial in determining how "spread out" the data are from the mean and assists in calculating other values, such as the z-scores and percentiles, which are explored in this exercise.
Z-score
Understanding z-scores is essential for interpreting where a particular observation stands within a distribution. A z-score measures how many standard deviations an element is from the mean. It helps us standardize scores from different datasets, making comparisons possible. In the exercise, z-scores are used to find percentiles and probabilities.
The formula for calculating a z-score is:
  • \( z = \frac{x - \mu}{\sigma} \)
where \(x\) is the value of interest, \(\mu\) is the mean of the distribution, and \(\sigma\) is the standard deviation. Given that \(\mu\) and \(\sigma\) are often unknown in practice, they can be estimated using the sample mean \(\bar{x}\) and sample standard deviation \(s\).
Z-scores enable us to work with the standard normal distribution to find probabilities and percentiles, as demonstrated in parts (c) and (d) of the exercise.
Percentile
A percentile indicates the relative standing of a value within a dataset. The nth percentile is the value below which n percent of the observations fall. In the exercise, the 90th percentile is calculated to separate the highest 10% of the coating thickness values.
We utilize the standard normal distribution and z-scores to find this percentile. The formula used is:
  • \( x = \mu + z \sigma \)
where \(z\) is the z-score corresponding to the desired percentile (in this case, the 90th percentile), \(\mu\) is the mean, and \(\sigma\) is the standard deviation.
Percentiles provide a way to understand the distribution of data points among a population, and are particularly useful in determining thresholds or cut-off values in a dataset.
Standard Error
Standard error measures the variability of a sample statistic, such as the mean, from the actual population parameter. It tells us how much the sample mean (or other statistic) is expected to vary from the true population mean. In the context of the exercise, the standard error of the median is approximated using the given data.
The formula to estimate the standard error of the median in a normal distribution is:
  • \( SE = \frac{\sigma}{\sqrt{n}} \sqrt{\frac{\pi}{2}} \)
where \(\sigma\) is the standard deviation, \(n\) is the sample size, and \(\pi\) is a constant that enhances the approximation for the median.
Understanding the standard error enables us to assess how reliable our sample estimations are, and it plays a significant role in inferential statistics and hypothesis testing.

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Let \(X_{1}, X_{2}, \ldots, X_{n}\) be a random sample from a pdf \(f(x)\) that is symmetric about \(\mu\), so that \(\widetilde{X}\) is an unbiased estimator of \(\mu\). If \(n\) is large, it can be shown that \(V(\widetilde{X}) \approx 1 /\left(4 n[f(\mu)]^{2}\right)\). a. Compare \(V(\widetilde{X})\) to \(V(\bar{X})\) when the underlying distribution is normal. b. When the underlying pdf is Cauchy (see Example 6.7), \(V(\bar{X})=\infty\), so \(\bar{X}\) is a terrible estimator. What is \(V(\widetilde{X})\) in this case when \(n\) is large?

In Chapter 3 , we defined a negative binomial rv as the number of failures that occur before the \(r\) th success in a sequence of independent and identical success/failure trials. The probability mass function (pmf) of \(X\) is $$ \begin{aligned} &n b(x ; r ; p)= \\ &\left(\begin{array}{c} x+r-1 \\ x \end{array}\right) p^{r}(1-p)^{x} \quad x=0,1,2, \ldots \end{aligned} $$ a. Suppose that \(r \geq 2\). Show that $$ \hat{p}=(r-1) /(X+r-1) $$ is an unbiased estimator for \(p\). [Hint: Write out \(E(\hat{p})\) and cancel \(x+r-1\) inside the sum.] b. A reporter wishing to interview five individuals who support a certain candidate begins asking people whether \((S)\) or not \((F)\) they support the candidate. If the sequence of responses is SFFSFFFSSS, estimate \(p=\) the true proportion who support the candidate.

a. A random sample of 10 houses in a particular area, each of which is heated with natural gas, is selected and the amount of gas (therms) used during the month of January is determined for each house. The resulting observations are \(103,156,118,89,125,147,122,109,138,99\). Let \(\mu\) denote the average gas usage during January by all houses in this area. Compute a point estimate of \(\mu\). b. Suppose there are 10,000 houses in this area that use natural gas for heating. Let \(\tau\) denote the total amount of gas used by all of these houses during January. Estimate \(\tau\) using the data of part (a). What estimator did you use in computing your estimate? c. Use the data in part (a) to estimate \(p\), the proportion of all houses that used at least 100 therms. d. Give a point estimate of the population median usage (the middle value in the population of all houses) based on the sample of part (a). What estimator did you use?

Using a long rod that has length \(\mu\), you are going to lay out a square plot in which the length of each side is \(\mu\). Thus the area of the plot will be \(\mu^{2}\). However, you do not know the value of \(\mu\), so you decide to make \(n\) independent measurements \(X_{1}, X_{2}, \ldots, X_{n}\) of the length. Assume that each \(X_{i}\) has mean \(\mu\) (unbiased measurements) and variance \(\sigma^{2}\). a. Show that \(\bar{X}^{2}\) is not an unbiased estimator for \(\mu^{2}\). b. For what value of \(k\) is the estimator \(\bar{X}^{2}-k S^{2}\) unbiased for \(\mu^{2}\) ?

Consider the accompanying observations on stream flow (1000s of acre-feet) recorded at a station in Colorado for the period April 1-August 31 over a 31-year span (from an article in the 1974 volume of Water 91Ó°ÊÓ Research). $$ \begin{array}{rrrrr} 127.96 & 210.07 & 203.24 & 108.91 & 178.21 \\ 285.37 & 100.85 & 89.59 & 185.36 & 126.94 \\ 200.19 & 66.24 & 247.11 & 299.87 & 109.64 \\ 125.86 & 114.79 & 109.11 & 330.33 & 85.54 \\ 117.64 & 302.74 & 280.55 & 145.11 & 95.36 \\ 204.91 & 311.13 & 150.58 & 262.09 & 477.08 \\ 94.33 & & & & \end{array} $$ An appropriate probability plot supports the use of the lognormal distribution (see Section 4.5) as a reasonable model for stream flow. a. Estimate the parameters of the distribution. [Hint: Remember that \(X\) has a lognormal distribution with parameters \(\mu\) and \(\sigma^{2}\) if \(\ln (X)\) is normally distributed with mean \(\mu\) and variance \(\sigma^{2}\).] b. Use the estimates of part (a) to calculate an estimate of the expected value of stream flow.

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.