/*! This file is auto-generated */ .wp-block-button__link{color:#fff;background-color:#32373c;border-radius:9999px;box-shadow:none;text-decoration:none;padding:calc(.667em + 2px) calc(1.333em + 2px);font-size:1.125em}.wp-block-file__button{background:#32373c;color:#fff;text-decoration:none} Problem 42 With domestic sources of buildin... [FREE SOLUTION] | 91Ó°ÊÓ

91Ó°ÊÓ

With domestic sources of building supplies running low several years ago, roughly 60,000 homes were built with imported Chinese drywall. According to the article "Report Links Chinese Drywall to Home Problems" (New York Times, Nov. 24, 2009), federal investigators identified a strong association between chemicals in the drywall and electrical problems, and there is also strong evidence of respiratory difficulties due to the emission of hydrogen sulfide gas. An extensive examination of 51 homes found that 41 had such problems. Suppose these 51 were randomly sampled from the population of all homes having Chinese drywall. a. Does the data provide strong evidence for concluding that more than \(50 \%\) of all homes with Chinese drywall have electrical/environmental problems? Carry out a test of hypotheses using \(\alpha=.01\). b. Calculate a lower confidence bound using a confidence level of \(99 \%\) for the percentage of all such homes that have electrical/environmental problems. c. If it is actually the case that \(80 \%\) of all such homes have problems, how likely is it that the test of (a) would not conclude that more than \(50 \%\) do?

Short Answer

Expert verified
a. Conclude more than 50% have problems; b. Lower bound is 71.79%; c. Likelihood of not concluding is nearly 0.

Step by step solution

01

Formulate Hypotheses for Part (a)

We need to test whether more than 50% of homes with Chinese drywall have electrical/environmental problems. Our null hypothesis is that the true proportion of such homes, \( p \), is 0.5, and the alternative hypothesis is that \( p > 0.5 \). Thus, we have:\[H_0: p = 0.5\]\[H_a: p > 0.5\]We will use a significance level of \( \alpha = 0.01 \).
02

Calculate Test Statistic for Part (a)

We will use the z-test for proportions. Calculate the sample proportion \( \hat{p} \) as follows: \( \hat{p} = \frac{41}{51} \approx 0.8039 \). The test statistic \( z \) is given by:\[z = \frac{\hat{p} - p_0}{\sqrt{\frac{p_0(1-p_0)}{n}}}\]Substituting the values, we have:\[z = \frac{0.8039 - 0.5}{\sqrt{\frac{0.5 \times 0.5}{51}}} \approx \frac{0.3039}{0.070}\approx 4.34\]
03

Make a Decision for Part (a)

Compare the calculated z-value to the critical value from the standard normal distribution at \( \alpha = 0.01 \). The critical z-value is 2.33. Since 4.34 > 2.33, we reject the null hypothesis. This provides strong evidence that more than 50% of homes have problems.
04

Calculate Lower Confidence Bound for Part (b)

We want a 99% lower confidence bound for \( p \). For a 99% confidence level, the z-value is 2.33 (one-tailed). The formula for the confidence interval lower bound is:\[\hat{p} - z \times \sqrt{\frac{\hat{p}(1-\hat{p})}{n}}\]Substitute \( \hat{p} = 0.8039 \), \( n = 51 \), and \( z = 2.33 \):\[0.8039 - 2.33 \times \sqrt{\frac{0.8039 \times 0.1961}{51}} \approx 0.8039 - 0.086 \approx 0.7179\]Thus, we are 99% confident that at least 71.79% of the homes have problems.
05

Calculate Probability Not to Conclude (c)

If the actual proportion is 0.8, we want to find the probability of failing to reject \( H_0: p = 0.5 \) for \( \alpha = 0.01 \). Using the z-score formula, the z-value when \( p = 0.8 \) is:\[z = \frac{0.8 - 0.5}{\sqrt{\frac{0.5 \times 0.5}{51}}} \approx \frac{0.3}{0.070} \approx 4.29\]We need to find the probability of observing such a high z-value under \( H_0 \), which is practically zero (probability of a Type II error is very small). Therefore, the likelihood of not concluding more than 50% is virtually 0.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with 91Ó°ÊÓ!

Key Concepts

These are the key concepts you need to understand to accurately answer the question.

Z-test for proportions
In hypothesis testing, when we want to compare a sample proportion to a known population proportion, we use the z-test for proportions. This is particularly useful when analyzing binary data, where observations fall into two categories. For example, in our drywall problem, we need to decide if more than 50% of homes are affected, labeling them as "problematic" or "not problematic."

The formula for the test statistic in a z-test is:
  • Calculate the sample proportion \( \hat{p} \) which is the number of successes divided by the total sample size \( n \).
  • The test statistic \( z \) formula is given by: \[ z = \frac{\hat{p} - p_0}{\sqrt{\frac{p_0(1-p_0)}{n}}} \] Here, \( p_0 \) is the population proportion under the null hypothesis.
Once we have the z-value, we can compare it to critical values from the standard normal distribution to decide if we should reject the null hypothesis.
Confidence interval calculation
A confidence interval gives us a range of possible values for a population parameter. When we calculate a confidence interval for a proportion, we get a span of values within which the true population proportion is likely to fall.

For calculating the confidence interval, the sample proportion \( \hat{p} \) is central. The formula for a lower confidence bound on a proportion, particularly when we are interested in a one-tailed interval, is:
  • The formula is given by: \[ \hat{p} - z \times \sqrt{\frac{\hat{p}(1-\hat{p})}{n}} \] where \( z \) is the z-value corresponding to the desired confidence level, and \( n \) is the sample size.
  • This calculation gives us the minimum plausible proportion, meaning we are sure that the actual proportion is above this value to a specific degree of confidence.
In the drywall example, we calculated a 99% lower bound. This means we can be very confident that at least this percentage of homes have problems.
Null and alternative hypotheses
Setting up the null and alternative hypotheses is the first critical step in hypothesis testing. These hypotheses help us determine what we're testing and what conclusions we can draw from our statistical analysis.

The null hypothesis \( H_0 \) usually represents the status quo or a statement of no effect or difference. In our drywall scenario:
  • You state \( H_0: p = 0.5 \), suggesting that exactly 50% of the homes have issues.
The alternative hypothesis \( H_a \) is what you want to prove:
  • We set \( H_a: p > 0.5 \), meaning we believe more than 50% of the homes face problems.
By using the test statistic, we assess these hypotheses to decide whether the alternative hypothesis provides enough evidence to reject the null hypothesis.
Significance level alpha
The significance level, denoted as \( \alpha \), is a crucial threshold in hypothesis testing that determines the risk of concluding that an effect exists when it doesn't—a Type I error.

It represents the probability of rejecting the null hypothesis when it's actually true. Common choices for \( \alpha \) are 0.05 or 0.01.
  • This value helps decide the critical z-value you compare against the test statistic.
  • If the computed test statistic surpasses the critical z-value, the result is statistically significant.
For our exercise, an \( \alpha \) of 0.01 was selected, offering strict criteria for evidence, thus making rejecting the null hypothesis more challenging. This level of significance ensures greater confidence in the conclusion that more than 50% of homes have the problems indicated.

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

To determine whether the pipe welds in a nuclear power plant meet specifications, a random sample of welds is selected, and tests are conducted on each weld in the sample. Weld strength is measured as the force required to break the weld. Suppose the specifications state that mean strength of welds should exceed \(100 \mathrm{lb} / \mathrm{in}^{2}\); the inspection team decides to test \(H_{0}: \mu=100\) versus \(H_{a}: \mu>100\). Explain why it might be preferable to use this \(H_{\mathrm{a}}\) rather than \(\mu<100\).

Before agreeing to purchase a large order of polyethylene sheaths for a particular type of high-pressure oil-filled submarine power cable, a company wants to see conclusive evidence that the true standard deviation of sheath thickness is less than \(.05 \mathrm{~mm}\). What hypotheses should be tested, and why? In this context, what are the type I and type II errors?

When the population distribution is normal and \(n\) is large, the sample standard deviation \(S\) has approximately a normal distribution with \(E(S) \approx \sigma\) and \(V(S) \approx \sigma^{2} /(2 n)\). We already know that in this case, for any \(n, \bar{X}\) is normal with \(E(\bar{X})=\mu\) and \(V(\bar{X})=\sigma^{2} / n\) a. Assuming that the underlying distribution is normal, what is an approximately unbiased estimator of the 99th percentile \(\theta=\mu+2.33 \sigma\) ? b. When the \(X_{i}\) 's are normal, it can be shown that \(\bar{X}\) and \(S\) are independent rv's (one measures location whereas the other measures spread). Use this to compute \(V(\hat{\theta})\) and \(\sigma_{\hat{\theta}}\) for the estimator \(\hat{\theta}\) of part (a). What is the estimated standard error \(\hat{\sigma}_{\hat{\theta}}\) ? c. Write a test statistic for testing \(H_{0}: \theta=\theta_{0}\) that has approximately a standard normal distribution when \(H_{0}\) is true. If soil \(\mathrm{pH}\) is normally distributed in a certain region and 64 soil samples yield \(\bar{x}=6.33, s=.16\), does this provide strong evidence for concluding that at most \(99 \%\) of all possible samples would have a \(\mathrm{pH}\) of less than 6.75? Test using \(\alpha=.01\).

The article "Caffeine Knowledge, Attitudes, and Consumption in Adult Women" (J. of Nutrition Educ., 1992: 179-184) reports the following summary data on daily caffeine consumption for a sample of adult women: \(n=47\), \(\bar{x}=215 \mathrm{mg}, s=235 \mathrm{mg}\), and range \(=5-1176\). a. Does it appear plausible that the population distribution of daily caffeine consumption is normal? Is it necessary to assume a normal population distribution to test hypotheses about the value of the population mean consumption? Explain your reasoning. b. Suppose it had previously been believed that mean consumption was at most \(200 \mathrm{mg}\). Does the given data contradict this prior belief? Test the appropriate hypotheses at significance level . 10 and include a \(P\)-value in your analysis.

The article "Uncertainty Estimation in Railway Track LifeCycle Cost" (J. of Rail and Rapid Transit, 2009) presented the following data on time to repair (min) a rail break in the high rail on a curved track of a certain railway line. A normal probability plot of the data shows a reasonably linear pattern, so it is plausible that the population distribution of repair time is at least approximately normal. The sample mean and standard deviation are \(249.7\) and \(145.1\), respectively. a. Is there compelling evidence for concluding that true average repair time exceeds \(200 \mathrm{~min}\) ? Carry out a test of hypotheses using a significance level of \(.05\). b. Using \(\sigma=150\), what is the type II error probability of the test used in (a) when true average repair time is actually \(300 \mathrm{~min}\) ? That is, what is \(\beta(300)\) ?

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.