/*! This file is auto-generated */ .wp-block-button__link{color:#fff;background-color:#32373c;border-radius:9999px;box-shadow:none;text-decoration:none;padding:calc(.667em + 2px) calc(1.333em + 2px);font-size:1.125em}.wp-block-file__button{background:#32373c;color:#fff;text-decoration:none} Problem 55 Multiple Testing. This problem a... [FREE SOLUTION] | 91Ó°ÊÓ

91Ó°ÊÓ

Multiple Testing. This problem assumes that you have studied optional Chaputer 14 on binomial distributions. If the null hypothesis is true, testing at significance level \(0.05\) means that the probability is \(0.05\) of incorrectly rejecting the null hypothesis. Suppose one conducts 20 independent tests at level \(0.05\) and in each case the null hypothesis is true. Let \(X\) denote the number of tests that incorrectly reject the null hypothesis. \(X\) can take values from 0 to 20 and will follow a binomial distribution with \(n=20\) observations and probability \(p=0.05\) of success. What is the probability \(X \geq 1\) ? This is the probability that at least one test will incorrectly reject the null hypothesis.

Short Answer

Expert verified
The probability that at least one test will incorrectly reject the null hypothesis is approximately 0.6415.

Step by step solution

01

Understand the Problem

We need to find the probability of getting at least one false positive out of 20 independent tests, each with a type I error probability of 0.05, which means finding \( P(X \geq 1) \) for \( X \sim \text{Binomial}(n=20, p=0.05) \).
02

Identify the Complement

Instead of directly finding \( P(X \geq 1) \), we will compute the complement, \( P(X = 0) \), which is when there are no false positives, and then use the formula \( P(X \geq 1) = 1 - P(X = 0) \).
03

Apply the Binomial Formula

The probability of getting exactly 0 false positives is given by \( P(X = 0) = \binom{20}{0} (0.05)^0 (0.95)^{20} \). Simplifying this, we find \( P(X = 0) = 0.95^{20} \).
04

Calculate P(X = 0)

Compute \( 0.95^{20} \) using a calculator: \( 0.95^{20} \approx 0.3585 \).
05

Find P(X ≥ 1)

Using the complement rule, \( P(X \geq 1) = 1 - P(X = 0) = 1 - 0.3585 = 0.6415 \).

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with 91Ó°ÊÓ!

Key Concepts

These are the key concepts you need to understand to accurately answer the question.

Binomial Distribution
The binomial distribution is a probability distribution that summarizes the likelihood of a value taking one of two independent states. For example, it often represents the success or failure outcome of n independent trials. In statistical terms, each trial is an experiment with two possible outcomes: a success with probability \( p \), and a failure with probability \( 1-p \). Each trial must be independent, meaning the result of one trial does not affect the results of others.

In this exercise, a binomial distribution is used with parameters \( n = 20 \) and \( p = 0.05 \). Here, \( n \) signifies the number of tests (trials), and \( p \) represents the probability that a given test incorrectly rejects the null hypothesis (a false positive). To calculate probabilities concerning the binomial distribution, formulas involving combinations and powers are utilized. This allows us to assess the behavior and likelihood of various outcomes across multiple trials.
Null Hypothesis
The null hypothesis, often denoted as \( H_0 \), is a pivotal concept in statistics. It provides a statement for testing that assumes no effect or no difference in the underlying process, allowing for statistical tests to be conducted with an assumption to either reject or fail to reject.

In this context, each test conducted is based on the assumption that the null hypothesis is true: no significant effect or change exists. However, due to random chance, there is a probability of obtaining results that appear significant, known as Type I error (which we explore in the next section). The exercise's primary focus is on maintaining control over these errors as they confront real-world data and analyses, especially when dealing with multiple tests.
Type I Error
A Type I Error occurs when a test incorrectly rejects a true null hypothesis. It's often referred to as a "false positive" - thinking there's an effect when in reality, there isn't one. Traditionally in hypothesis testing, the significance level \( \alpha \) (in this case, 0.05) is the probability threshold for committing this type of error.

With multiple testing, as in performing 20 tests with \( \alpha = 0.05 \), the chance of encountering at least one Type I error increases. The given problem revolves around computing the probability of getting at least one Type I error across these tests, which can be quite insightful, especially when controlling for errors becomes crucial in larger datasets or investigations. Understanding and quantifying these errors help researchers prevent drawing false conclusions.
Complement Rule
The complement rule is a handy concept in probability that simplifies the calculation of probabilities for binomial events. It asserts that the probability an event does not happen is 1 minus the probability that the event does happen.

In the solution provided, instead of directly calculating the probability of getting at least one false positive \( P(X \geq 1) \), the complement rule is used by first calculating \( P(X = 0) \), the probability of zero false positives.

By using the formula \( P(X \geq 1) = 1 - P(X = 0) \), the evaluation of complex scenarios where direct calculation is difficult becomes simplified. Hence, for this exercise, we arrive at \( P(X \geq 1) = 1 - 0.95^{20} = 0.6415 \), easing the interpretation of somewhat convoluted probability problems.

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Divorces. Divorce rates vary from city to city in the United States. We have lots of data on many U.S. cities. Statistical software makes it easy to perform dozens of significance tests on dozens of variables to see which ones best predict divorce. One interesting finding is that those cities with Major League ballparks tend to have significantly more divorces than other cities. To improve your chances of a lasting marriage, should you use this "significant" variable to decide where to live? Explain your answer.

Reducing the Gender Gap. In many science disciplines, women are outperformed by men on test scores. Will "values affirmation training" improve self- confidence and hence performance of women relative to men in science courses? A study conducted at a large university compares the scores of men and women at the end of a large introductory physics course on a nationally normed standardized test of conceptual physics, the Force and Motion Conceptual Evaluation (FMCE). Half the women in the course were randomly assigned to values affirmation training during the course; the other half received no training. The study reports that there was a significant difference \((P<0.01)\) in the gap between men's and women's scores, although the gap for women who received the values affirmation training was much smaller than that for women who did not receive training. As evidence that this gap was reduced for woman who received the training, the study also reports that a \(95 \%\) confidence interval for the difference in mean scores on the FMCE exam between women who received the training and those who didn't is \(13 \pm 8\) points. You are a faculty member in the physics department, and the provost, who is interested in women in science, asks you about the study. a. Explain in simple language what "a significant difference \((P<0.01)\) " means. b. Explain clearly and briefly what "95\% confidence" means. c. Is this study good evidence that requiring values affirmation training of all female students would greatly reduce the gender gap in scores on science tests in college courses?

Detecting A cid Rain. Emissions of sulfur dioxide by industry set off chemical changes in the atmosphere that result in acid rain. The acidity of liquids is measured by \(\mathrm{pH}\) on a scale of 0 to 14. Distilled water has \(\mathrm{pH}\) 7.0, and lower \(\mathrm{pH}\) values indicate acidity. Normal rain is somewhat acidic, so acid rain is sometimes defined as rainfall with a pH below \(5.0\). Suppose that \(\mathrm{pH}\) measurements of rainfall on different days in a Canadian forest follow a Normal distribution with standard deviation \(\sigma=0.6\). A sample of \(n\) days finds that the mean \(\mathrm{pH}\) is \(x=4.8\). Is this good evidence that the mean \(\mathrm{pH} \mu\) for all rainy days is less than \(5.0\) ? The answer depends on the size of the sample. Either by hand or using the P-Value of a Test of Significance applet, carry out four tests of $$ \begin{aligned} &H_{0}: \mu=5.0 \\ &H_{a}: \mu<5.0 \end{aligned} $$ Use \(\sigma=0.6\) and \(x=4.8\) in all four tests. But use four different sample sizes: \(n=9, n=16, n=36\), and \(n=64\). a. What are the \(P\)-values for the four tests? The P-value of the same result \(x=4.8\) gets smaller (more significant) as the sample size increases. b. For each test, sketch the Normal curve for the sampling distribution of \(x\) when \(H_{0}\) is true. This curve has mean \(5.0\) and standard deviation \(0.6 / \sqrt{n}\). Mark the observed \(x=4.8\) on each curve. (If you use the applet, you can just copy the curves displayed by the applet.) The same result \(x=4.8\) gets more extreme on the sampling distribution as the sample size increases.

Vigorous exercise is associated with an extra several years of life (on the average). Researchers in Denmark found evidence that slow jogging may provide even better life-expect ancy benefits than more vigorous running. \(-\) Suppose that the added life expectancy associated with slow jogging for 30 minutes three times a week is just one month. A statistical test is more likely to find a significant increase in mean life expectancy for those who jog slowly if a. it is based on a very large random sample. b. it is based on a very small random sample. c. The size of the sample has little effect on significance for such a small increase in life expectancy.

Sample Size and Margin of Error. Example 16.1 (page 368) described NHANES data on the body mass index (BMI) of 936 young men. The mean BMI in the sample was \(x=27.2 \mathrm{~kg} / \mathrm{m}^{2}\). We treated these data as an SRS from a Normally distributed population with standard deviation \(\sigma=11.6\) a. Suppose that we had an SRS of just 100 young men. What would be the margin of error for \(95 \%\) confidence? b. Find the margins of error for \(95 \%\) confidence based on SRSs of 400 young men and 1600 young men. c. Compare the three margins of error. How does increasing the sample size change the margin of error of a confidence interval when the confidence level and population standard deviation remain the same?

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.