/*! This file is auto-generated */ .wp-block-button__link{color:#fff;background-color:#32373c;border-radius:9999px;box-shadow:none;text-decoration:none;padding:calc(.667em + 2px) calc(1.333em + 2px);font-size:1.125em}.wp-block-file__button{background:#32373c;color:#fff;text-decoration:none} Problem 177 The data in Hurricanes contains ... [FREE SOLUTION] | 91Ó°ÊÓ

91Ó°ÊÓ

The data in Hurricanes contains the number of hurricanes that made landfall on the eastern coast of the United States over the 101 years from 1914 to 2014 . Suppose we are interested in testing whether the number of hurricanes is increasing over time. (a) State the null and alternative hypotheses for testing whether the correlation between year and number of hurricanes is positive, which would indicate the number of hurricanes is increasing. (b) Describe in detail how you would create a randomization distribution to test this claim (if you had many more hours to do this exercise and no access to technology).

Short Answer

Expert verified
The null hypothesis (\(H_0\)) is: 'The correlation between year and number of hurricanes is equal to zero,' and the alternative hypothesis (\(H_A\)) is: 'The correlation between year and number of hurricanes is greater than zero.' A randomization distribution would be created by rerunning the test multiple times with random samples, recording the correlation each time to see if the actual observed correlation is an extreme value.

Step by step solution

01

Formulating the Hypotheses

The null hypothesis and the alternative hypothesis are two dichotomous statements about the population. For this exercise, we are interested in examining whether the number of hurricanes is increasing over time. The null hypothesis, \(H_0\), would be that there is no correlation between the year and the number of hurricanes, implying that the number of hurricanes is not increasing over time. The alternative hypothesis, \(H_A\), would be that there is a positive correlation between the year and the number of hurricanes, indicating an increase in the number of hurricanes over time. Thus, the null hypothesis, \(H_0\), is: 'The correlation between year and number of hurricanes is equal to zero,' and the alternative hypothesis, \(H_A\), is: 'The correlation between year and number of hurricanes is greater than zero.'
02

Creating a Randomization Distribution

A randomization distribution would be created to test the hypotheses by rerunning the test multiple times with random samples. Given unlimited time and no access to technology. To create a randomization distribution, follow these steps: First, record the number of hurricanes for each year. Second, mix these results up and then assign them at random to the years. This effectively breaks any relationship that may exist between the year and the number of hurricanes. Third, calculate the correlation under this random assignment. This serves as a single data point representing the null hypothesis. Lastly, repeat these steps as many times as possible to generate a distribution of correlations that would be expected under the null hypothesis. This randomization distribution is then used to see if the actual observed correlation is an extreme value, which might lead to the rejection of the null hypothesis in favor of the alternative hypothesis.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with 91Ó°ÊÓ!

Key Concepts

These are the key concepts you need to understand to accurately answer the question.

Null and Alternative Hypotheses
Understanding the null and alternative hypotheses is a foundational aspect of statistical hypothesis testing. In simple terms, the null hypothesis (\(H_0\)) represents the default assumption or the status quo, suggesting there is no effect or no association between the variables being studied. Specifically in our exercise, the null hypothesis posits that there is no upward trend in the number of hurricanes over time, which means that the correlation between the year and number of hurricanes is zero.

In contrast, the alternative hypothesis (\(H_A\)) is the statement we wish to validate, based on the evidence. It suggests that there is an effect or association, which in this case, is a positive correlation indicating that the number of hurricanes is increasing over time. Formulating these hypotheses carefully is pivotal, as they guide the direction of the statistical test and the interpretation of the results.

It’s important to note that these hypotheses are mutually exclusive and exhaustive, meaning only one can be true and no other outcomes are possible within the context of the test. When we perform the statistical test, we'll be essentially checking the evidence against the null hypothesis, and if it’s strong enough, we can reject it in favor of the alternative.
Correlation
Correlation is a statistical measure that expresses the extent to which two variables are linearly related. It’s a value between -1 and 1 where:
  • \(-1\) indicates a perfect negative linear relationship,
  • 0 indicates no linear relationship, and
  • \(+1\) indicates a perfect positive linear relationship.

For the task described in the exercise, a positive correlation would mean as one variable (year) increases, so does the other (number of hurricanes). The strength of the correlation is key to our hypothesis test; a higher positive correlation would support the alternative hypothesis. However, it is crucial to remember that correlation does not imply causation. Even if we find a strong positive correlation, we cannot conclude that an increase in the year causes more hurricanes—there could be other lurking variables or factors contributing to this observed relationship.
Randomization Distribution
A randomization distribution is a fundamental tool in the absence of certain assumptions required for traditional parametric tests. It’s created by simulating the process of allocating observations to groups in a random manner many times over, to build a distribution that represents a world where the null hypothesis is true.

To construct a randomization distribution for our hurricane exercise, we would shuffle the years’ data and randomly pair them with the number of hurricanes, removing any actual chronological association. We then calculate the correlation coefficient for each of these randomized pairings and record the result. Through repeated shufflings, we create a multitude of correlation coefficients that mimic what we would expect to see if there truly were no relationship between years and the number of hurricanes.

The observed correlation from our real data is then compared to this distribution. If the observed correlation falls outside the majority of the generated random correlations—typically in the tails of the distribution—we would have evidence to reject the null hypothesis in favor of the alternative. This method does not rely on assumptions of normal distributions and is especially valuable when dealing with small or non-standard data, but it requires a large number of repetitions to gain reliable insight.

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Data 4.3 on page 265 introduces a situation in which a restaurant chain is measuring the levels of arsenic in chicken from its suppliers. The question is whether there is evidence that the mean level of arsenic is greater than \(80 \mathrm{ppb},\) so we are testing \(H_{0}: \mu=80\) vs \(H_{a}:\) \(\mu>80,\) where \(\mu\) represents the average level of arsenic in all chicken from a certain supplier. It takes money and time to test for arsenic, so samples are often small. Suppose \(n=6\) chickens from one supplier are tested, and the levels of arsenic (in ppb) are: \(\begin{array}{llllll}68, & 75, & 81, & 93, & 95, & 134\end{array}\) (a) What is the sample mean for the data? (b) Translate the original sample data by the appropriate amount to create a new dataset in which the null hypothesis is true. How do the sample size and standard deviation of this new dataset compare to the sample size and standard deviation of the original dataset? (c) Write the six new data values from part (b) on six cards. Sample from these cards with replacement to generate one randomization sample. (Select a card at random, record the value, put it back, select another at random, until you have a sample of size \(6,\) to match the original sample size.) List the values in the sample and give the sample mean. (d) Generate 9 more simulated samples, for a total of 10 samples for a randomization distribution. Give the sample mean in each case and create a small dotplot. Use an arrow to locate the original sample mean on your dotplot.

A confidence interval for a sample is given, followed by several hypotheses to test using that sample. In each case, use the confidence interval to give a conclusion of the test (if possible) and also state the significance level you are using. A \(95 \%\) confidence interval for \(p: 0.48\) to 0.57 (a) \(H_{0}: p=0.5\) vs \(H_{a}: p \neq 0.5\) (b) \(H_{0}: p=0.75\) vs \(H_{a}: p \neq 0.75\) (c) \(H_{0}: p=0.4\) vs \(H_{a}: p \neq 0.4\)

We are conducting many hypothesis tests to test a claim. In every case, assume that the null hypothesis is true. Approximately how many of the tests will incorrectly find significance? 100 tests conducted using a significance level of \(5 \%\).

How influenced are consumers by price and marketing? If something costs more, do our expectations lead us to believe it is better? Because expectations play such a large role in reality, can a product that costs more (but is in reality identical) actually be more effective? Baba Shiv, a neuroeconomist at Stanford, conducted a study \(^{25}\) involving 204 undergraduates. In the study, all students consumed a popular energy drink which claims on its packaging to increase mental acuity. The students were then asked to solve a series of puzzles. The students were charged either regular price ( \(\$ 1.89\) ) for the drink or a discount price \((\$ 0.89)\). The students receiving the discount price were told that they were able to buy the drink at a discount since the drinks had been purchased in bulk. The authors of the study describe the results: "the number of puzzles solved was lower in the reduced-price condition \((M=4.2)\) than in the regular-price condition \((M=5.8) \ldots p<.0001 . "\) (a) What can you conclude from the study? How strong is the evidence for the conclusion? (b) These results have been replicated in many similar studies. As Jonah Lehrer tells us: "According to Shiv, a kind of placebo effect is at work. Since we expect cheaper goods to be less effective, they generally are less effective, even if they are identical to more expensive products. This is why brand-name aspirin works better than generic aspirin and why Coke tastes better than cheaper colas, even if most consumers can't tell the difference in blind taste tests."26 Discuss the implications of this research in marketing and pricing.

It is believed that sunlight offers some protection against multiple sclerosis (MS) since the disease is rare near the equator and more prevalent at high latitudes. What is it about sunlight that offers this protection? To find out, researchers \({ }^{23}\) injected mice with proteins that induce a condition in mice comparable to MS in humans. The control mice got only the injection, while a second group of mice were exposed to UV light before and after the injection, and a third group of mice received vitamin D supplements before and after the injection. In the test comparing UV light to the control group, evidence was found that the mice exposed to UV suppressed the MS-like disease significantly better than the control mice. In the test comparing mice getting vitamin D supplements to the control group, the mice given the vitamin D did not fare significantly better than the control group. If the p-values for the two tests are 0.472 and 0.002 , which p-value goes with which test?

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.