/*! This file is auto-generated */ .wp-block-button__link{color:#fff;background-color:#32373c;border-radius:9999px;box-shadow:none;text-decoration:none;padding:calc(.667em + 2px) calc(1.333em + 2px);font-size:1.125em}.wp-block-file__button{background:#32373c;color:#fff;text-decoration:none} Problem 7 The paper "Cigarette Tar Yields ... [FREE SOLUTION] | 91Ó°ÊÓ

91Ó°ÊÓ

The paper "Cigarette Tar Yields in Relation to Mortality from Lung Cancer in the Cancer Prevention Study II Prospective Cohort" (British Medical journal [2004]: 72-79) included the accompanying data on the tar level of cigarettes smoked for a sample of male smokers who subsequently died of lung cancer. $$ \begin{array}{lc} \text { Tar Level } & \text { Frequency } \\ \hline 0-7 \mathrm{mg} & 103 \\ 8-14 \mathrm{mg} & 378 \\ 15-21 \mathrm{mg} & 563 \\ \geq 22 \mathrm{mg} & 150 \\ \hline \end{array} $$ Assume it is reasonable to regard the sample as representative of male smokers who die of lung cancer. Is there convincing evidence that the proportion of male smoker lung cancer deaths is not the same for the four given tar level categories?

Short Answer

Expert verified
Without specific numerical calculations, it is not possible to conclude whether there is convincing evidence that the proportion of male smoker lung cancer deaths is not the same for the four given tar level categories. However, if p-value < 0.05 from the Chi-Square test, then there is statistically significant evidence to reject the null hypothesis in favor of the alternative, implying the proportions are different across the tar level categories.

Step by step solution

01

Define Null and Alternative Hypotheses

The null hypothesis \(H_0\) is that there is no association between tar levels and the proportion of male smokers who die of lung cancer, implying that the proportion remains the same across all tar level categories. The alternative hypothesis \(H_A\) is that there is an association between tar levels and the proportion of male lung cancer deaths, indicating the proportion is different across the tar level categories.
02

Calculate Observed and Expected Frequencies

We are given the observed frequencies in the exercise. To calculate the expected frequencies, we calculate the total number of deaths which is 103 + 378 + 563 + 150 = 1194. Under \(H_0\), we assume each tar level has the same proportion of deaths. Therefore, the expected frequency for each category is the total number of deaths divided by the number of categories, which is 1194 / 4 = 298.5.
03

Compute Chi-Square Test Statistic

The Chi-Square test statistic calculates the sum of the squared difference between observed and expected frequencies divided by the expected frequency for all levels. It is given as \(\chi^2 = \Sigma [ (O_i - E_i)^2 / E_i ] \), where \(O_i\) and \(E_i\) represent the observed and expected frequency for the i-th category. Therefore, we have:\[\chi^2 = [(103 - 298.5)^2 / 298.5] + [(378 - 298.5)^2 / 298.5] + [(563 - 298.5)^2 / 298.5] + [(150 - 298.5)^2 / 298.5]\]Calculating these values will give us our Chi-Square statistic.
04

Determine p-value

Next, we use the Chi-Square distribution with 3 degrees of freedom (4 categories - 1) to find the area to the right of the calculated Chi-Square value. This area is the p-value. If the p-value is less than 0.05, then there is statistically significant evidence to reject the null hypothesis in favor of the alternative.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with 91Ó°ÊÓ!

Key Concepts

These are the key concepts you need to understand to accurately answer the question.

Hypothesis Testing
Hypothesis testing is a statistical method that helps us decide whether to accept or reject a hypothesis based on sample data. In this exercise, we are particularly interested in comparing proportions across different groups. Here's what happens step by step:

1. **Establish Hypotheses**: We initially assume a null hypothesis. For our exercise, the null hypothesis \( H_0 \) is that there is no difference in the proportion of lung cancer deaths among the four different cigarette tar levels. In contrast, the alternative hypothesis \( H_A \) asserts that there is a significant difference in these proportions.

2. **Collect Data**: We use the given data, which includes the number of deaths in each tar level category.

3. **Perform the Test**: By calculating a test statistic, like the Chi-Square statistic (detailed in later steps), we can determine the likelihood of the observed data under the null hypothesis.

4. **Decision Making**: If this probability (or p-value) is low, typically less than 0.05, we reject the null hypothesis and accept that there is a significant difference in proportions.
Expected Frequencies
In hypothesis testing, especially when using the Chi-Square Test, expected frequencies are crucial. They represent the frequencies we would expect in each category if the null hypothesis were true. Let's break it down:

- **Calculate Total Cases**: First, sum up all observed frequencies, which represents the total number of cases, in this case, the total number of deaths from lung cancer across all tar levels. This amounts to 1194.

- **Equitable Distribution**: If the null hypothesis is true (no proportional differences), we would expect the deaths to be evenly distributed across the 4 tar level categories.

- **Determine Expected Frequencies**: Therefore, the expected frequency for each category = Total number of cases / Number of categories. For our problem, that's \( 1194 / 4 = 298.5 \).

Expected frequencies give us a baseline against which we can compare the observed frequencies.
Observed Frequencies
Observed frequencies are the actual numbers gathered from data that tell us how often something happens. In our exercise, these frequencies are the counted number of lung cancer deaths at each tar level:

- 0-7 mg: 103
- 8-14 mg: 378
- 15-21 mg: 563
- ≥ 22 mg: 150

These observed frequencies drive our analysis. They help us determine if there's a significant difference from what we expect under the null hypothesis. We compare these observed values to the expected frequencies using a Chi-Square Test. This comparison highlights any disparities and aids in concluding whether these differences are due to chance or a significant factor, like the tar level of cigarettes.

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

The paper "Sociochemosensory and Emotional Functions" (Psychological Science [2009]: \(1118-1124\) ) describes an interesting experiment to determine if college students can identify their roommates by smell. Forty-four female college students participated as subjects in the experiment. Each subject was presented with a set of three t-shirts that were identical in appearance. Each of the three t-shirts had been slept in for at least 7 hours by a person who had not used any scented products (like scented deodorant, soap, or shampoo) for at least 48 hours prior to sleeping in the shirt. One of the three shirts had been worn by the subject's roommate. The subject was asked to identify the shirt worn by her roommate. This process was then repeated with another three shirts, and the number of times out of the two trials that the subject correctly identified the shirt worn by her roommate was recorded. The resulting data is given in the accompanying table. $$ \begin{array}{l|ccc} \hline \text { Number of Correct Identifications } & 0 & 1 & 2 \\ \hline \text { Observed Count } & 21 & 10 & 13 \\ \hline \end{array} $$ a. Can a person identify her roommate by smell? If not, the data from the experiment should be consistent with what we would have expected to see if subjects were just guessing on each trial. That is, we would expect that the probability of selecting the correct shirt would be \(1 / 3\) on each of the two trials. It would then be reasonable to regard the number of correct identifications as a binomial variable with \(n=2\) and \(p=1 / 3\). Use this binomial distribution to compute the proportions of the time we would expect to see 0,1, and 2 correct identifications if subjects are just guessing. b. Use the three proportions computed in Part (a) to carry out a test to determine if the numbers of correct identifications by the students in this study are significantly different than what would have been expected by guessing. Use \(\alpha=.05 .\) (Note: One of the expected counts is just a bit less than \(5 .\) For purposes of this exercise, assume that it is \(\mathrm{OK}\) to proceed with a goodness-of-fit test.)

The paper “Contemporary College Students and Body Piercing" (Journal of Adolescent Health [2004]: 58-61) described a survey of 450 undergraduate students at a state university in the southwestern region of the United States. Each student in the sample was classified according to class standing (freshman, sophomore, junior, or senior) and body art category (body piercings only, tattoos only, both tattoos and body piercings, no body art). Use the data in the accompanying table to determine if there is evidence that there is an association between class standing and response to the body art question. Assume that it is reasonable to regard the sample of students as representative of the students at this university. Use \(\alpha=.01\). $$ \begin{array}{l|cccc} & \text { Body } & & {\text { Both Body }} \\ & \begin{array}{c} \text { Piercings } \\ \text { Only } \end{array} & \begin{array}{c} \text { Tattoos } \\ \text { Only } \end{array} & \begin{array}{c} \text { Piercing and } \\ \text { Tattoos } \end{array} & \begin{array}{c} \text { No Body } \\ \text { Art } \end{array} \\ \hline \text { Freshman } & 61 & 7 & 14 & 86 \\ \text { Sophomore } & 43 & 11 & 10 & 64 \\ \text { Junior } & 20 & 9 & 7 & 43 \\ \text { Senior } & 21 & 17 & 23 & 54 \\ \hline \end{array} $$

Drivers born under the astrological sign of Capricorn are the worst drivers in Australia, according to an article that appeared in the Australian newspaper The Mercury (October 26,1998 ). This statement was based on a study of insurance claims that resulted in the following data for male policyholders of a large insurance company. $$ \begin{array}{lc} \text { Astrological Sign } & \begin{array}{c} \text { Number of } \\ \text { Policyholders } \end{array} \\ \hline \text { Aquarius } & 35,666 \\ \text { Aries } & 37,926 \\ \text { Cancer } & 38,126 \\ \text { Capricorn } & 54,906 \\ \text { Gemini } & 37,179 \\ \text { Leo } & 37,354 \\ \text { Libra } & 37,910 \\ \text { Pisces } & 36,677 \\ \text { Sagittarius } & 34,175 \\ \text { Scorpio } & 35,352 \\ \text { Taurus } & 37,179 \\ \text { Virgo } & 37,718 \\ \hline \end{array} $$ a. Assuming that it is reasonable to treat the male policyholders of this particular insurance company as a random sample of male insured drivers in Australia, are the observed data consistent with the hypothesis that the proportion of male insured drivers is the same for each of the 12 astrological signs? b. Why do you think that the proportion of Capricorn policyholders is so much higher than would be expected if the proportions are the same for all astrological signs? c. Suppose that a random sample of 1000 accident claims submitted to this insurance company is selected and each claim classified according to the astrological sign of the driver. (The accompanying table is consistent with accident rates given in the article.) $$ \begin{array}{lc} & \text { Observed Number } \\ \text { Astrological Sign } & \text { in Sample } \\ \hline \text { Aquarius } & 85 \\ \text { Aries } & 83 \\ \text { Cancer } & 82 \\ \text { Capricorn } & 88 \\ \text { Gemini } & 83 \\ \text { Leo } & 83 \\ \text { Libra } & 83 \\ \text { Pisces } & 82 \\ \text { Sagittarius } & 81 \end{array} $$ $$ \begin{array}{lc} & \text { Observed Number } \\ \text { Astrological Sign } & \text { in Sample } \\ \hline \text { Scorpio } & 85 \\ \text { Taurus } & 84 \\ \text { Virgo } & 81 \\ \hline \end{array} $$ Test the null hypothesis that the proportion of accident claims submitted by drivers of each astrological sign is consistent with the proportion of policyholders of each sign. Use the given information on the distribution of policyholders to compute expected frequencies and then carry out an appropriate test.

A particular state university system has six campuses. On each campus, a random sample of students will be selected, and each student will be categorized with respect to political philosophy as liberal, moderate, or conservative. The null hypothesis of interest is that the proportion of students falling in these three categories is the same at all six campuses. a. On how many degrees of freedom will the resulting \(X^{2}\) test be based? b. How does your answer in Part (a) change if there are seven campuses rather than six? c. How does your answer in Part (a) change if there are four rather than three categories for political philosophy?

The 2006 Expedia Vacation Deprivation Survey (Ipsos Insight, May 18,2006 ) described the results of a poll of working adults in Canada. Each person in a random sample was classified according to gender and the number of vacation days he or she usually took each year. The resulting data are summarized in the given table. Is it reasonable to conclude that there is an association between gender and the number of vacation days taken? To what population would it be reasonable to generalize this conclusion? $$ \begin{array}{l|rc} & \text { Gender } \\ \hline \text { Days of Vacation } & \text { Male } & \text { Female } \\ \hline \text { None } & 51 & 42 \\ 1-5 & 21 & 25 \\ 6-10 & 67 & 79 \\ 11-15 & 111 & 94 \\ 16-20 & 71 & 70 \\ 21-25 & 82 & 58 \\ \text { More than } 25 & 118 & 79 \\ \hline \end{array} $$

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.