/*! This file is auto-generated */ .wp-block-button__link{color:#fff;background-color:#32373c;border-radius:9999px;box-shadow:none;text-decoration:none;padding:calc(.667em + 2px) calc(1.333em + 2px);font-size:1.125em}.wp-block-file__button{background:#32373c;color:#fff;text-decoration:none} Problem 8 Test the given claim. Data Set... [FREE SOLUTION] | 91Ó°ÊÓ

91Ó°ÊÓ

Test the given claim. Data Set 12 "Passive and Active Smoke" includes cotinine levels measured in a group of smokers \((n=40, \bar{x}=172.48 \mathrm{ng} / \mathrm{mL}, s=119.50 \mathrm{ng} / \mathrm{mL})\) and a group of nonsmokers not exposed to tobacco smoke \((n=40, \bar{x}=16.35 \mathrm{ng} / \mathrm{mL}, s=62.53 \mathrm{ng} / \mathrm{mL})\) Cotinine is a metabolite of nicotine, meaning that when nicotine is absorbed by the body, cotinine is produced. a. Use a 0.05 significance level to test the claim that the variation of cotinine in smokers is greater than the variation of cotinine in nonsmokers not exposed to tobacco smoke. b. The 40 cotinine measurements from the nonsmoking group consist of these values (all in ng/mL): \(1,1,90,244,309,\) and 35 other values that are all \(0 .\) Does this sample appear to be from a normally distributed population? If not, how are the results from part (a) affected?

Short Answer

Expert verified
Reject the null hypothesis. The variation in cotinine levels in smokers is greater than in nonsmokers. The nonsmoker data doesn't appear to be normally distributed, potentially affecting results.

Step by step solution

01

State the Hypotheses

Identify the null and alternative hypotheses. The null hypothesis (H_0) is that the variance of cotinine levels in smokers (σ^2_smokers) is equal to the variance of cotinine levels in nonsmokers (σ^2_nonsmokers). The alternative hypothesis (H_1) is that the variance in smokers is greater than the variance in nonsmokers. Symbolically, H_0: σ^2_smokers = σ^2_nonsmokers or H_0: σ^2_smokers / σ^2_nonsmokers = 1 H_1: σ^2_smokers > σ^2_nonsmokers or H_1: σ^2_smokers / σ^2_nonsmokers > 1
02

Choose the Significance Level

The problem states to use a significance level of α = 0.05.
03

Compute the Test Statistic

To compare variances, use the F-test: F = s^2_smokers / s^2_nonsmokers Using the given data: s^2_smokers = (119.50)^2 = 14280.25 s^2_nonsmokers = (62.53)^2 = 3908.80 F = 14280.25 / 3908.80 ≈ 3.655
04

Determine the Critical Value

With α = 0.05 and df1 = 39 (for the numerator, smokers) and df2 = 39 (for the denominator, nonsmokers), use the F-distribution table to find the critical value F_crit. For α = 0.05, df1 = 39, df2 = 39: F_crit ≈ 1.86.
05

Compare and Make a Decision

Compare the calculated F-statistic with the critical value. F = 3.655 F_crit = 1.86 Since F > F_crit , reject the null hypothesis. Conclusion: There is evidence to support the claim that the variation of cotinine levels in smokers is greater than in nonsmokers.
06

Checking Normal Distribution in Nonsmokers

Examine the nonsmokers' data: [1, 1, 90, 244, 309,and 35 values of 0]. Most values are 0, with a few large values, suggesting a non-normal distribution. Since normality of the data is an assumption for the F-test, the results from part (a) might be affected if the assumption is violated.
07

Discuss the Effect of Non-Normality

Because the nonsmoker data does not appear to come from a normally distributed population, the F-test results may be unreliable. Variance tests like the F-test assume normality, and non-normal data can affect the validity of the test. Alternative non-parametric tests or transformation of data may be considered.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with 91Ó°ÊÓ!

Key Concepts

These are the key concepts you need to understand to accurately answer the question.

F-test
When we want to compare the variances of two populations, we use an F-test. The F-test evaluates if one population's variance is significantly greater than the other's. Variance measures how spread out the values are in a data set. In our case, we have cotinine levels for a group of smokers and nonsmokers. The F-test formula is simple: \[F = \frac{s^2_{\text{group 1}}}{s^2_{\text{group 2}}} \] where \(s^2\) represents the sample variance.
For this exercise, smokers are considered group 1 and nonsmokers are group 2. We can compute the sample variances and then the F value. If this computed F value is greater than the predetermined critical value from the F-distribution table, we reject the null hypothesis. This suggests that the variances are statistically different and, in our specific hypothesis, that the variance in smokers is greater.
Significance Level
The significance level, denoted as \( \alpha \), is the threshold we set to decide whether to reject the null hypothesis. It represents the probability of rejecting the null hypothesis when it is actually true. Commonly used significance levels are 0.05, 0.01, and 0.10. In this problem, we use a 0.05 significance level. This means there's a 5% risk of concluding that there is a difference in variance when there is none.
Setting \( \alpha = 0.05 \) helps balance between being too lenient and too strict in our test criteria. We then compare our calculated test statistic with the critical value corresponding to our chosen \( \alpha \) to make our decision.
Normal Distribution
Normal distribution is a key assumption in many statistical methods, including the F-test. It's a bell-shaped curve where most data points cluster around the mean, and the probabilities for values taper off symmetrically as they move further from the mean. This exercise's part (b) examines whether the nonsmokers' data appears to come from a normally distributed population.
Given the data [1, 1, 90, 244, 309, and 35 values of 0], it looks skewed rather than normally distributed. This can affect the validity of our F-test because the test assumes normality. If a dataset significantly deviates from normal distribution, the reliability of the F-test results declines. In such cases, alternative methods like non-parametric tests might be necessary.
Statistical Variance
Statistical variance measures the dispersion of data points in a data set relative to the mean. It is the average of the squared differences from the mean, providing insight into how ‘spread out’ the data is. The formula for variance in a sample is: \[ s^2 = \frac{\sum (x_i - \overline{x})^2}{n-1} \] where \( x_i \) are the data points, \( \overline{x} \) is the sample mean, and \( n \) is the number of data points.
In our exercise, we compare the variances of cotinine levels in smokers and nonsmokers. Higher variance indicates greater spread. For instance, the variance for smokers and nonsmokers came out to be quite different, contributing to the F-value calculation. This variance comparison helps determine if the spread of cotinine levels is significantly different between the two groups.

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Test the given claim. Blanking Out on Tests Many students have had the unpleasant experience of panicking on a test because the first question was exceptionally difficult. The arrangement of test items was studied for its effect on anxiety. The following scores are measures of "debilitating test anxiety" which most of us call panic or blanking out (based on data from "Item Arrangement, Cognitive Entry Characteristics, Sex and Test Anxiety as Predictors of Achievement in Examination Performance,"by Klimko, Joumal of Experimental Education, Vol. 52, No. 4.) Using a 0.05 significance level, test the claim that the two populations of scores have different amounts of variation. $$\begin{array}{|c|c|c|c|c|} \hline \multicolumn{5}{|c|} {\text { Questions Arranged from Easy to Difficult }} \\ \hline 24.64 & 39.29 & 16.32 & 32.83 & 28.02 \\ \hline 33.31 & 20.60 & 21.13 & 26.69 & 28.90 \\ \hline 26.43 & 24.23 & 7.10 & 32.86 & 21.06 \\ \hline 28.89 & 28.71 & 31.73 & 30.02 & 21.96 \\ \hline 25.49 & 38.81 & 27.85 & 30.29 & 30.72 \\ \hline \end{array}$$ $$\begin{array}{|c|c|c|c|} \hline \multicolumn{3}{|c|} {\text { Questions Arranged from Difficult to Easy }} \\ \hline 33.62 & 34.02 & 26.63 & 30.26 \\ \hline 35.91 & 26.68 & 29.49 & 35.32 \\ \hline 27.24 & 32.34 & 29.34 & 33.53 \\ \hline 27.62 & 42.91 & 30.20 & 32.54 \\ \hline \end{array}$$

Use the listed paired sample data, and assume that the samples are simple random samples and that the differences have a distribution that is approximately normal. Listed below are body temperatures from seven different subjects measured at two different times in a day (from Data Set 3 "Body Temperatures" in Appendix B). a. Use a 0.05 significance level to test the claim that there is no difference between body temperatures measured at \(8 \mathrm{AM}\) and at \(12 \mathrm{AM}\). b. Construct the confidence interval that could be used for the hypothesis test described in part (a). What feature of the confidence interval leads to the same conclusion reached in part (a)? $$\begin{array}{l|l|l|l|l|l|l|l} \hline \text { Body Temperature ("F) at 8 AM } & 96.6 & 97.0 & 97.0 & 97.8 & 97.0 & 97.4 & 96.6 \\ \hline \text { Body Temperature ('F) at 12 AM } & 99.0 & 98.4 & 98.0 & 98.6 & 98.5 & 98.9 & 98.4 \\ \hline \end{array}$$

Use the listed paired sample data, and assume that the samples are simple random samples and that the differences have a distribution that is approximately normal. A study was conducted to investigate the effectiveness of hypnotism in reducing pain. Results for randomly selected subjects are given in the accompanying table (based on "An Analysis of Factors That Contribute to the Efficacy of Hypnotic Analgesia," by Price and Barber, Journal of Abnormal Psychology, Vol. 96, No. 1). The values are before and after hypnosis; the measurements are in centimeters on a pain scale. Higher values correspond to greater levels of pain. Construct a \(95 \%\) confidence interval for the mean of the "before/after" differences. Does hypnotism appear to be effective in reducing pain? $$\begin{array}{l|c|c|c|c|c|c|c|c} \hline \text { Subject } & \text { A } & \text { B } & \text { C } & \text { D } & \text { E } & \text { F } & \text { G } & \text { H } \\ \hline \text { Before } & 6.6 & 6.5 & 9.0 & 10.3 & 11.3 & 8.1 & 6.3 & 11.6 \\ \hline \text { After } & 6.8 & 2.4 & 7.4 & 8.5 & 8.1 & 6.1 & 3.4 & 2.0 \\ \hline \end{array}$$

Assume that the two samples are independent simple random samples selected from normally distributed populations, and do not assume that the population standard deviations are equal. (Note: Answers in Appendix D include technology answers based on Formula 9 - 1 along with "Table" answers based on Table \(A\) - 3 with df equal to the smaller of \(\boldsymbol{n}_{I}-\boldsymbol{I}\) and \(\boldsymbol{n}_{2}-\boldsymbol{I} .\) ) Are Quarters Now Lighter? Weights of quarters are carefully considered in the design of the vending machines that we have all come to know and love. Data Set 29 "Coin Weights" in Appendix B includes weights of a sample of pre-1964 quarters \((n=40, \bar{x}=6.19267 \mathrm{g}\) \(s=0.08700 \mathrm{g}\) ) and weights of a sample of post- 1964 quarters \((n=40, \bar{x}=5.63930 \mathrm{g}\) \(s=0.06194 \mathrm{g}\) a. Use a 0.05 significance level to test the claim that pre-1964 quarters have a mean weight that is greater than the mean weight of post-1964 quarters. b. Construct a confidence interval appropriate for the hypothesis test in part (a). c. Do post-1964 quarters appear to weigh less than before \(1964 ?\) If so, why aren't vending machines affected very much by the difference?

Use the listed paired sample data, and assume that the samples are simple random samples and that the differences have a distribution that is approximately normal. Listed below are heights (in.) of mothers and their first daughters. The data are from a journal kept by Francis Galton. (See Data Set 5 "Family Heights" in Appendix B.) Use a 0.05 significance level to test the claim that there is no difference in heights between mothers and their first daughters. $$\begin{array}{l|l|l|l|l|l|l|l|l|l|l} \hline \text { Height of Mother } & 68.0 & 60.0 & 61.0 & 63.5 & 69.0 & 64.0 & 69.0 & 64.0 & 63.5 & 66.0 \\ \hline \text { Height of Daughter } & 68.5 & 60.0 & 63.5 & 67.5 & 68.0 & 65.5 & 69.0 & 68.0 & 64.5 & 63.0 \\ \hline \end{array}$$

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.