/*! This file is auto-generated */ .wp-block-button__link{color:#fff;background-color:#32373c;border-radius:9999px;box-shadow:none;text-decoration:none;padding:calc(.667em + 2px) calc(1.333em + 2px);font-size:1.125em}.wp-block-file__button{background:#32373c;color:#fff;text-decoration:none} Problem 9 Use the listed paired sample dat... [FREE SOLUTION] | 91Ó°ÊÓ

91Ó°ÊÓ

Use the listed paired sample data, and assume that the samples are simple random samples and that the differences have a distribution that is approximately normal. Listed below are heights (in.) of mothers and their first daughters. The data are from a journal kept by Francis Galton. (See Data Set 5 "Family Heights" in Appendix B.) Use a 0.05 significance level to test the claim that there is no difference in heights between mothers and their first daughters. $$\begin{array}{l|l|l|l|l|l|l|l|l|l|l} \hline \text { Height of Mother } & 68.0 & 60.0 & 61.0 & 63.5 & 69.0 & 64.0 & 69.0 & 64.0 & 63.5 & 66.0 \\ \hline \text { Height of Daughter } & 68.5 & 60.0 & 63.5 & 67.5 & 68.0 & 65.5 & 69.0 & 68.0 & 64.5 & 63.0 \\ \hline \end{array}$$

Short Answer

Expert verified
Calculate mean and standard deviation of differences, perform t-test, and compare against critical value to decide whether to reject or not reject the null hypothesis.

Step by step solution

01

- State Hypotheses

Set up the null and alternative hypotheses. The null hypothesis \(H_0\) is that there is no difference in heights between mothers and their first daughters, which means the mean difference \(\mu_d = 0\). The alternative hypothesis \(H_1\) is that there is a difference in heights, i.e., \(\mu_d \eq 0\).
02

- Calculate Differences

For each mother-daughter pair, calculate the difference \(d = \text{Mother's Height} - \text{Daughter's Height}\). The differences are: \(68.0 - 68.5 = -0.5\), \(60.0 - 60.0 = 0.0\), \(61.0 - 63.5 = -2.5\), \(63.5 - 67.5 = -4.0\), \(69.0 - 68.0 = 1.0\), \(64.0 - 65.5 = -1.5\), \(69.0 - 69.0 = 0.0\), \(64.0 - 68.0 = -4.0\), \(63.5 - 64.5 = -1.0\), \(66.0 - 63.0 = 3.0\).
03

- Compute Mean and Standard Deviation of Differences

Calculate the mean \(\bar{d}\) and the standard deviation \(s_d\) of the differences obtained in Step 2. The mean is calculated as \(\bar{d} = \frac{\sum d_i}{n}\) and the standard deviation is calculated using \(\text{s}_d = \sqrt{\frac{\sum (d_i - \bar{d})^2}{n-1}}\).
04

- Perform t-test

Use the following t-test formula for the paired differences: \[ t = \frac{\bar{d}}{\frac{s_d}{\sqrt{n}}} \] where \(\bar{d}\) is the mean difference, \(s_d\) is the standard deviation of differences, and \(n\) is the number of pairs. The degrees of freedom (df) are \(n-1\).
05

- Compare with Critical Value

Look up the critical value of t for \(df = n-1\) at the \0.05\ significance level for a two-tailed test. Compare the calculated t value from Step 4 with this critical value.
06

- Make a Decision

If the absolute value of the calculated t is greater than the critical value, reject the null hypothesis. Otherwise, fail to reject the null hypothesis.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with 91Ó°ÊÓ!

Key Concepts

These are the key concepts you need to understand to accurately answer the question.

Hypothesis Testing
Hypothesis testing is a statistical method used to make decisions about the population based on sample data. In this particular problem, we test if there is a significant difference in heights between mothers and their first daughters. First, we set up our null hypothesis \(H_0\), which posits that there is no difference in the heights, i.e., \( \mu_d = 0 \). The alternative hypothesis \(H_1\) suggests that there is a difference in heights, i.e., \( \mu_d \eq 0 \).

We then choose a significance level, which in this exercise is 0.05. This indicates that we are willing to accept a 5% chance of rejecting the true null hypothesis. The steps involve calculating certain statistics from the data and comparing those statistics with critical values from statistical tables to make a decision.

Hypothesis testing allows us to quantify the evidence against \(H_0\), and decide if any observed differences are statistically significant.
Mean Difference
The mean difference is a critical value in paired t-tests. It's the average of all individual differences between paired observations. By subtracting the height of each daughter from her mother’s height, we obtain a set of differences \(d_i \). These differences are: \(-0.5, 0.0, -2.5, -4.0, 1.0, -1.5, 0.0, -4.0, -1.0, 3.0 \).

We then calculate the mean of these differences using the formula: \(\bar{d} = \frac{\text{Sum of differences}}{n} \). For 10 mother-daughter pairs, this yields the average difference.

The mean difference \(\bar{d} \) helps in determining if the observed differences are substantial or merely due to random chance. In the context of the hypothesis test, if \(\bar{d} \) significantly deviates from 0, we may consider rejecting \(H_0\).
Standard Deviation
Standard deviation measures the dispersion or spread of differences from the mean. Higher standard deviations indicate greater variability among data points, while lower standard deviations reflect data that are more clustered around the mean difference.

To find the standard deviation \(s_d \) of the differences, use: \[\text{s}_d = \sqrt{\frac{ \sum (d_i - \bar{d})^2 }{n-1}} \].

This involves computing the differences between each individual difference and the mean difference, squaring them, summing them up, dividing by \(n-1 \) (with \ \ being the total number of pairs), and then taking the square root of the result.

The standard deviation is crucial because it normalizes the mean difference when calculating the t-statistic, making it comparable across different datasets.
Critical Value
The critical value is a threshold that helps us determine if the computed test statistic is statistically significant. It is based on the chosen significance level and the degrees of freedom (df). For our exercise, at the 0.05 significance level with 9 degrees of freedom (because there are 10 pairs), we look this value up in a t-distribution table.

If the absolute value of our calculated t-statistic exceeds the critical value, we reject \(H_0\). If it does not, we fail to reject \(H_0\). This comparison allows us to decide if the observed sample data are consistent with the null hypothesis.
Random Samples
Random samples are crucial in hypothesis testing because they ensure that each member of the population has an equal chance of being included. This randomness eliminates bias and allows for the generalization of the results to the broader population. In this exercise, the mother-daughter heights are assumed to be simple random samples, which means they are taken in such a way that each mother-daughter pair is independently and randomly chosen.

This randomization gives the paired t-test validity as it makes reasonable assumptions about the independence of observations and the normality of differences. Without random sampling, our conclusions could be skewed, limiting the reliability of the test outcomes.

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Test the given claim. Identify the null hypothesis, alternative hypothesis, test statistic, P-value or critical value(s), then state the conclusion about the null hypothesis, as well as the final conclusion that addresses the original claim. Since the Hawk-Eye instant replay system for tennis was introduced at the U.S. Open in \(2006,\) men challenged 2441 referee calls, with the result that 1027 of the calls were overturned. Women challenged 1273 referee calls, and 509 of the calls were overturned. We want to use a 0.05 significance level to test the claim that men and women have equal success in challenging calls. a. Test the claim using a hypothesis test. b. Test the claim by constructing an appropriate confidence interval. c. Based on the results, does it appear that men and women have equal success in challenging calls?

Use the listed paired sample data, and assume that the samples are simple random samples and that the differences have a distribution that is approximately normal. Listed below are "attractiveness" ratings made by participants in a speed dating session. Each attribute rating is the sum of the ratings of five attributes (sincerity, intelligence, fun, ambition, shared interests). The listed ratings are from Data Set 18 "Speed Dating." Use a 0.05 significance level to test the claim that there is a difference between female attractiveness ratings and male attractiveness ratings. $$\begin{array}{|l|l|l|l|l|l|l|l|l|l|l|l|l|} \hline \text { Rating of Male by Female } & 4.0 & 8.0 & 7.0 & 7.0 & 6.0 & 8.0 & 6.0 & 4.0 & 2.0 & 5.0 & 9.5 & 7.0 \\ \hline \text { Rating of Female by Male } & 6.0 & 8.0 & 7.0 & 9.0 & 5.0 & 7.0 & 5.0 & 4.0 & 6.0 & 8.0 & 6.0 & 5.0 \\ \hline \end{array}$$

Assume that the two samples are independent simple random samples selected from normally distributed populations, and do not assume that the population standard deviations are equal. (Note: Answers in Appendix D include technology answers based on Formula 9 - 1 along with "Table" answers based on Table \(A\) - 3 with df equal to the smaller of \(\boldsymbol{n}_{I}-\boldsymbol{I}\) and \(\boldsymbol{n}_{2}-\boldsymbol{I} .\) ) Second-Hand Smoke Data Set 12 "Passive and Active Smoke" in Appendix B includes cotinine levels measured in a group of nonsmokers exposed to tobacco smoke \((n=40,\) \(\bar{x}=60.58 \mathrm{ng} / \mathrm{mL}, s=138.08 \mathrm{ng} / \mathrm{mL})\) and a group of nonsmokers not exposed to tobacco smoke \((n=40, \bar{x}=16.35 \mathrm{ng} / \mathrm{mL}, s=62.53 \mathrm{ng} / \mathrm{mL}) .\) Cotinine is a metabolite of nicotine, meaning that when nicotine is absorbed by the body, cotinine is produced. a. Use a 0.05 significance level to test the claim that nonsmokers exposed to tobacco smoke have a higher mean cotinine level than nonsmokers not exposed to tobacco smoke. b. Construct the confidence interval appropriate for the hypothesis test in part (a). c. What do you conclude about the effects of second-hand smoke?

Use the listed paired sample data, and assume that the samples are simple random samples and that the differences have a distribution that is approximately normal. Listed below are body temperatures from seven different subjects measured at two different times in a day (from Data Set 3 "Body Temperatures" in Appendix B). a. Use a 0.05 significance level to test the claim that there is no difference between body temperatures measured at \(8 \mathrm{AM}\) and at \(12 \mathrm{AM}\). b. Construct the confidence interval that could be used for the hypothesis test described in part (a). What feature of the confidence interval leads to the same conclusion reached in part (a)? $$\begin{array}{l|l|l|l|l|l|l|l} \hline \text { Body Temperature ("F) at 8 AM } & 96.6 & 97.0 & 97.0 & 97.8 & 97.0 & 97.4 & 96.6 \\ \hline \text { Body Temperature ('F) at 12 AM } & 99.0 & 98.4 & 98.0 & 98.6 & 98.5 & 98.9 & 98.4 \\ \hline \end{array}$$

Assume that the two samples are independent simple random samples selected from normally distributed populations, and do not assume that the population standard deviations are equal. (Note: Answers in Appendix D include technology answers based on Formula 9 - 1 along with "Table" answers based on Table \(A\) - 3 with df equal to the smaller of \(\boldsymbol{n}_{I}-\boldsymbol{I}\) and \(\boldsymbol{n}_{2}-\boldsymbol{I} .\) ) Color and Cognition Researchers from the University of British Columbia conducted a study to investigate the effects of color on cognitive tasks. Words were displayed on a computer screen with background colors of red and blue. Results from scores on a test of word recall are given below. Higher scores correspond to greater word recall. a. Use a 0.05 significance level to test the claim that the samples are from populations with the same mean. b. Construct a confidence interval appropriate for the hypothesis test in part (a). What is it about the confidence interval that causes us to reach the same conclusion from part (a)? c. Does the background color appear to have an effect on word recall scores? If so, which color appears to be associated with higher word memory recall scores? $$\begin{array}{l|l} \hline \text { Red Background } & n=35, \mathrm{x}=15.89, \mathrm{s}=5.90 \\ \hline \text { Blue Background } & n=36, \mathrm{x}=12.31, \mathrm{s}=5.48 \\ \hline \end{array}$$

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.