Chapter 13: Problem 5
Explain the difference between a simple and a multiple regression model.
/*! This file is auto-generated */ .wp-block-button__link{color:#fff;background-color:#32373c;border-radius:9999px;box-shadow:none;text-decoration:none;padding:calc(.667em + 2px) calc(1.333em + 2px);font-size:1.125em}.wp-block-file__button{background:#32373c;color:#fff;text-decoration:none}
Learning Materials
Features
Discover
Chapter 13: Problem 5
Explain the difference between a simple and a multiple regression model.
All the tools & learning materials you need for study success - in one app.
Get started for free
The following table lists the midterm and final exam scores for seven students in a statistics class. $$ \begin{array}{l|lllllll} \hline \text { Midterm score } & 79 & 95 & 81 & 66 & 87 & 94 & 59 \\ \hline \text { Final exam score } & 85 & 97 & 78 & 76 & 94 & 84 & 67 \\ \hline \end{array} $$ a. Do you expect the midterm and final exam scores to be positively or negatively related? b. Plot a scatter diagram. By looking at the scatter diagram, do you expect the correlation coefficient between these two variables to be close to zero, 1 , or \(-1\) ? c. Find the correlation coefficient. Is the value of \(r\) consistent with what you expected in parts a and \(\mathrm{b}\) ? d. Using the \(1 \%\) significance level, test whether the linear correlation coefficient is positive.
Suppose that you work part-time at a bowling alley that is open daily from noon to midnight. Although business is usually slow from noon to 6 P.M., the owner has noticed that it is better on hotter days during the summer, perhaps because the premises are comfortably air-conditioned. The owner shows you some data that she gathered last summer. This data set includes the maximum temperature and the number of lines bowled between noon and 6 P.M. for each of 20 days. (The maximum temperatures ranged from \(77^{\circ} \mathrm{F}\) to \(95^{\circ} \mathrm{F}\) during this period.) The owner would like to know if she can estimate tomorrow's business from noon to 6 P.M. by looking at tomorrow's weather forecast. She asks you to analyze the data. Let \(x\) be the maximum temperature for a day and \(y\) the number of lines bowled between noon and 6 P.M. on that day. The computer output based on the data for 20 days provided the following results: \(\hat{y}=-432+7.7 x, \quad s_{e}=28.17, \quad \mathrm{SS}_{x x}=607, \quad\) and \(\quad \bar{x}=87.5\) Assume that the weather forecasts are reasonably accurate. a. Does the maximum temperature seem to be a useful predictor of bowling activity between noon and 6 P.M.? Use an appropriate statistical procedure based on the information given. Use \(\alpha=.05\) b. The owner wants to know how many lines of bowling she can expect, on average, for days with a maximum temperature of \(90^{\circ}\). Answer using a \(95 \%\) confidence level. c. The owner has seen tomorrow's weather forecast, which predicts a high of \(90^{\circ} \mathrm{F}\). About how many lines of bowling can she expect? Answer using a \(95 \%\) confidence level. d. Give a brief commonsense explanation to the owner for the difference in the interval estimates of parts \(\mathrm{b}\) and \(\mathrm{c}\). e. The owner asks you how many lines of bowling she could expect if the high temperature were \(100^{\circ} \mathrm{F}\). Give a point estimate, together with an appropriate warning to the owner.
Explain the meaning of independent and dependent variables for a regression model.
Two variables \(x\) and \(y\) have a negative linear relationship. Explain what happens to the value of \(y\) when \(x\) increases.
Will you expect a positive, zero, or negative linear correlation between the two variables for each of the following examples? a. SAT scores and GPAs of students b. Stress level and blood pressure of individuals c. Amount of fertilizer used and yield of corn per acre d. Ages and prices of houses e. Heights of husbands and incomes of their wives
What do you think about this solution?
We value your feedback to improve our textbook solutions.