Chapter 12: Problem 28
What diagnostic plot can you use to determine whether the data satisfy the normality assumption? What should the plot look like for normal residuals?
/*! This file is auto-generated */ .wp-block-button__link{color:#fff;background-color:#32373c;border-radius:9999px;box-shadow:none;text-decoration:none;padding:calc(.667em + 2px) calc(1.333em + 2px);font-size:1.125em}.wp-block-file__button{background:#32373c;color:#fff;text-decoration:none}
Learning Materials
Features
Discover
Chapter 12: Problem 28
What diagnostic plot can you use to determine whether the data satisfy the normality assumption? What should the plot look like for normal residuals?
All the tools & learning materials you need for study success - in one app.
Get started for free
How is the cost of a plane flight related to the length of the trip? The table shows the average round-trip coach airfare paid by customers of American Airlines on each of 18 heavily traveled U.S. air routes. $$ \begin{array}{lrr} & \text { Distance } & \\ \text { Route } & \text { (miles) } & \text { Cost } \\ \hline \text { Dallas-Austin } & 178 & \$ 125 \\ \text { Houston-Dallas } & 232 & 123 \\ \text { Chicago-Detroit } & 238 & 148 \\ \text { Chicago-St. Louis } & 262 & 136 \\ \text { Chicago-Cleveland } & 301 & 129 \\ \text { Chicago-Atlanta } & 593 & 162 \\ \text { New York-Miami } & 1092 & 224 \\ \text { New York-San Juan } & 1608 & 264 \\ \text { New York-Chicago } & 714 & 287 \\ \text { Chicago-Denver } & 901 & 256 \\ \text { Dallas-Salt Lake } & 1005 & 365 \\ \text { New York-Dallas } & 1374 & 459 \\ \text { Chicago-Seattle } & 1736 & 424 \\ \text { Los Angeles-Chicago } & 1757 & 361 \\ \text { Los Angeles-Atlanta } & 1946 & 309 \\ \text { New York-Los Angeles } & 2463 & 444 \\ \text { Los Angeles-Honolulu } & 2556 & 323 \\ \text { New York-San Francisco } & 2574 & 513 \end{array} $$ a. If you want to estimate the cost of a flight based on the distance traveled, which variable is the response variable and which is the independent predictor variable? b. Assume that there is a linear relationship between cost and distance. Calculate the least-squares regression line describing cost as a linear function of distance. c. Plot the data points and the regression line. Does it appear that the line fits the data? d. Use the appropriate statistical tests and measures to explain the usefulness of the regression model for predicting cost.
Some varieties of nematodes, roundworms that live in the soil and frequently are so small as to be invisible to the naked eye, feed on the roots of lawn grasses and other plants. This pest, which is particularly troublesome in warm climates, can be treated by the application of nematicides. Data collected on the percent kill of nematodes for various rates of application (dosages given in pounds per acre of active ingredient) are as follows: $$ \begin{array}{l|l|l|l|l} \text { Rate of Application, } x & 2 & 3 & 4 & 5 \\ \hline \text { Percent Kill, } y & 50,56,48 & 63,69,71 & 86,82,76 & 94,99,97 \end{array} $$ Use an appropriate computer printout to answer these questions: a. Calculate the coefficient of correlation \(r\) between rates of application \(x\) and percent kill \(y\) b. Calculate the coefficient of determination \(r^{2}\) and interpret. c. Fit a least-squares line to the data. d. Suppose you wish to estimate the mean percent kill for an application of 4 pounds of the nematicide per acre. What do the diagnostic plots generated by MINITAB tell you about the validity of the regression assumptions? Which assumptions may have been violated? Can you explain why?
The Academic Performance Index (API) is a measure of school achievement based on the results of the Stan- ford 9 Achievement test. Scores range from 200 to 1000 , with 800 considered a long-range goal for schools. The following table shows the API for eight elementary schools in Riverside County, California, along with the percent of students at that school who are considered English Language Learners (ELL). \(^{3}\) $$ \begin{array}{lrrrrrrrr} \text { School } & 1 & 2 & 3 & 4 & 5 & 6 & 7 & 8 \\ \hline \text { API } & 588 & 659 & 710 & 657 & 669 & 641 & 557 & 743 \\ \text { ELL } & 58 & 22 & 14 & 30 & 11 & 26 & 39 & 6 \end{array} $$ a. Which of the two variables is the independent variable and which is the dependent variable? Explain your choice. b. Use a scatterplot to plot the data. Is the assumption of a linear relationship between \(x\) and \(y\) reasonable? c. Assuming that \(x\) and \(y\) are linearly related, calculate the least-squares regression line. d. Plot the line on the scatterplot in part b. Does the line fit through the data points?
In Exercise we described an informal experiment conducted at McNair Academic High School in Jersey City, New Jersey. Two freshman algebra classes were studied, one of which used laptop computers at school and at home, while the other class did not. In each class, students were given a survey at the beginning and end of the semester, measuring his or her technological level. The scores were recorded for the end of semester survey \((x)\) and the final examination \((y)\) for the laptop group. \({ }^{6}\) The data and the MINITAB printout are shown here. $$ \begin{array}{crr|ccc} & & \text { Final } & & & \text { Final } \\ \text { Student } & \text { Posttest } & \text { Exam } & \text { Student } & \text { Posttest } & \text { Exam } \\ \hline 1 & 100 & 98 & 11 & 88 & 84 \\ 2 & 96 & 97 & 12 & 92 & 93 \\ 3 & 88 & 88 & 13 & 68 & 57 \\ 4 & 100 & 100 & 14 & 84 & 84 \\ 5 & 100 & 100 & 15 & 84 & 81 \\ 6 & 96 & 78 & 16 & 88 & 83 \\ 7 & 80 & 68 & 17 & 72 & 84 \\ 8 & 68 & 47 & 18 & 88 & 93 \\ 9 & 92 & 90 & 19 & 72 & 57 \\ 10 & 96 & 94 & 20 & 88 & 83 \end{array} $$ a. Construct a scatterplot for the data. Does the assumption of linearity appear to be reasonable? b. What is the equation of the regression line used for predicting final exam score as a function of the posttest score? c. Do the data present sufficient evidence to indicate that final exam score is linearly related to the posttest score? Use \(\alpha=.01\) d. Find a \(99 \%\) confidence interval for the slope of the regression line.
How good are you EX1212 at estimating? To test a subject's ability to estimate sizes, he was shown 10 different objects and asked to estimate their length or diameter. The object was then measured, and the results were recorded in the table below. $$ \begin{array}{lrr} \text { Object } & \text { Estimated (inches) } & \text { Actual (inches) } \\\ \hline \text { Pencil } & 7.00 & 6.00 \\ \text { Dinner plate } & 9.50 & 10.25 \\ \text { Book 1 } & 7.50 & 6.75 \\ \text { Cell phone } & 4.00 & 4.25 \\ \text { Photograph } & 14.50 & 15.75 \\ \text { Toy } & 3.75 & 5.00 \\ \text { Belt } & 42.00 & 41.50 \\ \text { Clothespin } & 2.75 & 3.75 \\ \text { Book 2 } & 10.00 & 9.25 \\ \text { Calculator } & 3.50 & 4.75 \end{array} $$ a. Find the least-squares regression line for predicting the actual measurement as a function of the estimated measurement. b. Plot the points and the fitted line. Does the assumption of a linear relationship appear to be reasonable?
What do you think about this solution?
We value your feedback to improve our textbook solutions.