/*! This file is auto-generated */ .wp-block-button__link{color:#fff;background-color:#32373c;border-radius:9999px;box-shadow:none;text-decoration:none;padding:calc(.667em + 2px) calc(1.333em + 2px);font-size:1.125em}.wp-block-file__button{background:#32373c;color:#fff;text-decoration:none} Problem 101 A doctor is studying cholesterol... [FREE SOLUTION] | 91Ó°ÊÓ

91Ó°ÊÓ

A doctor is studying cholesterol readings in his patients. After reviewing the cholesterol readings, he calls the patients with the highest cholesterol readings (the top \(5 \%\) of readings in his office) and asks them to come back to discuss cholesterol-lowering methods. When he tests these patients a second time, the average cholesterol readings tend to have gone down somewhat. Explain what statistical phenomenon might have been partly responsible for this lowering of the readings.

Short Answer

Expert verified
The statistical phenomenon responsible for the second reading's average cholesterol being lower is 'Regression to the Mean'. This concept posits that when an extreme value, like a high cholesterol reading, is measured again, it is likely to be closer to the average.

Step by step solution

01

Explaining Regression to the Mean

Regression to the Mean is a statistical concept that suggests if a variable is extreme the first time you measure it, it will be closer to the average the next time you measure it. This phenomenon occurs due to a random variation.
02

Applying the concept to the exercise

In this case, the doctor chose the patients with the highest cholesterol readings - who had extreme values. By inviting these patients for a second test, there is a good chance that their cholesterol readings will regress towards the average, appearing as though their cholesterol levels have decreased.
03

Other factors to consider

Note that it is possible that other factors could have given a real lowering of cholesterol levels between the two readings, like changes made by patients in response to the initial high readings. However, even in the absence of these factors, Regression to the Mean is expected to take place.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with 91Ó°ÊÓ!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

The following table gives the distance from Boston to each city (in thousands of miles) and gives the time for one randomly chosen, commercial airplane to make that flight. Do a complete regression analysis that includes a scatterplot with the line, interprets the slope and intercept, and predicts how much time a nonstop flight from Boston to Seattle would take. The distance from Boston to Seattle is 3000 miles. See page 209 for guidance. $$ \begin{array}{|lcc|} \hline \text { City } & \begin{array}{c} \text { Distance } \\ \text { (1000s of miles) } \end{array} & \text { Time (hours) } \\ \hline \text { St. Louis } & 1.141 & 2.83 \\ \hline \text { Los Angeles } & 2.979 & 6.00 \\ \hline \text { Paris } & 3.346 & 7.25 \\ \hline \text { Denver } & 1.748 & 4.25 \\ \hline \text { Salt Lake City } & 2.343 & 5.00 \\ \hline \text { Houston } & 1.804 & 4.25 \\ \hline \text { New York } & 0.218 & 1.25 \\ \hline \end{array} $$

If there is a positive correlation between number of years studying math and shoe size (for children), does that prove that larger shoes cause more studying of math or vice versa? Can you think of a confounding variable that might be influencing both of the other variables?

The data shows the number of calories, carbohydrates (in grams) and sugar (in grams) found in a selection of menu items at McDonald's. Scatterplots suggest the relationship between calories and both carbs and sugars is linear. The data are also available on this text's website. (Source: shapefit.com) $$ \begin{array}{|c|c|c|} \hline \text { Calories } & \text { Carbs (in grams) } & \text { Sugars (in grams) } \\ \hline 530 & 47 & 9 \\ \hline 520 & 42 & 10 \\ \hline 720 & 52 & 14 \\ \hline 610 & 47 & 10 \\ \hline 600 & 48 & 12 \\ \hline 540 & 45 & 9 \\ \hline 740 & 43 & 10 \\ \hline 240 & 32 & 6 \\ \hline 290 & 33 & 7 \\ \hline 340 & 37 & 7 \\ \hline 300 & 32 & 6 \\ \hline 430 & 35 & 7 \\ \hline 380 & 34 & 7 \\ \hline 430 & 35 & 6 \\ \hline 440 & 35 & 7 \\ \hline 430 & 34 & 7 \\ \hline 750 & 65 & 16 \\ \hline 590 & 51 & 14 \\ \hline 510 & 55 & 10 \\ \hline 350 & 42 & 8 \\ \hline \end{array} $$ $$ \begin{array}{|l|l|} \hline \text { Calories } & \text { Carbs (in grams) } & \text { Sugars (in grams) } \\ \hline 670 & 58 & 11 \\ \hline 510 & 44 & 9 \\ \hline 610 & 57 & 11 \\ \hline 450 & 43 & 9 \\ \hline 360 & 40 & 5 \\ \hline 360 & 40 & 5 \\ \hline 430 & 41 & 6 \\ \hline 480 & 43 & 6 \\ \hline 430 & 43 & 7 \\ \hline 390 & 39 & 5 \\ \hline 500 & 44 & 11 \\ \hline 670 & 68 & 12 \\ \hline 510 & 54 & 10 \\ \hline 630 & 56 & 7 \\ \hline 480 & 42 & 6 \\ \hline 610 & 56 & 8 \\ \hline 450 & 42 & 6 \\ \hline 540 & 61 & 14 \\ \hline 380 & 47 & 12 \\ \hline 340 & 37 & 8 \\ \hline 260 & 30 & 7 \\ \hline 340 & 34 & 5 \\ \hline 260 & 27 & 4 \\ \hline 360 & 32 & 3 \\ \hline 280 & 25 & 2 \\ \hline 330 & 26 & 3 \\ \hline 190 & 12 & 0 \\ \hline 750 & 65 & 16 \\ \hline \end{array} $$ a. Calculate the correlation coefficient and report the equation of the regression line using carbs as the predictor and calories as the response variable. Report the slope and interpret it in the context of this problem. Then use your regression equation to predict the number of calories in a menu item containing 55 grams of carbohydrates. b. Calculate the correlation coefficient and report the equation of the regression line using sugar as the predictor and calories as the response variable. Report the slope and interpret it in the context of this problem. Then use your regression equation to predict the number of calories in a menu item containing 10 grams of sugars. c. Based on your answers to parts (a) and (b), which is a better predictor of calories for these data: carbs or sugars? Explain your choice using appropriate statistics.

The following table shows the fat content (in grams) and calories for a sample of granola bars. (Source: calorielab. com ) $$ \begin{array}{|c|l|} \hline \text { Fat (in grams) } & \text { Calories } \\ \hline 7.6 & 370 \\ \hline 3.3 & 106.1 \\ \hline 18.7 & 312.4 \\ \hline \end{array} $$ $$ \begin{array}{|c|c|} \hline \text { Fat (in grams) } & \text { Calories } \\ \hline 3.8 & 113.1 \\ \hline 5 & 117.8 \\ \hline 5.5 & 131.9 \\ \hline 7.2 & 140.6 \\ \hline 6.1 & 118.8 \\ \hline 4.6 & 124.4 \\ \hline 3.9 & 105.1 \\ \hline 6.1 & 136 \\ \hline 4.8 & 124 \\ \hline 4.4 & 119.3 \\ \hline 7.7 & 192.6 \\ \hline \end{array} $$ a. Use technology to make a scatterplot of the data. Use fat as the independent variable \((x)\) and calories as the dependent variable \((y)\). Does there seem to be a linear trend to the data? b. Compute the correlation coefficient and the regression equation, using fat as the independent variable and calories as the dependent variable. c. What is the slope of the regression equation? Interpret the slope in the context of this problem. d. What is the \(y\) -intercept of the regression equation? Interpret the \(y\) -intercept in the context of this problem or explain why it would be inappropriate to do so. e. Find and interpret the coefficient of determination. f. Use the regression equation to predict the calories in a granola bar containing 7 grams of fat. g. Would it be appropriate to use the regression equation to predict the calories in a granola bar containing 25 grams of fat? If so, predict the number of calories in such a bar. If not, explain why it would be inappropriate to do so. h. Looking at the scatterplot there is a granola bar in the sample that has an extremely high number of calories given the moderate amount of fat it contains. Remove its data from the sample and recalculate the correlation coefficient and regression equation. How did removing this unusual point change the value of \(r\) and the regression equation?

Suppose a doctor telephones those patients who are in the highest \(10 \%\) with regard to their recently recorded blood pressure and asks them to return for a clinical review. When she retakes their blood pressures, will those new blood pressures, as a group (that is, on average), tend to be higher than, lower than, or the same as the earlier blood pressures, and why?

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.