/*! This file is auto-generated */ .wp-block-button__link{color:#fff;background-color:#32373c;border-radius:9999px;box-shadow:none;text-decoration:none;padding:calc(.667em + 2px) calc(1.333em + 2px);font-size:1.125em}.wp-block-file__button{background:#32373c;color:#fff;text-decoration:none} Q 69. Managing diabetes People with di... [FREE SOLUTION] | 91Ó°ÊÓ

91Ó°ÊÓ

Managing diabetes People with diabetes measure their fasting plasma glucose (FPG, measured in milligrams per milliliter) after fasting for at least 8 hours. Another measurement, made at regular medical checkups, is called HbA. This is roughly the percent of red blood cells that have a glucose molecule attached. It measures average exposure to glucose over a period of several months. The table gives data on both HbA and FPG for 18 diabetics five months after they had completed a diabetes education class.

a. Make a scatterplot with HbA as the explanatory variable. Describe what you see.

b. Subject 18 is an outlier in the x-direction. What effect do you think this subject has on the correlation? What effect do you think this subject has on the equation of the least-squares regression line? Calculate the correlation and equation of the least-squares regression line with and without this subject to confirm your answer.

c. Subject 15 is an outlier in the y-direction. What effect do you think this subject has on the correlation? What effect do you think this subject has on the equation of the least-squares regression line? Calculate the correlation and equation of the least-squares regression line with and without this subject to confirm your answer.

Short Answer

Expert verified

Part (a) the scatterplot confirms a weak relationship because the points seem to lie far apart.

Part (b) No, they do not affect.

Part (c) It makes the regression line steeper.

Step by step solution

01

Part (a) Step 1: Given information

02

Part (a) Step 2: Explanation

The scatterplot with HbA as the explanatory variable is as:

Because the scatterplot slopes upwards, we can conclude that the scatterplot confirms a positive linear connection. Because the points appear to be widely apart, the scatterplot indicates a weak association.

03

Part (b) Step 1: Calculation

Now we must use the excel function to calculate the correlation:

First, we'll enter the data into an excel file, and then we'll utilize the correlation function, which is,

CORREL function returns the correlation coefficient of the array1andarray2cell ranges. Thus, the syntax is as:

CORREL(array1,array2)

AVERAGE function returns the average of the array1andarray2cell ranges. The syntax is as:

AVERAGE(array1,array2)

For the case with outlier:

Thus, the calculation will be as:

Correlation=CORREL(H1:H18,I1:I18)

And the result will be as:

Correlation=0.4506

Thus, the slope will be,

b=rsysx=0.4506×81.48273.2619=11.2569

And the y-intercept will be,

a=y−bx=172.3846−11.2569×10.3769=55.5726

The regression line will be as:

yÁåœ=55.5726+11.2569x

For the case without outlier:

Thus, the calculation will be as:

Correlation=CORREL(H1:H17,I1:I17)

And the result will be as:

Correlation0.3837

Thus, the slope will be,

b=rsysx=0.3837×68.90202.1821=12.1158

And they-intercept will be,

a=y−bx=157.8824−12.1156×8.7176=52.2615

04

Part (b) Step 2: Calculation

The regression line will be as:

yÁåœ=55.5726+12.1158x

As a result, we can see that the correlation coefficient with the outlier is greater than the correlation coefficient without it. Due to the fact that subject 18 follows the same linear trend as the other points in the scatterplot, the outlier enhances the correlation. We then notice that the two regression lines in the scatterplot are nearly identical, implying that the outlier has little effect on the regression line.

05

Part (c) Step 1: Explanation

Now we must use the excel function to calculate the correlation:

First, we'll enter the data into an excel file, and then we'll utilize the correlation function, which is,

CORREL function returns the correlation coefficient of the array1and array2cell ranges. Thus, the syntax is as:

CORREL(array1,array2)

AVERAGE function returns the average of the array1 and array2 cell ranges. The syntax is as:

AVERAGE (array1,array2)

For the case with outlier:

Thus, the calculation will be as:

Correlation=CORREL(H1:H18,I1:I18)

And the result will be as:

Correlation0.4506

Thus, the slope will be,

b=rsysx=0.4506×81.48273.2619=11.2569

And the y-intercept will be,

a=y−bx=172.3846−11.2569×10.3769=55.5726

The regression line will be as:

yÁåœ=55.5726+11.2569x

For the case without outlier:

Thus, the calculation will be as:

Correlation=CORREL(H1:H17,I1:I17)

And the result will be as:

Correlation0.5684

Thus, the slope will be,

b=rsysx=0.5684×52.62313.3531=8.9204

And the y-intercept will be,

a=y−bx=151.7647−8.9204×9.2235=69.4872

06

Part (c) Step 2: Explanation

The regression line will be

yÁåœ=69.4872+8.9204x

As a result, we can see that the correlation coefficient with the outlier is lower than without the outlier. We then notice that the outlier reduces the correlation since subject 15 deviates from the general linear pattern in the other scatterplot points. Then we see that the regression line with the outlier is steeper than the regression line without the outlier, implying that the outlier causes the regression line to be steeper.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with 91Ó°ÊÓ!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

If women always married men who were 2 years older than themselves, what would be the correlation between the ages of husband and wife?

a. 2

b. 1

c. 0.5

d. 0

e. Can’t tell without seeing the data

Driving speed and fuel consumption Exercise 9 (page 171) gives data on the fuel consumption y of a car at various speeds x. Fuel consumption is measured in liters of gasoline per 100 kilometers driven, and speed is measured in kilometers per hour. A statistical software package gives the least-squares regression line y^=11.058–0.01466x. Use the residual plot to determine if this linear model is appropriate.

All brawn? The following scatterplot plots the average brain weight (in grams) versus average body weight (in kilograms) for 96 species of mammals.18 There are many small mammals whose points overlap at the lower left.

a. The correlation between body weight and brain weight is r=0.86 Interpret this value.

b. What effect does the human have on the correlation? Justify your answer.

More Starbucks How do the nutritional characteristics of food products differ from drink products at Starbucks? The scatterplot shown here enhances the scatterplot from Exercise 6 by plotting the food products with blue squares. How are the relationships between fat and calories the same for the two types of products? How are the relationships different?

More heavy backpacks Refer to your graph from Exercise 3 Describe the relationship between body weight and backpack weight for this group of hikers.

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.