/*! This file is auto-generated */ .wp-block-button__link{color:#fff;background-color:#32373c;border-radius:9999px;box-shadow:none;text-decoration:none;padding:calc(.667em + 2px) calc(1.333em + 2px);font-size:1.125em}.wp-block-file__button{background:#32373c;color:#fff;text-decoration:none} Q 66. Managing diabetes People with di... [FREE SOLUTION] | 91Ó°ÊÓ

91Ó°ÊÓ

Managing diabetes People with diabetes measure their fasting plasma glucose (FPG; measured in units of milligrams per milliliter) after fasting for at least 8 hours. Another measurement, made at regular medical checkups, is called HbA. This is roughly the percent of red blood cells that have a glucose

molecule attached. It measures average exposure to glucose over a period of several months. The table below gives data on both HbA and FPG for 18 diabetics five months after they had completed a diabetes education class.

(a) Make a scatterplot with HbA as the explanatory variable. There is a positive linear relationship, but it is surprisingly weak.

(b) Subject 15 is an outlier in the y-direction. Subject 18 is an outlier in the x-direction. Find the correlation for all 18 subjects, for all except Subject 15 and

for all except Subject 18 Are either or both of these subjects influential for the correlation? Explain in simple language why r changes in opposite directions when we remove each of these points.

(c) Add three regression lines for predicting FPG from HbA to your scatterplot: for all 18 subjects, for all except Subject 15 and for all except Subject 18

Is either Subject 15 or Subject 18 strongly influential for the least-squares line? Explain in simple language what features of the scatterplot explain the degree of influence.

Short Answer

Expert verified

Part (b) The correlation r with all 18 subjects is r=0.482

The correlation r without subject 15 is r=0.568

The correlation r without subject 18 is r=0.384

Part (c) Subject 15 and subject 18 both are influential.

Part (a)

Step by step solution

01

Part (a) Step 1: Given information

SubjectHb1AFPGSubjectHbAFPG
16.1141108.7172
26.3
158119.4200
36.41121210.4271
46.81531310.6103
57.01341410.7172
67.1951510.7359
77.5961611.2145
87.7781713.7147
97.91481819.3255
02

Part (a) Step 2: Concept

Linear regression is commonly used for predictive analysis and modeling.

03

Part (a) Step 3: Explanation

Set the horizontal axis for HbA (the explanatory variable) and the vertical axis for FPG (the response variable).

The scatterplot for the supplied data is presented below using the MINITAB:

The general pattern moves from the bottom left to the higher right, as shown in the graph. That is, people with a higher HbA have a higher FPG. This is referred to as a positive relationship between the two variables. The relationship is linear in nature. That example, the general pattern runs from bottom left to higher right in a straight line. Because the points deviate greatly from the line and there are some outliers, the relationship is weak. Therefore, the required scatterplot is drawn.

04

Part (b) Step 1: Calculation

The correlation r with all 18 individuals using the MINITAB is r=0.482

Without subject 15 the correlation coefficient is r=0.568

Without subject 18 the correlation coefficient isr=0.384

Without subject 15 and without subject 18 the correlation is r=0.324

The Correlation increases by 0.086 after outlier subject 15 is removed. However, removing subject 15 from the equation has no influence on the association. Because of subject 15's extreme position on the HbA scale, the position of the regression line is strongly influenced by this point. The Correlation drops by 0.098 when the outlier subject18 is removed. One outlier can be wholly responsible for a high correlation value that would otherwise be quite low (without the outlier). Needless to note, major decisions should never be made solely on the basis of the correlation coefficient's value (i.e., examining the respective scatterplot is always recommended). These are known as 'good' outliers. Both subjects 15 and 18 have an impact since the linear correlation coefficient varies dramatically when they are combined.

Therefore,

The correlation r with all 18 subjects is r=0.482

The correlation r without subject 15 is r=0.568

The correlation r without subject 18 is r=0.384

05

Part (c) Step 1: Explanation

The least-square lines with all 18topics, without subject 15 and without subject 18 are shown in the diagram below.

The relevance of subject 18 can be shown here. This point can be considered an excellent outlier because it spreads the pattern to the top right. When this point is removed, the correlation decreases since the remaining points exhibit no discernible pattern. Because this point is so distant from the regression line, Subject 15 has a very large residual. Least-squares lines minimize the sum of squares of the vertical distances between the points. The line is pulled toward itself by a point that is extreme in the X direction and has no other points nearby. It's known as influential spots. It lowers the line's incline. Therefore, subject 15 and subject 18 both are influential.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with 91Ó°ÊÓ!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Do students with higher IQ test scores tend to do better in school? The figure below shows a scatterplot of IQ and school grade point average (GPA) for all 78seventh-grade students in a rural midwestern school. (GPA was recorded on a 12-point scale with , A+=12,A=11,A-=10,B+=9,…,D-=1,andF=0.)2

(a) Say in words what a positive association betweenIQ and GPA would mean. Does the plot show a positive association?

(b) What is the form of the relationship? Is it very strong? Explain your answers.

(c) At the bottom of the plot are several points that we might call outliers. One student, in particular, has a very low GPA despite an average IQ score. What are the approximate IQ and GPA of this student?

When it rains, it pours The figure below plots the record-high yearly precipitation in each state against that state’s record-high 24-hour precipitation. Hawaii is a high outlier, with a record-high yearly record of 704.83 inches of rain recorded at Kukui in 1982

(a) The correlation for all 50 states in the figure is 0.408 If we leave out Hawaii, would the correlation increase, decrease, or stay the same? Explain.

(b) Two least-squares lines are shown on the graph. One was calculated using all 50 states, and the other omits Hawaii. Which line is which? Explain.

(c) Explain how each of the following would affect the correlation, s, and the least-squares line:

  • Measuring record precipitation in feet instead of inches for both variables.
  • Switching the explanatory and response variables.

A recent study discovered the correlation between the age at which an infant first speaks and the child’s score on an IQ test given upon entering elementary school is −0.68. A scatterplot of the data shows a linear form. Which of the following statements about this finding is correct?

(a) Infants who speak at very early ages will have higher IQ scores by the beginning of elementary school than those who begin to speak later.

(b) 68%of the variation in IQ test scores is explained by the least-squares regression of age at first spoken word and IQ score.

(c) Encouraging infants to speak before they are ready can have a detrimental effect later in life, as evidenced by their lower IQ scores.

(d) There is a moderately strong, negative linear relationship between age at first spoken word and later IQ test scores for the individuals in this study.

How much gas? Refer to Exercise 40. During March, the average temperature was 46.4°F and Joan used 490 cubic feet of gas per day. Find and interpret the residual for this month.

Find the residual for the subject who increased NEA by620 calories. Show your work

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.