/*! This file is auto-generated */ .wp-block-button__link{color:#fff;background-color:#32373c;border-radius:9999px;box-shadow:none;text-decoration:none;padding:calc(.667em + 2px) calc(1.333em + 2px);font-size:1.125em}.wp-block-file__button{background:#32373c;color:#fff;text-decoration:none} Q. T12.12 T12.12 Foresters are interested ... [FREE SOLUTION] | 91Ó°ÊÓ

91Ó°ÊÓ

T12.12 Foresters are interested in predicting the amount of usable lumber they can harvest from various tree species. They collect data on the diameter at breast height (DBH) in inches and the yield in board feet of a random sample of 20 Ponderosa pine trees that have been harvested. (Note that a board foot is defined as a piece of lumber 12 inches by 12 inches by 1 inch.) Here is a scatterplot of the data.

a. Here is some computer output and a residual plot from a least-squares regression on these data. Explain why a linear model may not be appropriate in this case.

The foresters are considering two possible transformations of the original data: (1) cubing the diameter values or (2) taking the natural logarithm of the yield measurements. After transforming the data, a least-squares regression analysis is performed. Here is some computer output and a residual plot for each of the two possible regression models:

b. Use both models to predict the amount of usable lumber from a Ponderosa pine with diameter 30 inches.
c. Which of the predictions in part (b) seems more reliable? Give appropriate evidence to support your choice.

Short Answer

Expert verified

(a) The pattern in the residual plot involves substantial curvature, a linear model will not be appropriate because the variables have a curved connection.

(b) The predicted yield for option 1is 117.0899board feet and the predicted yield for option 2 is 102.967board feet.

(c) Option 1 is the better option for prediction.

Step by step solution

01

Part (a) Step 1: Given information

To determine that a linear model may not be appropriate in this case.

02

Part (a) Step 2: Explanation

Foresters want to know how much useful lumber they'll be able to get from different tree species.
They took measurements of a random sample of Ponderosa pine trees' diameter at breast height in inches and yield in broad feet.
In the question, will find the computer output as well as a residual plot from least square regression.
Because the pattern in the residual plot involves substantial curvature, a linear model will not be acceptable because the variables have a curved connection.

03

Part (b) Step 1: Given information

To use both models to predict the amount of usable lumber from a Ponderosa pine with diameter 30 inches.

04

Part (b) Step 2: Explanation

Foresters want to know how much useful lumber they will be able to collect from different tree types. They measured the diameter of a random sample of Ponderosa pine trees at breast height in inches and the yield in broad feet. The question includes the computer results as well as a residual graphic from a least square regression. The foresters are exploring cubing the diameter values or taking the natural logarithm of the yields measurements as two feasible modifications of the original data.
As a result, the general equation of the least square regression line for option 1 is:
y^=b0+b1x
The value of the constant b0 is calculated as follows in the computer output's row "Constant" and column "Coef":
b0=2.078
The value of the constant b1is calculated as follows in the computer output's row "DBH3" and column "Coef":
b1=0.0042597

In the general equation, replace b0with 2.078and b1with b1=0.0042597.
y^=b0+b1x
⇒y^=2.078+0.0042597x

Hence the cubic equation is calculated as:

y^=2.078+0.0042597x3

Substitute xfor 30:

y^=2.078+0.0042597x3

=2.078+0.0042597(30)3

=117.0899

As a result, the predicted yield is 117.0899board feet.

05

Part (b) Step 3: Explanation

Then, the general equation of the least square regression line for option 2 is:
y^=b0+b1x
The value of the constant $b 0$ is calculated as follows in the computer output's row "Constant" and column "Coef":
b0=1.2319

The value of the constant $b 1$ is calculated as follows in the computer output's row "DBH" and column "Coef":
b1=0.113417
In the general equation, replace $b_{0}=1.2319$ and $b_{1}$ with $b_{1}=0.113417$,
y^=b0+b1x
⇒y^=1.2319+0.113417x
Use the logarithm in the equation:
lny^=1.2319+0.113417x
Then multiply xby 30 to get:
lny^=1.2319+0.113417x
=1.2319+0.113417(30)
=4.63441
Take each side's exponential:
y^=elny^
=e4.63441
=102.967

As a result, the predicted yield is 102.967 board feet.

06

Part (c)  Step 1: Given information

To find the predictions in part (b) seems more reliable and to explain with appropriate evidence.

07

Part (c) Step 2: Explanation

Foresters want to know how much useful lumber they will be able to collect from different tree types. They measured the diameter of a random sample of Ponderosa pine trees at breast height in inches and the yield in broad feet.
The question includes the computer results as well as a residual graphic from a least square regression. The foresters are exploring cubing the diameter values or taking the natural logarithm of the yields measurements as two feasible modifications of the original data.
As a result, the residual plot of option 1 has no strong curvature, whereas the residual plot of option 2 has strong curvature.
Also, the model in option 1is appropriate for making predictions, but the model in option 2 is not.
Therefore, estimated that forecast using option 1 will be more accurate.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with 91Ó°ÊÓ!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

A researcher from the University of California, San Diego, collected data on average per capita wine consumption and heart disease death rate in a random sample of 19 countries for which data were available. The following table displays the data

Is there convincing evidence of a negative linear relationship between wine consumption and heart disease deaths in the population of countries?

The swinging pendulum Refer to Exercise 33. We took the logarithm (base 10) of the values for both length and period. Here is some computer output from a linear regression analysis of the transformed data.


a. Based on the output, explain why it would be reasonable to use a power model to describe the relationship between the length and period of a pendulum.

b. Give the equation of the least-squares regression line. Be sure to define any variables you use.

c. Use the model from part (b) to predict the period of a pendulum with a length of 80cm.

Recycle and Review Exercises 29-31 refer to the following setting. Does the color in which words are printed affect your ability to read them? Do the words themselves affect your ability to name the color in which they are printed? Mr. Starnes designed a study to investigate these questions using the 16 students in his AP Statistics class as subjects. Each student performed the following two tasks in random order while a partner timed his or her performance: (1) Read 32words aloud as quickly as possible, and (2) say the color in which each of32 words is printed as quickly as possible. Try both tasks for yourself using the word list given.

Color words (4.2) Let's review the design of the study-

a. Explain why this was an experiment and not an observational study.

b. Did Mr. Stames use a completely randomized design a: randomizéd black design? Why do you think he choose this experimental design?

c. Explain the purpose of the ramdom assignment in the context of the study.

Here are the data from Mr. Stames's experiment. For each subject, the time to perform the Iwo tasks is given to the nearest second.

The professor swims Here are data on the time (in minutes) Professor Moore takes to swim 2000yards and his pulse rate (beats per minute) after swimming on a random sample of 23days:

Is there convincing evidence of a negative linear relationship between Professor Moore’s swim time and his pulse rate in the population of days on which he swims2000yards?

Western lowland gorillas, whose main habitat is in central Africa, have a mean weight of 275pounds with a standard deviation of 40pounds. Capuchin monkeys, whose main habitat is Brazil and other parts of Latin America, have a mean weight of 6pounds with a standard deviation of 1.1pounds. Both distributions of weight are approximately Normally distributed. If a particular western lowland gorilla is known to weigh 345pounds, approximately how much would a capuchin monkey have to weigh, in pounds, to have the same standardized weight as the gorilla?

a. 4.08

b. 7.27

c. 7.93

d.8.20

e. There is not enough information to determine the weight of a capuchin monkey.

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.