/*! This file is auto-generated */ .wp-block-button__link{color:#fff;background-color:#32373c;border-radius:9999px;box-shadow:none;text-decoration:none;padding:calc(.667em + 2px) calc(1.333em + 2px);font-size:1.125em}.wp-block-file__button{background:#32373c;color:#fff;text-decoration:none} Problem 59 Toughness and fibrousness of asp... [FREE SOLUTION] | 91Ó°ÊÓ

91Ó°ÊÓ

Toughness and fibrousness of asparagus are major determinants of quality. This was the focus of a study reported in "Post-Harvest Glyphosphate Application Reduces Toughening, Fiber Content, and Lignification of Stored Asparagus Spears" (J. of the Amer. Soc. of Hort. Science, 1988: 569-572). The article reported the accompanying data (read from a graph) on \(x=\) shear force \((\mathrm{kg})\) and \(y=\) percent fiber dry weight. $$ \begin{array}{c|ccccccccc} x & 46 & 48 & 55 & 57 & 60 & 72 & 81 & 85 & 94 \\ \hline y & 2.18 & 2.10 & 2.13 & 2.28 & 2.34 & 2.53 & 2.28 & 2.62 & 2.63 \\ x & 109 & 121 & 132 & 137 & 148 & 149 & 184 & 185 & 187 \\ \hline y & 2.50 & 2.66 & 2.79 & 2.80 & 3.01 & 2.98 & 3.34 & 3.49 & 3.26 \end{array} $$ \(n=18, \Sigma x_{i}=1950, \Sigma x_{i}^{2}=251,970\) \(\Sigma y_{i}=47.92, \Sigma y_{i}^{2}=130.6074, \Sigma x_{i} y_{i}=5530.92\) a. Calculate the value of the sample correlation coefficient. Based on this value, how would you describe the nature of the relationship between the two variables? b. If a first specimen has a larger value of shear force than does a second specimen, what tends to be true of percent dry fiber weight for the two specimens? c. If shear force is expressed in pounds, what happens to the value of \(r\) ? Why? d. If the simple linear regression model were fit to this data, what proportion of observed variation in percent fiber dry weight could be explained by the model relationship? e. Carry out a test at significance level .01 to decide whether there is a positive linear association between the two variables.

Short Answer

Expert verified
Calculate \( r \) using formula, its strength tells the relationship's nature, unit conversion doesn't affect \( r \), \( r^2 \) shows proportion of variance explained, test significance with \( t \)-test at \( \alpha = 0.01 \).

Step by step solution

01

Understand the Variables

We have two variables: \(x\), the shear force in kilograms, and \(y\), the percent fiber dry weight. We need to analyze their relationship using statistical methods.
02

Calculate the Correlation Coefficient

The formula for the sample correlation coefficient \(r\) is \[ r = \frac{n \Sigma xy - \Sigma x \Sigma y}{\sqrt{(n \Sigma x^2 - (\Sigma x)^2)(n \Sigma y^2 - (\Sigma y)^2)}} \] Plug the given values into the formula: \( n = 18 \), \( \Sigma x = 1950 \), \( \Sigma x^2 = 251970 \), \( \Sigma y = 47.92 \), \( \Sigma y^2 = 130.6074 \), \( \Sigma xy = 5530.92 \). Calculate the result.
03

Interpret the Correlation

Once the correlation coefficient \(r\) is calculated, assess the strength and direction of the linear relationship. Positive \(r\) indicates a direct relationship, while negative \(r\) suggests an inverse relationship. The closer \(r\) is to \(1\) or \(-1\), the stronger the relationship.
04

Assess the Effect of Unit Conversion

If shear force is converted from kilograms to pounds, the correlation coefficient \(r\) remains unchanged because \(r\) is a dimensionless measurement. Changing units does not affect the strength or direction of a linear relationship.
05

Calculate Proportion of Variance Explained

The coefficient of determination \( r^2 \) represents the proportion of variance in \( y \) explained by the linear relationship. Square the correlation coefficient \( r \) to determine \( r^2 \), which indicates the explanatory power of the regression model.
06

Hypothesis Test for Significance of Correlation

For a hypothesis test at a significance level of \( \alpha = 0.01 \), the null hypothesis \( H_0: \rho = 0 \) (no correlation) is tested against the alternative hypothesis \( H_a: \rho > 0 \) (positive correlation). Use the test statistic \[ t = r \sqrt{\frac{n-2}{1-r^2}} \] and compare with the critical value from the \( t \)-distribution table with \( n-2 \) degrees of freedom.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with 91Ó°ÊÓ!

Key Concepts

These are the key concepts you need to understand to accurately answer the question.

Linear Regression
Linear regression is a method used to model the relationship between two variables by fitting a linear equation to observed data. In our asparagus study, these variables are the shear force, denoted as \(x\), and the percent fiber dry weight, denoted as \(y\). A simple linear regression will often appear as a straight line on a graph where \(x\) values are plotted on the horizontal axis and \(y\) values on the vertical axis. This line is called the "best fit" line.

The equation of this line is typically written as \(y = mx + c\), where \(m\) represents the slope of the line and \(c\) is the y-intercept. The slope \(m\) shows how much \(y\) changes for a one-unit change in \(x\).

This method is particularly useful for making predictions. If you have a data point for \(x\), you can plug it into the equation to predict \(y\). In the case of asparagus, having a known shear force allows estimation of the percent fiber dry weight. This predictive power is why linear regression is a go-to tool for analysts and researchers.
Hypothesis Testing
Hypothesis testing in the context of correlation involves evaluating whether a relationship between two variables is statistically significant. In the asparagus example, you might want to test if the observed correlation between shear force and percent fiber dry weight is due to a true association, rather than mere chance.

Hypothesis testing starts with forming a null hypothesis, \(H_0\), and an alternative hypothesis, \(H_a\). For these variables, \(H_0\) might state that there is no correlation \((\rho = 0)\), whereas \(H_a\) suggests there is a positive correlation \((\rho > 0)\).

Using a significance level, such as \(\alpha = 0.01\), which reflects the risk of accepting \(H_a\) when \(H_0\) is true, you calculate a test statistic from the sample data. This statistic is then compared against a critical value derived from the \(t\)-distribution with \(n-2\) degrees of freedom, where \(n\) is the number of data pairs.

If the test statistic exceeds this critical value, the null hypothesis is rejected, supporting the alternative hypothesis. Thus, you conclude that a significant positive correlation exists between shear force and percent fiber dry weight, confirming it's not just by chance.
Variance Explained
Variance explained in a dataset refers to how much of the variability in one variable is accounted for by its relationship with another variable, often through a linear model. The measure used here is the coefficient of determination, \( r^2 \).

In the asparagus study, once you have found the correlation coefficient \(r\), squaring it yields \( r^2 \). This value provides insight into the proportion of variance in percent fiber dry weight that can be explained by the changes in shear force.

A higher \( r^2 \) value indicates a stronger relationship, meaning that a greater percentage of \( y \) (percent fiber dry weight) variation can be predicted from \( x \) (shear force) variations. For example, if \( r^2 = 0.58 \), it implies that 58% of the variability in percent fiber dry weight is explained by the linear relationship with shear force.

This aspect of analysis is crucial because it shows the effectiveness of your linear model in explaining the variability, guiding decisions and insights drawn from the data.

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

The efficiency ratio for a steel specimen immersed in a phosphating tank is the weight of the phosphate coating divided by the metal loss (both in \(\mathrm{mg} / \mathrm{ft}^{2}\) ). The article "Statistical Process Control of a Phosphate Coating Line" (Wire J. Intl., May 1997: 78-81) gave the accompanying data on tank temperature \((x)\) and efficiency ratio \((y)\). $$ \begin{array}{l|rrrrrrr} \text { Temp. } & 170 & 172 & 173 & 174 & 174 & 175 & 176 \\ \hline \text { Ratio } & .84 & 1.31 & 1.42 & 1.03 & 1.07 & 1.08 & 1.04 \\ \text { Temp. } & 177 & 180 & 180 & 180 & 180 & 180 & 181 \\ \hline \text { Ratio } & 1.80 & 1.45 & 1.60 & 1.61 & 2.13 & 2.15 & .84 \\ \text { Temp. } & 181 & 182 & 182 & 182 & 182 & 184 & 184 \\ \hline \text { Ratio } & 1.43 & .90 & 1.81 & 1.94 & 2.68 & 1.49 & 2.52 \\ \text { Temp. } & 185 & 186 & 188 & & & & \\ \hline \text { Ratio } & 3.00 & 1.87 & 3.08 & & & & \end{array} $$ a. Construct stem-and-leaf displays of both temperature and efficiency ratio, and comment on interesting features. b. Is the value of efficiency ratio completely and uniquely determined by tank temperature? Explain your reasoning. c. Construct a scatterplot of the data. Does it appear that efficiency ratio could be very well predicted by the value of temperature? Explain your reasoning.

The invasive diatom species Didymosphenia geminata has the potential to inflict substantial ecological and economic damage in rivers. The article "Substrate Characteristics Affect Colonization by the BloomForming Diatom Didymosphenia geminata (Aquatic Ecology, 2010: 33-40) described an investigation of colonization behavior. One aspect of particular interest was whether \(y=\) colony density was related to \(x=\) rock surface area. The article contained a scatterplot and summary of a regression analysis. Here is representative data: $$ \begin{array}{l|rrrrrrrr} x & 50 & 71 & 55 & 50 & 33 & 58 & 79 & 26 \\ \hline y & 152 & 1929 & 48 & 22 & 2 & 5 & 35 & 7 \\ x & 69 & 44 & 37 & 70 & 20 & 45 & 49 & \\ \hline y & 269 & 38 & 171 & 13 & 43 & 185 & 25 & \end{array} $$ a. Fit the simple linear regression model to this data, predict colony density when surface area \(=70\) and when surface area \(=71\), and calculate the corresponding residuals. How do they compare? b. Calculate and interpret the coefficient of determination. c. The second observation has a very extreme \(y\) value (in the full data set consisting of 72 observations, there were two of these). This observation may have had a substantial impact on the fit of the model and subsequent conclusions. Eliminate it and recalculate the equation of the estimated regression line. Does it appear to differ substantially from the equation before the deletion? What is the impact on \(r^{2}\) and \(s\) ?

Head movement evaluations are important because individuals, especially those who are disabled, may be able to operate communications aids in this manner. The article "Constancy of Head Turning Recorded in Healthy Young Humans" (J. of Biomed. Engr., 2008: 428-436) reported data on ranges in maximum inclination angles of the head in the clockwise anterior, posterior, right, and left directions for 14 randomly selected subjects. Consider the accompanying data on average anterior maximum inclination angle (AMIA) both in the clockwise direction and in the counterclockwise direction. $$ \begin{array}{lccccccc} \text { Subj: } & 1 & 2 & 3 & 4 & 5 & 6 & 7 \\ \text { Cl: } & 57.9 & 35.7 & 54.5 & 56.8 & 51.1 & 70.8 & 77.3 \\ \text { Co: } & 44.2 & 52.1 & 60.2 & 52.7 & 47.2 & 65.6 & 71.4 \\ \text { Subj: } & 8 & 9 & 10 & 11 & 12 & 13 & 14 \\ \text { Cl: } & 51.6 & 54.7 & 63.6 & 59.2 & 59.2 & 55.8 & 38.5 \\ \text { Co: } & 48.8 & 53.1 & 66.3 & 59.8 & 47.5 & 64.5 & 34.5 \end{array} $$ a. Calculate a point estimate of the population correlation coefficient between Cl AMIA and Co AMIA \(\left(\Sigma \mathrm{Cl}=786.7, \quad \Sigma \mathrm{Co}=767.9, \quad \Sigma \mathrm{Cl}^{2}=45,727.31\right.\), \(\left.\Sigma \mathrm{Co}^{2}=43,478.07, \Sigma \mathrm{ClCo}=44,187.87\right)\). b. Assuming bivariate normality (normal probability plots of the \(\mathrm{Cl}\) and \(\mathrm{Co}\) samples are reasonably straight), carry out a test at significance level .01 to decide whether there is a linear association between the two variables in the population (as do the authors of the cited paper). Would the conclusion have been the same if a significance level of \(.001\) had been used?

The article "Behavioural Effects of Mobile Telephone Use During Simulated Driving" (Ergonomics, 1995: 2536-2562) reported that for a sample of 20 experimental subjects, the sample correlation coefficient for \(x=\) age and \(y=\) time since the subject had acquired a driving license (yr) was .97. Why do you think the value of \(r\) is so close to 1 ? (The article's authors give an explanation.)

Torsion during hip external rotation and extension may explain why acetabular labral tears occur in professional athletes. The article "Hip Rotational Velocities During the Full Golf Swing" (J. of Sports Science and Med., 2009: 296-299) reported on an investigation in which lead hip internal peak rotational velocity \((x)\) and trailing hip peak external rotational velocity \((y)\) were determined for a sample of 15 golfers. Data provided by the article's authors was used to calculate the following summary quantities: $$ \begin{aligned} \sum\left(x_{i}-\bar{x}\right)^{2}=& 64,732.83, \sum\left(y_{i}-\bar{y}\right)^{2}=130,566.96, \\ & \sum\left(x_{i}-\bar{x}\right)\left(y_{i}-\bar{y}\right)=44,185.87 \end{aligned} $$ Separate normal probability plots showed very substantial linear patterns. a. Calculate a point estimate for the population correlation coefficient. b. Carry out a test at significance level .01 to decide whether there is a linear relationship between the two velocities in the sampled population. c. Would the conclusion of (b) have changed if you had tested appropriate hypotheses to decide whether there is a positive linear association in the population? What if a significance level of .05 rather than \(.01\) had been used?

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.