/*! This file is auto-generated */ .wp-block-button__link{color:#fff;background-color:#32373c;border-radius:9999px;box-shadow:none;text-decoration:none;padding:calc(.667em + 2px) calc(1.333em + 2px);font-size:1.125em}.wp-block-file__button{background:#32373c;color:#fff;text-decoration:none} Problem 65 The article "Chronological Trend... [FREE SOLUTION] | 91Ó°ÊÓ

91Ó°ÊÓ

The article "Chronological Trend in Blood Lead Levels" (N. Engl. J. Med., 1983: 1373-1377) gives the following data on \(y=\) average blood lead level of white children age 6 months to 5 years and \(x=\) amount of lead used in gasoline production (in 1000 tons) for ten 6-month periods: $$ \begin{array}{l|ccccc} x & 48 & 59 & 79 & 80 & 95 \\ \hline y & 9.3 & 11.0 & 12.8 & 14.1 & 13.6 \\ x & 95 & 97 & 102 & 102 & 107 \\ \hline y & 13.8 & 14.6 & 14.6 & 16.0 & 18.2 \end{array} $$ a. Construct separate normal probability plots for \(x\) and \(y\). Do you think it is reasonable to assume that the \((x, y)\) pairs are from a bivariate normal population? b. Does the data provide sufficient evidence to conclude that there is a linear relationship between blood lead level and the amount of lead used in gasoline production? Use \(\alpha=.01\).

Short Answer

Expert verified
Plot both sets of data, test for linearity, and perform a regression test. If plots are linear and the test supports it, conclude a linear relationship exists.

Step by step solution

01

Organize the Data

First, arrange the given data points clearly. We have two variables, blood lead level \(y\), and lead usage in gasoline \(x\) for ten observations. Note their values: \(x: 48, 59, 79, 80, 95, 95, 97, 102, 102, 107\) and \(y: 9.3, 11.0, 12.8, 14.1, 13.6, 13.8, 14.6, 14.6, 16.0, 18.2\).
02

Construct Normal Probability Plot for x

Plot the normal probability plot for the variable \(x\). Sort \(x\) in increasing order and plot the sorted values against their expected z-scores if they were normally distributed. Assess linearity of the plot to determine normality.
03

Construct Normal Probability Plot for y

Plot the normal probability plot for the variable \(y\). As in Step 2, sort \(y\) and plot these against the expected z-scores. Check for linear patterns which suggest normality.
04

Evaluate Normal Probability Plots

Examine both normal probability plots for approximate linear relationships. If both plots show linear patterns, it is reasonable to assume normality of \(x\) and \(y\), which is a prerequisite for bivariate normality.
05

Conduct Hypothesis Testing for Linear Relationship

To test if there is a significant linear relationship between \(x\) and \(y\), perform a linear regression analysis. The null hypothesis \(H_0\) states that there is no linear relationship (\(\beta_1 = 0\)), and the alternative hypothesis \(H_1\) is that there is a linear relationship (\(\beta_1 eq 0\)).
06

Calculate Linear Regression Statistics

Fit the linear regression model \(y = \beta_0 + \beta_1 x\). Calculate the slope \(\beta_1\), intercept \(\beta_0\), and determine the correlation coefficient \(r\). Compute the test statistic for \(\beta_1\) using the formula \(t = \frac{\hat{\beta}_1}{SE(\hat{\beta}_1)}\), where \(SE(\hat{\beta}_1)\) is the standard error of the slope.
07

Conclusion from Hypothesis Test

Compare the calculated \(t\)-statistic to the critical value from the t-distribution with \(n-2\) degrees of freedom at \(\alpha = 0.01\). Reject \(H_0\) if the \(t\)-statistic falls in the rejection region, indicating a significant linear relationship. Otherwise, fail to reject \(H_0\).
08

Final Assessment of Bivariate Normality

If both normal probability plots suggest normality, and the hypothesis test indicates a significant linear relationship, it is reasonable to assume the data could come from a bivariate normal distribution.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with 91Ó°ÊÓ!

Key Concepts

These are the key concepts you need to understand to accurately answer the question.

Normal Probability Plot
A normal probability plot is a graphical technique that helps us assess whether a set of data follows a normal distribution. To construct one, we begin by sorting our data in ascending order. For each value, an expected z-score is calculated, assuming a normal distribution.

The data values are then plotted against these z-scores, and we look for linearity in the resulting scatterplot. A straight line suggests that the data adhere well to a normal distribution. Non-linearity, such as curves or upward or downward deviations, may indicate deviations from normality.

It is crucial to determine normality because many statistical tests, including bivariate analysis, assume that the involved variables are normally distributed.
  • A linear plot implies distribution approximates normal.
  • Non-linear plot suggests deviation from normality.
By constructing normal probability plots for both variables, we can infer about their individual normality, which is a step toward validating assumptions for more complex analyses like bivariate normality.
Bivariate Normal Distribution
When we say that a data set is from a bivariate normal distribution, we mean that two variables not only individually follow a normal distribution but also jointly display specific properties. Primarily, for each fixed value of one variable, the distribution of the other variable is normal.

In practical terms, this implies considering both the relationship between the two variables and their marginal distributions. A key aspect is the linear correlation between variables; observations must exhibit a linear relationship where deviations are randomly scattered around a mean line.

Steps to determine bivariate normality include:
  • Assess individual normality using normal probability plots.
  • Check for linear correlation using scatter plots.
  • Conduct a statistical test for linear relationship (e.g., regression analysis).
If both variables are normally distributed separately, and a linear relationship with some scatter of deviation exists, bivariate normality can be reasonably assumed.
Linear Regression Analysis
Linear regression analysis is used to model the relationship between two variables by fitting a linear equation to observed data. In this context, we aim to understand the dependency of one variable, usually called the response variable, on another, known as the predictor variable.

The basic equation for linear regression is given by \( y = \beta_0 + \beta_1 x + \varepsilon \), where \( y \) is the dependent variable, \( x \) is the independent variable, \( \beta_0 \) and \( \beta_1 \) are coefficients representing the intercept and slope, respectively, and \( \varepsilon \) is the error term.

Steps to perform linear regression analysis include:
  • Calculate the slope \( \beta_1 \) and the y-intercept \( \beta_0 \).
  • Compute the correlation coefficient, which measures how well \( x \) and \( y \) are related.
  • Determine the standard error and test statistics to evaluate the significance of the relationship.
If the test confirms the hypothesis that \( \beta_1 eq 0 \), it indicates a significant linear relationship between the variables. This knowledge is particularly valuable in predictive analytics, where understanding how one variable impacts another is crucial for forecasting and decision-making.

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

The article "A Dual-Buffer Titration Method for Lime Requirement of Acid Mine- soils" (J. of Environ. Qual., 1988: \(452-456\) ) reports on the results of a study relating to revegetation of soil at mine reclamation sites. With \(x=\mathrm{KCl}\) extractable aluminum and \(y=\) amount of lime required to bring soil \(\mathrm{pH}\) to \(7.0\), data in the article resulted in the following summary statistics: \(n=24, \quad \sum x=48.15, \quad \sum x^{2}=\) \(155.4685, \sum y=263.5, \sum y^{2}=3750.53\), and \(\sum x y=658.455\). Carry out a test at significance level .01 to see whether the population correlation coefficient is something other than 0 .

The article "Some Field Experience in the Use of an Accelerated Method in Estimating 28-Day Strength of Concrete" (J. Amer. Concrete Institute, 1969: 895) considered regressing \(y=28\)-day standard-cured strength (psi) against \(x=\) accelerated strength (psi). Suppose the equation of the true regression line is \(y=1800+1.3 x\). a. What is the expected value of 28-day strength when accelerated strength \(=2500\) ? b. By how much can we expect 28-day strength to change when accelerated strength increases by 1 psi? c. Answer part (b) for an increase of \(100 \mathrm{psi}\). d. Answer part (b) for a decrease of \(100 \mathrm{psi}\).

The article "The Incorporation of Uranium and Silver by Hydrothermally Synthesized Galena" (Econ. Geology, 1964: 1003-1024) reports on the determination of silver content of galena crystals grown in a closed hydrothermal system over a range of temperature. With \(x=\) crystallization temperature in \({ }^{\circ} \mathrm{C}\) and \(y=\mathrm{Ag}_{2} \mathrm{~S}\) in mol\%, the data follows: $$ \begin{array}{l|ccccccccccccc} x & 398 & 292 & 352 & 575 & 568 & 450 & 550 & 408 & 484 & 350 & 503 & 600 & 600 \\ \hline y & .15 & .05 & .23 & .43 & .23 & .40 & .44 & .44 & .45 & .09 & .59 & .63 & .60 \end{array} $$ from which \(\sum x_{i}=6130, \sum x_{i}^{2}=3,022,050, \sum y_{i}=4.73\), \(\sum y_{i}^{2}=2.1785, \sum x_{i} y_{i}=2418.74, \hat{\beta}_{1}=.00143, \hat{\beta}_{0}=-.311\), and \(s=.131\). a. Estimate true average silver content when temperature is \(500^{\circ} \mathrm{C}\) using a \(95 \%\) confidence interval. b. How would the width of a \(95 \%\) CI for true average silver content when temperature is \(400^{\circ} \mathrm{C}\) compare to the width of the interval in part (a)? Answer without computing this new interval. c. Calculate a \(95 \%\) CI for the true average change in silver content associated with a \(1^{\circ} \mathrm{C}\) increase in temperature. d. Suppose it had previously been believed that when crystallization temperature was \(400^{\circ} \mathrm{C}\), true average silver content would be .25. Carry out a test at significance level .05 to decide whether the sample data contradicts this prior belief.

Mist (airborne droplets or aerosols) is generated when metal-removing fluids are used in machining operations to cool and lubricate the tool and workpiece. Mist generation is a concern to OSHA, which has recently lowered substantially the workplace standard. The article "Variables Affecting Mist Generaton from Metal Removal Fluids" (Lubrication Engr., 2002: 10-17) gave the accompanying data on \(x=\) fluid flow velocity for a \(5 \%\) soluble oil \((\mathrm{cm} / \mathrm{sec})\) and \(y=\) the extent of mist droplets having diameters smaller than \(10 \mu \mathrm{m}\left(\mathrm{mg} / \mathrm{m}^{3}\right)\) : $$ \begin{array}{c|ccccccc} x & 89 & 177 & 189 & 354 & 362 & 442 & 965 \\ \hline y & .40 & .60 & .48 & .66 & .61 & .69 & .99 \end{array} $$ a. The investigators performed a simple linear regression analysis to relate the two variables. Does a scatter plot of the data support this strategy? b. What proportion of observed variation in mist can be attributed to the simple linear regression relationship between velocity and mist? c. The investigators were particularly interested in the impact on mist of increasing velocity from 100 to 1000 (a factor of 10 corresponding to the difference between the smallest and largest \(x\) values in the sample). When \(x\) increases in this way, is there substantial evidence that the true average increase in \(y\) is less than .6? d. Estimate the true average change in mist associated with a \(1 \mathrm{~cm} / \mathrm{sec}\) increase in velocity, and do so in a way that conveys information about precision and reliability.

The efficiency ratio for a steel specimen immersed in a phosphating tank is the weight of the phosphate coating divided by the metal loss (both in \(\mathrm{mg} / \mathrm{ft}^{2}\) ). The article "Statistical Process Control of a Phosphate Coating Line" (Wire J. Intl., May, 1997: 78-81) gave the accompanying data on tank temperature \((x)\) and efficiency ratio \((y)\). $$ \begin{array}{cccccccc} \text { Temp. } & 170 & 172 & 173 & 174 & 174 & 175 & 176 \\ \text { Ratio } & .84 & 1.31 & 1.42 & 1.03 & 1.07 & 1.08 & 1.04 \\ \text { Temp. } & 177 & 180 & 180 & 180 & 180 & 180 & 181 \\ \text { Ratio } & 1.80 & 1.45 & 1.60 & 1.61 & 2.13 & 2.15 & .84 \\ \text { Temp. } & 181 & 182 & 182 & 182 & 182 & 184 & 184 \\ \text { Ratio } & 1.43 & .90 & 1.81 & 1.94 & 2.68 & 1.49 & 2.52 \\ \text { Temp. } & 185 & 186 & 188 & & & & \\ \text { Ratio } & 3.00 & 1.87 & 3.08 & & & & \end{array} $$ a. Construct stem-and-leaf displays of both temperature and efficiency ratio, and comment on interesting features. b. Is the value of efficiency ratio completely and uniquely determined by tank temperature? Explain your reasoning. c. Construct a scatter plot of the data. Does it appear that efficiency ratio could be very well predicted by the value of temperature? Explain your reasoning.

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.