/*! This file is auto-generated */ .wp-block-button__link{color:#fff;background-color:#32373c;border-radius:9999px;box-shadow:none;text-decoration:none;padding:calc(.667em + 2px) calc(1.333em + 2px);font-size:1.125em}.wp-block-file__button{background:#32373c;color:#fff;text-decoration:none} Problem 23 Fuming because you are stuck in ... [FREE SOLUTION] | 91Ó°ÊÓ

91Ó°ÊÓ

Fuming because you are stuck in traffic? Roadway congestion is a costly item, in both time wasted and fuel wasted. Let \(x\) represent the average annual hours per person spent in traffic delays and let \(y\) represent the average annual gallons of fuel wasted per person in traffic delays. A random sample of eight cities showed the following data (Reference: Statistical Abstract of the United States, 122 nd Edition). $$ \begin{array}{l|llllllll} \hline x(\mathrm{hr}) & 28 & 5 & 20 & 35 & 20 & 23 & 18 & 5 \\ \hline y(\mathrm{gal}) & 48 & 3 & 34 & 55 & 34 & 38 & 28 & 9 \\ \hline \end{array} $$ (a) Draw a scatter diagram for the data. Verify that \(\Sigma x=154, \Sigma x^{2}=3712\), \(\Sigma y=249, \Sigma y^{2}=9959\), and \(\Sigma x y=6067\). Compute \(r\) The data in part (a) represent average annual hours lost per person and average annual gallons of fuel wasted per person in traffic delays. Suppose that instead of using average data for different cities, you selected one person at random from each city and measured the annual number of hours lost \(x\) for that person and the annual gallons of fuel wasted \(y\) for the same person. $$ \begin{array}{l|cccccccc} \hline x(\mathrm{hr}) & 20 & 4 & 18 & 42 & 15 & 25 & 2 & 35 \\ \hline y(\mathrm{gal}) & 60 & 8 & 12 & 50 & 21 & 30 & 4 & 70 \\ \hline \end{array} $$ (b) Compute \(\bar{x}\) and \(\bar{y}\) for both sets of data pairs and compare the averages. Compute the sample standard deviations \(s_{x}\) and \(s_{y}\) for both sets of data pairs and compare the standard deviations. In which set are the standard deviations for \(x\) and \(y\) larger? Look at the defining formula for \(r\), Equation \(1 .\) Why do smaller standard deviations \(s_{x}\) and \(s_{y}\) tend to increase the value of \(r\) ? (c) Make a scatter diagram for the second set of data pairs. Verify that \(\Sigma x=161, \quad \Sigma x^{2}=4583, \quad \Sigma y=255, \quad \Sigma y^{2}=12,565\), and \(\Sigma x y=7071 .\) Compute \(r\). (d) Compare \(r\) from part (a) with \(r\) from part (c). Do the data for averages have a higher correlation coefficient than the data for individual measurements? List some reasons why you think hours lost per individual and fuel wasted per individual might vary more than the same quantities averaged over all the people in a city.

Short Answer

Expert verified
The city data set has verified sums, compute using formulas.

Step by step solution

01

Verify Given Sums for Part (a)

Check the sums for the given city data set:- For hours \(x\): \(\Sigma x = 28 + 5 + 20 + 35 + 20 + 23 + 18 + 5 = 154\) (verified)- \(\Sigma x^2 = 28^2 + 5^2 + 20^2 + 35^2 + 20^2 + 23^2 + 18^2 + 5^2 = 3712\) (verified)- For gallons \(y\): \(\Sigma y = 48 + 3 + 34 + 55 + 34 + 38 + 28 + 9 = 249\) (verified)- \(\Sigma y^2 = 48^2 + 3^2 + 34^2 + 55^2 + 34^2 + 38^2 + 28^2 + 9^2 = 9959\) (verified)- \(\Sigma xy = 28\times48 + 5\times3 + 20\times34 + 35\times55 + 20\times34 + 23\times38 + 18\times28 + 5\times9 = 6067\)

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with 91Ó°ÊÓ!

Key Concepts

These are the key concepts you need to understand to accurately answer the question.

Scatter Diagram
A scatter diagram, also known as a scatter plot, is a graphical representation used to visualize the relationship between two quantitative variables. Each point on the diagram corresponds to a pair of values from the two datasets. For this specific exercise, one axis represents the average annual hours spent in traffic delays (variable \(x\)), and the other axis represents the average annual gallons of fuel wasted (variable \(y\)).

To create a scatter plot, you would plot each city’s data point as a coordinate on the diagram. For example, if city A has an \(x\)-value of 28 hours and a \(y\)-value of 48 gallons, you would place a point at (28, 48) on the chart. Doing this for all the data points allows us to visually inspect patterns or correlations between the variables.

By analyzing a scatter diagram, you can easily see if there's a positive or negative correlation between variables. A positive trend will show data points moving upwards as you go along the x-axis, suggesting an increase in one variable tends to correlate with an increase in the other variable.
Correlation Coefficient
The correlation coefficient, denoted as \(r\), measures the strength and direction of a linear relationship between two variables on a scatter plot. This coefficient ranges from -1 to 1.

  • An \(r\) value close to 1 implies a strong positive correlation, meaning as one variable increases, the other also increases.
  • An \(r\) value close to -1 indicates a strong negative correlation, where one variable decreases as the other increases.
  • An \(r\) value around 0 suggests no linear correlation between the variables.


For calculating \(r\), you utilize a formula involving the sums of the products of paired scores, as seen in the gesture towards Equation 1. Smaller standard deviations in the datasets result in a more established linear relationship, potentially increasing the value of \(r\). This is because smaller variances imply that data points are closer to the mean and vary less from each other.
Standard Deviation
Standard deviation is a measure of the amount of variation or dispersion present in a set of values. A low standard deviation means that the data points are close to the mean, whereas a high standard deviation indicates that the data points are spread out over a large range of values.

To compute the standard deviation for a sample set, you first calculate the sample variance. This involves taking each data point's deviation from the sample mean, squaring it, and then averaging these square deviations. The standard deviation \(s\) is the square root of this variance. Mathematically, it is expressed as:
\[ s = \sqrt{\frac{\sum (x_i - \bar{x})^2}{n-1}} \]
Where \(x_i\) represents each value in the dataset, \(\bar{x}\) is the sample mean, and \(n\) is the number of observations. Calculating for both \(x\) and \(y\) allows you to understand how spread out hours in traffic and fuel consumption are, respectively, across the dataset.
Sample Mean
The sample mean, denoted by \(\bar{x}\) or \(\bar{y}\), represents the average of a set of values. It's calculated by summing all the values and then dividing by the number of observations.

For example, if you have a set of data representing hours spent in traffic delays \([28, 5, 20, 35, 20, 23, 18, 5]\), you sum these values to get 154. To find the mean, you then divide by the number of observations, which is 8. So, the sample mean \(\bar{x}\) is \(154/8 = 19.25\) hours.

The same principle applies for finding the mean of gallons of fuel wasted, \(\bar{y}\). The mean provides a central value of the dataset, helping to understand the average trend within the data. It is pivotal for both descriptive statistics and drawing conclusions about populations and data behavior.

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Let \(x=\) day of observation and \(y=\) number of locusts per square meter during a locust infestation in a region of North Africa. $$ \begin{array}{l|llrrr} \hline x & 2 & 3 & 5 & 8 & 10 \\ \hline y & 2 & 3 & 12 & 125 & 630 \\ \hline \end{array} $$ (a) Draw a scatter diagram of the \((x, y)\) data pairs. Do you think a straight line will be a good fit to these data? Do the \(y\) values almost seem to explode as time goes on? (b) Now consider a transformation \(y^{\prime}=\log y .\) We are using common logarithms of base \(10 .\) Draw a scatter diagram of the \(\left(x, y^{\prime}\right)\) data pairs and compare this diagram with the diagram of part (a). Which graph appears to better fit a straight line? (c) Use a calculator with regression keys to find the linear regression equation for the data pairs \(\left(x, y^{\prime}\right) .\) What is the correlation coefficient? (d) The exponential growth model is \(y=\alpha \beta^{x}\). Estimate \(\alpha\) and \(\beta\) and write the exponential growth equation. Hint: See Problem 22 .

In baseball, is there a linear correlation between batting average and home run percentage? Let \(x\) represent the batting average of a professional baseball player, and let \(y\) represent the player's home run percentage (number of home runs per 100 times at bat). A random sample of \(n=7\) professional baseball players gave the following information (Reference: The Baseball Encyclopedia, Macmillan Publishing Company). $$ \begin{array}{l|lllllll} \hline x & 0.243 & 0.259 & 0.286 & 0.263 & 0.268 & 0.339 & 0.299 \\ \hline y & 1.4 & 3.6 & 5.5 & 3.8 & 3.5 & 7.3 & 5.0 \\ \hline \end{array} $$ (a) Make a scatter diagram and draw the line you think best fits the data. (b) Would you say the correlation is low, moderate, or high? positive or negative? (c) Use a calculator to verify that \(\Sigma x=1.957, \Sigma x^{2} \approx 0.553, \Sigma y=30.1\), \(\Sigma y^{2}=150.15\), and \(\Sigma x y \approx 8.753 .\) Compute \(r .\) As \(x\) increases, does the value of \(r\) imply that \(y\) should tend to increase or decrease? Explain.

When we use a least-squares line to predict \(y\) values for \(x\) values beyond the range of \(x\) values found in the data, are we extrapolating or interpolating? Are there any concerns about such predictions?

When we take measurements of the same general type, a power law of the form \(y=\alpha x^{\beta}\) often gives an excellent fit to the data. A lot of research has been conducted as to why power laws work so well in business, economics, biology, ecology, medicine, engineering, social science, and so on. Let us just say that if you do not have a good straight-line fit to data pairs \((x, y)\), and the scatter plot does not rise dramatically (as in exponential growth), then a power law is often a good choice. College algebra can be used to show that power law models become linear when we apply logarithmic transformations to both variables. To see how this is done, please read on. Note: For power law models, we assume all \(x>0\) and all \(y>0\). Suppose we have data pairs \((x, y)\) and we want to find constants \(\alpha\) and \(\beta\) such that \(y=\alpha x^{\beta}\) is a good fit to the data. First, make the logarithmic transformations \(x^{\prime}=\log x\) and \(y^{\prime}=\log y .\) Next, use the \(\left(x^{\prime}, y^{\prime}\right)\) data pairs and a calculator with linear regression keys to obtain the least-squares equation \(y^{\prime}=a+b x^{\prime} .\) Note that the equation \(y^{\prime}=a+b x^{\prime}\) is the same as \(\log y=a+b(\log x)\). If we raise both sides of this equation to the power 10 and use some college algebra, we get \(y=10^{a}(x)^{b}\). In other words, for the power law model, we have \(\alpha \approx 10^{a}\) and \(\beta \approx b\). In the electronic design of a cell phone circuit, the buildup of electric current \((\mathrm{Amps})\) is an important function of time (microseconds). Let \(x=\) time in microseconds and let \(y=\) Amps built up in the circuit at time \(x .\) $$ \begin{array}{l|lllll} \hline x & 2 & 4 & 6 & 8 & 10 \\ \hline y & 1.81 & 2.90 & 3.20 & 3.68 & 4.11 \\ \hline \end{array} $$ (a) Make the logarithmic transformations \(x^{\prime}=\log x\) and \(y^{\prime}=\log y .\) Then make a scatter plot of the \(\left(x^{\prime}, y^{\prime}\right)\) values. Does a linear equation seem to be a good fit to this plot? (b) Use the \(\left(x^{\prime}, y^{\prime}\right)\) data points and a calculator with regression keys to find the least-squares equation \(y^{\prime}=a+b x^{\prime} .\) What is the sample correlation coefficient? (c) Use the results of part (b) to find estimates for \(\alpha\) and \(\beta\) in the power law \(y=\alpha x^{\beta}\). Write the power law giving the relationship between time and Amp buildup. Note: The TI-84Plus/TI-83Plus/TI- \(n\) spire calculators fully support the power law model. Place the original \(x\) data in list L1 and the corresponding \(y\) data in list L2. Then press STAT, followed by CALC, and scroll down to option A: PwrReg. The output gives values for \(\alpha, \beta\), and the sample correlation coefficient \(r\).

Suppose you are interested in buying a new Lincoln Navigator or Town Car. You are standing on the sales lot looking at a model with different options. The list price is on the vehicle. As a salesperson approaches, you wonder what the dealer invoice price is for this model with its options. The following data are based on information taken from Consumer Guide (Vol. 677). Let \(x\) be the list price (in thousands of dollars) for a random selection of these cars of different models and options. Let \(y\) be the dealer invoice (in thousands of dollars) for the given vehicle. $$ \begin{array}{l|lllll} \hline x & 32.1 & 33.5 & 36.1 & 44.0 & 47.8 \\ \hline y & 29.8 & 31.1 & 32.0 & 42.1 & 42.2 \\ \hline \end{array} $$ (a) Verify that \(\Sigma x=193.5, \quad \Sigma y=177.2, \quad \Sigma x^{2}=7676.71, \quad \Sigma y^{2}=6432.5\), \(\Sigma x y=7023.19\), and \(r \approx 0.977\) (b) Use a \(1 \%\) level of significance to test the claim that \(\rho>0\). (c) Verify that \(S_{e} \approx 1.5223, a \approx 1.4084\), and \(b \approx 0.8794\). (d) Find the predicted dealer invoice when the list price is \(x=40\) (thousand dollars). (e) Find a \(95 \%\) confidence interval for \(y\) when \(x=40\) (thousand dollars). (f) Use a \(1 \%\) level of significance to test the claim that \(\beta>0\). (g) Find a \(90 \%\) confidence interval for \(\beta\) and its meaning.

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.