/*! This file is auto-generated */ .wp-block-button__link{color:#fff;background-color:#32373c;border-radius:9999px;box-shadow:none;text-decoration:none;padding:calc(.667em + 2px) calc(1.333em + 2px);font-size:1.125em}.wp-block-file__button{background:#32373c;color:#fff;text-decoration:none} Problem 8 The article "A Thin-Film Oxygen ... [FREE SOLUTION] | 91Ó°ÊÓ

91Ó°ÊÓ

The article "A Thin-Film Oxygen Uptake Test for the Evaluation of Automotive Crankease Lubricants" (Lubric. Engr., 1984: 75-83) reported the following data on oxidation-induction time (min) for various commercial oils: \(\begin{array}{lllllllllll}87 & 103 & 130 & 160 & 180 & 195 & 132 & 145 & 211 & 105 & 145\end{array}\) \(\begin{array}{llllllll}153 & 152 & 138 & 87 & 99 & 93 & 119 & 129\end{array}\) a. Calculate the sample variance and standard deviation. b. If the observations were reexpressed in hours, what would be the resulting values of the sample variance and sample standard deviation? Answer without actually performing the reexpression.

Short Answer

Expert verified
Variance is 1304.58 and standard deviation is 36.12; in hours, they are scaled down by 1/3600 and 1/60.

Step by step solution

01

Calculate the Mean

The mean of a data set is given by the sum of all observations divided by the total number of observations. Sum all the values of the oxidation-induction times and divide by 19 (the number of observations). \[ \bar{x} = \frac{1}{19}(87 + 103 + 130 + 160 + 180 + 195 + 132 + 145 + 211 + 105 + 145 + 153 + 152 + 138 + 87 + 99 + 93 + 119 + 129) \] \[ \bar{x} = 133.42 \]
02

Calculate the Deviations from the Mean

Subtract the mean from each observation to find the deviations: \[ 87 - 133.42, \ 103 - 133.42, \ldots, \ 129 - 133.42 \] The deviations will be used in calculating the variance.
03

Calculate the Squared Deviations

Square each of the deviations from the mean. For instance, for 87, the deviation is \( 87 - 133.42 \) and squaring it gives \( (87 - 133.42)^2 \). Do this for each observation.
04

Calculate the Sample Variance

The sample variance is calculated by averaging these squared deviations: \[ s^2 = \frac{1}{n-1} \sum_{i=1}^{n} (x_i - \bar{x})^2 \] where \( n = 19 \). Sum all squared deviations and divide by 18 (since this is a sample).\[ s^2 \approx \frac{1}{18}((87 - 133.42)^2 + (103 - 133.42)^2 + \ldots + (129 - 133.42)^2) \approx 1304.58 \]
05

Calculate the Sample Standard Deviation

The sample standard deviation is the square root of the sample variance: \[ s = \sqrt{1304.58} \approx 36.12 \]
06

Reexpress Observations in Hours

Each observation in minutes converts to hours by dividing by 60. This affects calculations involving variances and standard deviations.
07

Determine Effect of Reexpression on Variance and Standard Deviation

When reexpressing in hours, variance becomes:\[ (\frac{1}{60})^2 \times 1304.58 \] Standard deviation becomes:\[ \frac{1}{60} \times 36.12 \] Variance and standard deviation are scaled by the square and normal factor of \(\frac{1}{60}\) respectively: The new variance is \(\frac{1304.58}{3600}\) and the new standard deviation is \(\frac{36.12}{60}\).

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with 91Ó°ÊÓ!

Key Concepts

These are the key concepts you need to understand to accurately answer the question.

Mean Calculation
Calculating the mean is the first step in many statistical analyses, providing the average value of a dataset. To find the mean of a set of numbers, add them all together and divide by the number of observations. For example, in our dataset about oxidation-induction times, we sum up each value provided and then divide this sum by the number of observations, which in this case is 19. The equation looks like this:\[\bar{x} = \frac{1}{19}(87 + 103 + 130 + \ldots + 129)\]Calculating this gives us a mean (\bar{x}) of 133.42 minutes. This mean represents the central tendency of our sample data and will be used in calculating both the variance and standard deviation.
Data Reexpression
Data reexpression involves transforming your dataset using a consistent mathematical operation. In our case, we need to reexpress the observation times from minutes to hours. Since there are 60 minutes in an hour, we simply divide each observation by 60. It’s important to understand how this affects calculations for variance and standard deviation. When reexpressing data, one must adjust the calculations accordingly. For a variance, which is a squared value, if each original observation is divided by a factor, i.e., 60, then the variance is divided by 60 squared. For standard deviation, which is not squared, we simply divide by 60. This understanding of data transformation allows us to apply results in different units without recalculating everything anew.
Sample Variance
Sample variance describes how data points in a sample differ from the mean. It indicates the degree of spread in our data. Calculating it involves several steps:
  • First, compute the deviation of each observation from the mean. Subtract the mean calculated earlier from each data point.
  • Next, square each deviation. This eliminates negative values and emphasizes larger differences.
  • Finally, sum these squared deviations and divide by \(n-1\), where \(n\) is the number of observations. The formula to calculate sample variance is:\[s^2 = \frac{1}{n-1} \sum_{i=1}^{n} (x_i - \bar{x})^2\]For our dataset, this computation results in a sample variance of 1304.58 minutes squared.
Variance is crucial for determining how varied or similar the dataset is to the mean, providing foundational understanding before calculating standard deviation.
Sample Standard Deviation
Sample standard deviation provides insight into the amount of variation or dispersion in a dataset. A low standard deviation means data points are close to the mean, while a high standard deviation indicates more spread out data.To find the standard deviation, take the square root of the sample variance. Using our example's variance of 1304.58, the standard deviation is:\[s = \sqrt{1304.58} \approx 36.12\]This tells us that, on average, our data points deviate from the mean by about 36.12 minutes. This measure helps us understand data volatility and is often used to compare different datasets or to set expectations for future observations. When data is reexpressed, remember that standard deviation changes linearly with the transformation factor, in contrast to variance that changes quadratically.

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Many universities and colleges have instituted supplemental instruction (SI) programs, in which a student facilitator meets regularly with a small group of students enrolled in the course to promote discussion of course material and enhance subject mastery. Suppose that students in a large statistics course (what else?) are randomly divided into a control group that will not participate in SI and a treatment group that will participate. At the end of the term, each student's total score in the course is determined. a. Are the scores from the SI group a sample from an existing population? If so, what is it? If not, what is the relevant conceptual population? b. What do you think is the advantage of randomly dividing the students into the two groups rather than letting each student choose which group to join? c. Why didn't the investigators put all students in the treatment group? [Note: The article 'Supplemental Instruction: An Effective Component of Student Affairs Programming" \(U\). of College Student Devel., 1997: 577-586) discusses the analysis of data from several SI programs.]

In a famous experiment carried out in 1882 , Michelson and Newcomb obtained 66 observations on the time it took for light to travel between two locations in Washington, D.C. A few of the measurements (coded in a certain manner) were \(31,23,32,36,-2,26,27\), and 31 . a. Why are these measurements not identical? b. Is this an enumerative study? Why or why not?

Consider numerical observations \(x_{1}, \ldots, x_{n^{-}}\)- It is frequently of interest to know whether the \(x_{i}\) s are (at least approximately) symmetrically distributed about some value. If \(n\) is at least moderately large, the extent of symmetry can be assessed from a stem-and-leaf display or histogram. However, if \(n\) is not very large, such pictures are not particularly informative. Consider the following alternative. Let \(y_{1}\) denote the smallest \(x_{p}, y_{2}\) the second smallest \(x_{i}\), and so on. Then plot the following pairs as points on a two-dimensional coordinate system: \(\left(y_{n}-\tilde{x}, \tilde{x}-y_{1}\right)\), \(\left(y_{n-1}-\tilde{x}, \tilde{x}-y_{2}\right),\left(y_{n-2}-\tilde{x}, \tilde{x}-y_{3}\right), \ldots\) There are \(n / 2\) points when \(n\) is even and \((n-1) / 2\) when \(n\) is odd. a. What does this plot look like when there is perfect symmetry in the data? What does it look like when observations stretch out more above the median than below it (a long upper tail)? 83. Consider numerical observations \(x_{1}, \ldots, x_{n^{-}}\)- It is frequently of interest to know whether the \(x_{i}\) s are (at least approximately) symmetrically distributed about some value. If \(n\) is at least moderately large, the extent of symmetry can be assessed from a stem-and-leaf display or histogram. However, if \(n\) is not very large, such pictures are not particularly informative. Consider the following alternative. Let \(y_{1}\) denote the smallest \(x_{p}, y_{2}\) the second smallest \(x_{i}\), and so on. Then plot the following pairs as points on a two-dimensional coordinate system: \(\left(y_{n}-\tilde{x}, \tilde{x}-y_{1}\right)\), \(\left(y_{n-1}-\tilde{x}, \tilde{x}-y_{2}\right),\left(y_{n-2}-\tilde{x}, \tilde{x}-y_{3}\right), \ldots\) There are \(n / 2\) points when \(n\) is even and \((n-1) / 2\) when \(n\) is odd. a. What does this plot look like when there is perfect symmetry in the data? What does it look like when observations stretch out more above the median than below it (a long upper tail)?

Consider the population consisting of all computers of a certain brand and model, and focus on whether a computer needs service while under warranty. a. Pose several probability questions based on selecting a sample of 100 such computers. b. What inferential statistics question might be answered by determining the number of such computers in a sample of size 100 that need warranty service?

In 1997 a woman sued a computer keyboard manufacturer, charging that her repetitive stress injuries were caused by the keyboard (Genessy \(v\). Digital Equipment Corp.). The injury awarded about \(\$ 3.5\) million for pain and suffering, but the court then set aside that award as being unreasonable compensation. In making this determination, the court identified a "normative" group of 27 similar cases and specified a reasonable award as one within two standard deviations of the mean of the awards in the 27 cases. The 27 awards were (in \(\$ 1000\) s) 37,60 , \(75,115,135,140,149,150,238,290,340,410,600,750\), \(750,750,1050,1100,1139,1150,1200,1200,1250\), \(1576,1700,1825\), and 2000 , from which \(\Sigma x_{i}=20,179\), \(\Sigma x_{i}^{2}=24,657,511\). What is the maximum possible amount that could be awarded under the two-standarddeviation rule?

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.