/*! This file is auto-generated */ .wp-block-button__link{color:#fff;background-color:#32373c;border-radius:9999px;box-shadow:none;text-decoration:none;padding:calc(.667em + 2px) calc(1.333em + 2px);font-size:1.125em}.wp-block-file__button{background:#32373c;color:#fff;text-decoration:none} Problem 16 To decide whether two different ... [FREE SOLUTION] | 91Ó°ÊÓ

91Ó°ÊÓ

To decide whether two different types of steel have the same true average fracture toughness values, \(n\) specimens of each type are tested, yielding the following results: \begin{tabular}{ccc} Type & Sample Average & Sample SD \\ \hline 1 & \(60.1\) & \(1.0\) \\ 2 & \(59.9\) & \(1.0\) \\ \hline \end{tabular} Calculate the \(P\)-value for the appropriate two-sample \(z\) test, assuming that the data was based on \(n=100\). Then repeat the calculation for \(n=400\). Is the small \(P\)-value for \(n=400\) indicative of a difference that has practical significance? Would you have been satisfied with just a report of the \(P\)-value? Comment briefly.

Short Answer

Expert verified
For \(n=100\), \(P\)-value is 0.157; for \(n=400\), \(P\)-value is 0.0047. Not practically significant. P-value alone is not enough.

Step by step solution

01

Specify the Hypotheses

We are conducting a two-sample \(z\) test. The null hypothesis \(H_0\) assumes that the true average fracture toughness of the two types of steel is the same, i.e., \( \mu_1 = \mu_2 \). The alternative hypothesis \(H_a\) assumes that the two averages are different, i.e., \( \mu_1 eq \mu_2 \).
02

Calculate the Test Statistic for n=100

The test statistic \(z\) is calculated using the formula:\[z = \frac{\bar{x}_1 - \bar{x}_2}{\sqrt{\frac{s_1^2}{n} + \frac{s_2^2}{n}}}\]Substituting the values for \(n=100\), \(\bar{x}_1=60.1\), \(\bar{x}_2=59.9\), and \(s_1 = s_2 = 1.0\), we get:\[z = \frac{60.1 - 59.9}{\sqrt{\frac{1^2}{100} + \frac{1^2}{100}}} = \frac{0.2}{\sqrt{0.02}} = \frac{0.2}{0.1414} = 1.414\]
03

Calculate the P-value for n=100

To find the \(P\)-value, use the standard normal distribution table. The \(z\)-score calculated is \(1.414\). The \(P\)-value for this two-tailed test is approximately 2 times the area in the right tail beyond \(1.414\), which is approximately \(0.157\).
04

Calculate the Test Statistic for n=400

Using the same formula as in Step 2, but with \(n=400\):\[z = \frac{0.2}{\sqrt{\frac{1^2}{400} + \frac{1^2}{400}}} = \frac{0.2}{\sqrt{0.005}} = \frac{0.2}{0.0707} \approx 2.828\]
05

Calculate the P-value for n=400

Using the standard normal distribution for \(z = 2.828\), the \(P\)-value for this two-tailed test is approximately 0.0047.
06

Interpret the Results

For \(n = 100\), a \(P\)-value of 0.157 suggests no significant difference at typical significance levels (e.g., 0.05). For \(n = 400\), the \(P\)-value of 0.0047 is small enough to reject \(H_0\) at the 0.05 level, indicating statistical significance. However, the difference in fracture toughness of 0.2 units is very small, so it may not be practically significant despite being statistically significant. Reporting only the \(P\)-value might be misleading without understanding the context and practical implications of the difference.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with 91Ó°ÊÓ!

Key Concepts

These are the key concepts you need to understand to accurately answer the question.

Hypothesis Testing
Hypothesis testing is a method used in statistics to decide whether a certain belief or claim about a population parameter is supported by evidence collected from a sample. In our exercise, we want to determine if two types of steel have the same average fracture toughness using a two-sample z test. We start by setting up two hypotheses: the null hypothesis (H_0 ) and the alternative hypothesis (H_a ).
  • The null hypothesis (H_0 ) is a statement of no effect, suggesting that both types of steel have the same average fracture toughness, namely \( \mu_1 = \mu_2 \).
  • The alternative hypothesis (H_a ) contrasts H_0 and proposes that the averages are different, i.e., \( \mu_1 e \mu_2 \).
Hypothesis testing uses sample data to assess the evidence against the null hypothesis. If the data is highly unlikely under H_0, we may conclude that H_a is more likely true. The formal decision is made by comparing the calculated p-value to a pre-decided significance level (usually 0.05). If the p-value is smaller, we reject H_0 , indicating the evidence supports the alternative hypothesis.
P-value Calculation
The p-value is a crucial part of hypothesis testing, representing the probability of observing results as extreme as, or more extreme than, those measured, assuming the null hypothesis is true. In our exercise, we calculate the p-value for each sample size (n=100 and n=400) using the following steps:
  • Calculate the test statistic ( z -score), which indicates how many standard deviations away the sample mean difference is from zero under H_0 .
  • For instance, for n = 100 , the z -score is 1.414. For n = 400 , it's 2.828.
  • Using the standard normal distribution, find the area in the tails beyond the calculated z -score. This is done through statistical tables or software.
  • The p-value for n = 100 is about 0.157, and for n = 400 , it's approximately 0.0047.
P-value helps assess whether the observed data is consistent with H_0. Lower p-values suggest that the data is unlikely under H_0, hence providing evidence in favor of H_a. However, it doesn't express the size or importance of the difference found.
Statistical Significance
Statistical significance is about determining whether an observed effect could reasonably occur by chance. It's commonly assessed by comparing the p-value against a significance level, often set at 0.05.
  • If the p-value is less than the significance level, the result is deemed statistically significant, suggesting strong enough evidence to reject the null hypothesis.
  • In our steel example, with n = 100 producing a p-value of 0.157, we do not find statistical significance.
  • Yet for n = 400 , a p-value of 0.0047 indicates statistical significance, allowing us to reject the null hypothesis.
However, statistical significance does not imply that the findings are practically important or large. It simply says that the effect observed is unlikely to have arisen merely by chance. Thus, it's crucial to further investigate whether the difference is meaningful in real-world terms.
Practical Significance
Practical significance evaluates whether the size of an effect is large enough to be of value in a practical sense, rather than merely being statistically significant.
  • In our example, while the difference in fracture toughness was statistically significant for n = 400 with a p-value of 0.0047, the actual difference of 0.2 units in average fracture toughness is too minor to matter practically.
  • It's essential to consider whether the observed difference is relevant in a real-world context, such as engineering standards or cost efficiency.
Practical significance helps us understand the real-world impact of our findings beyond the mathematical computations of statistical significance. So, simply reporting the p-value without context might paint an incomplete picture. Practical implications should always accompany statistical results for a comprehensive evaluation.

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Tensile-strength tests were carried out on two different grades of wire rod ('"Fuidized Bed Patenting of Wire Rods, Wire \(J .\) J June 1977: \(56-61\) ), resulting in the accompanying data. \begin{tabular}{lccc} Grade & Sample Size & Sample Mean \(\left(\mathrm{kg} / \mathrm{mm}^{2}\right)\) & Sample SD \\ \hline AISI 1064 & \(m=129\) & \(\bar{x}=107.6\) & \(s_{1}=1.3\) \\ AISI 1078 & \(n=129\) & \(\bar{y}=123.6\) & \(s_{2}=2.0\) \\ \hline \end{tabular} a. Does the data provide compelling evidence for concluding that true average strength for the 1078 grade exceeds that for the 1064 grade by more than \(10 \mathrm{~kg} /\) \(\mathrm{mm}^{2}\) ? Test the appropriate hypotheses using a significance level of \(.01\). b. Estimate the difference between true average strengths for the two grades in a way that provides information about precision and reliability.

81\. The accompanying data on response time appeared in the article "The Extinguishment of Fires Using Low-Flow Water Hose Streams-Part 11" (Fire Technology, 1991: 291-320). Good visibility Poor visibility \(\begin{array}{llllllll}.43 & 1.17 & .37 & .47 & .68 & .58 & .50 & 2.75\end{array}\) \(\begin{array}{llllllll}1.47 & .80 & 1.58 & 1.53 & 4.33 & 4.23 & 3.25 & 3.22\end{array}\) The authors analyzed the data with the pooled \(t\) test. Does the use of this test appear justified? [Hint: Check for normality. The \(z\) percentiles for \(n=8\) are \(-1.53,-.89,-.49,-.15\), \(.15, .49, .89\), and 1.53.]

According to the article "Modeling and Predicting the Effects of Submerged Arc Weldment Process Parameters on Weldment Characteristics and Shape Profiles" ( \(J\). of Engr. Manuf., 2012: 1230-1240), the submerged arc welding (SAW) process is commonly used for joining thick plates and pipes. The heat affected zone (HAZ), a band created within the base metal during welding, was of particular interest to the investigators. Here are observations on depth (mm) of the HAZ both when the current setting was high and when it was lower. \(\begin{array}{llllll}\text { Non-high } & 1.04 & 1.15 & 1.23 & 1.69 & 1.92 \\\ & 1.98 & 2.36 & 2.49 & 2.72 & \\ & 1.37 & 1.43 & 1.57 & 1.71 & 1.94 \\ & 2.06 & 2.55 & 2.64 & 2.82 & \\ \text { High } & 1.55 & 2.02 & 2.02 & 2.05 & 2.35 \\ & 2.57 & 2.93 & 2.94 & 2.97 & \end{array}\) a. Construct a comparative boxplot and comment on interesting features. b. Is it reasonable to use the two-sample \(t\) test to test hypotheses about the difference between true average HAZ depths for the two conditions? c. Does it appear that true average HAZ depth is larger for the higher current condition than for the lower condition? Carry out a test of appropriate hypotheses using a significance level of .01.

Wait staff at restaurants have employed various strategies to increase tips. An article in the Sept. 5, 2005, New Yorker reported that "In one study a waitress received \(50 \%\) more in tips when she introduced herself by name than when she didn't." Consider the following (fictitious) data on tip amount as a percentage of the bill: Introduction: \(\quad m=50 \quad \bar{x}=22.63 \quad s_{1}=7.82\) No introduction: \(\quad n=50 \quad \bar{y}=14.15 \quad s_{2}=6.10\) Does this data suggest that an introduction increases tips on average by more than \(50 \%\) ? State and test the relevant hypotheses. [Hint: Consider the parameter \(\left.\theta=\mu_{1}-1.5 \mu_{2} .\right]\)

The article "Flexure of Concrete Beams Reinforced with Advanced Composite Orthogrids" (J. of Aerospace Engr. 1997: 7-15) gave the accompanying data on ultimate load (kN) for two different types of beams. \begin{tabular}{lccc} Type & Sample Size & Sample Mean & Sample SD \\ \hline Fiberglass grid & 26 & \(33.4\) & \(2.2\) \\ Commercial carbon grid & 26 & \(42.8\) & \(4.3\) \\ \hline \end{tabular} a. Assuming that the underlying distributions are normal, calculate and interpret a \(99 \%\) CI for the difference between true average load for the fiberglass beams and that for the carbon beams. b. Does the upper limit of the interval you calculated in part (a) give a \(99 \%\) upper confidence bound for the difference between the two \(\mu\) 's? If not, calculate such a bound. Does it strongly suggest that true average load for the carbon beams is more than that for the fiberglass beams? Explain.

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.