/*! This file is auto-generated */ .wp-block-button__link{color:#fff;background-color:#32373c;border-radius:9999px;box-shadow:none;text-decoration:none;padding:calc(.667em + 2px) calc(1.333em + 2px);font-size:1.125em}.wp-block-file__button{background:#32373c;color:#fff;text-decoration:none} Q48 E Construct a paired data set for ... [FREE SOLUTION] | 91影视

91影视

Construct a paired data set for which\(t = \infty \), so that the data is highly significant when the correct analysis is used, yet \(t\) for the two-sample \(t\)test is quite near zero, so the incorrect analysis yields an insignificant result.

Short Answer

Expert verified

\({\rm{ The incorrect analysis yields an insignificant result }}\)

Step by step solution

01

To find the paired t test

The Paired \(t\)Test:

When \(D = X - Y\)(difference between observations within a pair), \({\mu _D} = {\mu _1} - {\mu _2}\) and null hypothesis

\({H_0}:{\mu _D} = {\Delta _0}\)

the test statistic value for testing the hypotheses is

\(t = \frac{{\bar d - {\Delta _0}}}{{{s_D}/\sqrt n }}\)

where \(\bar d\) and \({s_D}\)are the sample mean and the sample standard deviation of differences\({d_i}\), respectively. In order to use this test, assume that the differences \({D_i}\)are from a normal population. Depending on alternative hypothesis, the \(P\)value can be determined as the corresponding area under the \({t_{n - 1}}\)curve.

02

To find the data set for which t is infinite

The goal here is to find data set for which t is infinite. From

\(t = \frac{{\bar d - {\Delta _0}}}{{{s_D}/\sqrt n }}\)

t is infinite when \({s_D}\) is zero.

The Sample Variance \({s^2}\)is

\({s^2} = \frac{1}{{n - 1}} \cdot {S_{xx}}\)

where

\({S_{xx}} = {\left( {{x_i} - \bar x} \right)^2} = \;\;\;x_i^2 - \frac{1}{n} \cdot {\left( {{x_i}} \right)^2}\)

The Sample Standard Deviation s is

\(s = \sqrt {{s^2}} = \sqrt {\frac{1}{{n - 1}} \cdot {S_{xx}}} \)

From the definition, \({s_D}\)is zero if and only if

\({x_i} = \bar x\)

for every\(i = 1,2, \ldots ,n\). This now indicates that all data should be equal.

NOTE: the data for the differences should be equal, not the paired data set.

From this conclusion, let's take that the difference is for example\(0.5\), the chosen data set is given in the table below (taking numbers for which \({x_i} - {y_i} = \)0.5)

i

First,\({x_i}\)

Second,\({y_i}\)

Difference,\({d_i} = {x_i} - {y_i}\)

1

22.5

22

0.5

2

35.5

35

0.5

3

48.5

48

0.5

4

61.5

61

0.5

5

74.5

74

0.5

6

87.5

87

0.5

7

100.5

100

0.5

8

113.5

113

0.5

9

126.5

126

0.5

10

139.5

139

0.5

11

152.5

152

0.5

12

165.5

165

0.5

13

178.5

178

0.5

Obviously, for the paired t test t=0, because the sample standard deviation for the differences is

\({s_D} = \sqrt {\frac{1}{{12}} \cdot \left( {{{(0.5 - 0.5)}^2} + {{(0.5 - 0.5)}^2} + \ldots + {{(0.5 - 0.5)}^2}} \right)} = 0\)

thus

\(t = \frac{{\bar d - {\Delta _0}}}{{{s_D}/\sqrt n }} = \frac{{0.5}}{{0/\sqrt {13} }} = \infty \)

03

To find what happens with t for two sample t test

Let's see what happens with t for the two sample t test.

The two-sample t test for testing \({H_0}:{\mu _1} - {\mu _2} = {\Delta _0}\)uses the following value of test statistic

\(t = \frac{{\bar x - \bar y - {\Delta _0}}}{{\sqrt {\frac{{s_1^2}}{m} + \frac{{s_2^2}}{n}} }}\)

For the adequate alternative hypothesis the adequate area under the \({t_\nu }\)curve is the P value.

First data set:

the sample mean of the first data set is

\(\bar x = \frac{1}{{13}} \cdot (22.5 + 35.5 + \ldots + 178.5) = 100.5\)

and the sample standard deviation is

\(\begin{array}{l}{s_1} = \sqrt {\frac{1}{{12}} \cdot \left( {{{(22.5 - 100.5)}^2} + {{(35.5 - 100.5)}^2} + \ldots + {{(178.5 - 100.5)}^2}} \right)} \\ = 50.63\end{array}\)

and m=13.

Second data set:

there is only difference in the sample mean by 0.5 because the data set was chosen like that, however compute the sample mean of the second data set

\(\bar y = \frac{1}{{13}} \cdot (22 + 35 + \ldots + 178) = 100\)

and the sample standard deviation is

\(\begin{array}{l}{s_2} = \sqrt {\frac{1}{{12}} \cdot \left( {{{(22 - 100)}^2} + {{(35 - 100)}^2} + \ldots + {{(178 - 100)}^2}} \right)} \\ = 50.63\end{array}\)

and n=13.

The two sample t test statistic value is

\(t = \frac{{\bar x - \bar y - {\Delta _0}}}{{\sqrt {\frac{{s_1^2}}{m} + \frac{{s_2^2}}{n}} }} = \frac{{100.5 - 100}}{{50.63 \cdot \sqrt {\frac{2}{{13}}} }} = 0.025\)

which is nearly zero.

The incorrect analysis yields an insignificant result

04

Final proof

\({\rm{ The incorrect analysis yields an insignificant result }}\)

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with 91影视!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Consider the accompanying data on breaking load (\(kg/25\;mm\)width) for various fabrics in both an unabraded condition and an abraded condition (\(^6\)The Effect of Wet Abrasive Wear on the Tensile Properties of Cotton and Polyester-Cotton Fabrics," J. Testing and Evaluation, \(\;1993:84 - 93\)). Use the paired\(t\)test, as did the authors of the cited article, to test\({H_0}:{\mu _D} = 0\)versus\({H_a}:{\mu _D} > 0\)at significance level . \(01\)

Consider the pooled\(t\)variable

\(T = \frac{{(\bar X - \bar Y) - \left( {{\mu _1} - {\mu _2}} \right)}}{{{S_p}\sqrt {\frac{1}{m} + \frac{1}{n}} }}\)

which has a\(t\)distribution with\(m + n - 2\)df when both population distributions are normal with\({\sigma _1} = {\sigma _2}\)(see the Pooled\(t\)Procedures subsection for a description of\({S_p}\)).

a. Use this\(t\)variable to obtain a pooled\(t\)confidence interval formula for\({\mu _1} - {\mu _2}\).

b. A sample of ultrasonic humidifiers of one particular brand was selected for which the observations on maximum output of moisture (oz) in a controlled chamber were\(14.0, 14.3, 12.2\), and 15.1. A sample of the second brand gave output values\(12.1, 13.6\),\(11.9\), and\(11.2\)("Multiple Comparisons of Means Using Simultaneous Confidence Intervals," J. of Quality Technology, \(1989: 232 - 241\)). Use the pooled\(t\)formula from part (a) to estimate the difference between true average outputs for the two brands with a\(95\% \)confidence interval.

c. Estimate the difference between the two\(\mu \)'s using the two-sample\(t\)interval discussed in this section, and compare it to the interval of part (b).

The article "Enhancement of Compressive Properties of Failed Concrete Cylinders with Polymer Impregnation" (J. of Testing and Evaluation, 1977: 333-337) reports the following data on impregnated compressive modulus (\(psi\)\( \times 1{0^6}\)) when two different polymers were used to repair cracks in failed concrete.

\(\begin{array}{*{20}{l}}{ Epoxy }&{1.75}&{2.12}&{2.05}&{1.97}\\{ MMA prepolymer }&{1.77}&{1.59}&{1.70}&{1.69}\end{array}\)

Obtain a \(90\% \) CI for the ratio of variances by first using the method suggested in the text to obtain a general confidence interval formula.

To decide whether two different types of steel have the same true average fracture toughness values, n specimens of each type are tested, yielding the following results:

\(\begin{array}{l}\underline {\begin{array}{*{20}{c}}{ Type }&{ Sample Average }&{ Sample SD }\\1&{60.1}&{1.0}\\2&{59.9}&{1.0}\\{}&{}&{}\end{array}} \\\end{array}\)

Calculate the P-value for the appropriate two-sample \(z\) test, assuming that the data was based on \(n = 100\). Then repeat the calculation for \(n = 400\). Is the small P-value for \(n = 400\) indicative of a difference that has practical significance? Would you have been satisfied with just a report of the P-value? Comment briefly.a

Is there any systematic tendency for part-time college faculty to hold their students to different standards than do full-time faculty? The article 鈥淎re There Instructional Differences Between Full-Time and Part-Time Faculty?鈥 (College Teaching, \(2009: 23 - 26)\) reported that for a sample of \(125 \) courses taught by fulltime faculty, the mean course \(GPA\) was \(2.7186\) and the standard deviation was \(.63342\), whereas for a sample of \(88\) courses taught by part-timers, the mean and standard deviation were \(2.8639\) and \(.49241,\) respectively. Does it appear that true average course \(GPA\) for part-time faculty differs from that for faculty teaching full-time? Test the appropriate hypotheses at significance level \( .01\).

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.