/*! This file is auto-generated */ .wp-block-button__link{color:#fff;background-color:#32373c;border-radius:9999px;box-shadow:none;text-decoration:none;padding:calc(.667em + 2px) calc(1.333em + 2px);font-size:1.125em}.wp-block-file__button{background:#32373c;color:#fff;text-decoration:none} Problem 9 Sand and clay studies were condu... [FREE SOLUTION] | 91Ó°ÊÓ

91Ó°ÊÓ

Sand and clay studies were conducted at the West Side Field Station of the University of California (Reference: Professor D. R. Nielsen, University of California, Davis). Twelve consecutive depths, each about \(15 \mathrm{~cm}\) deep, were studied and the following percentages of clay in the soil were recorded. $$ \begin{array}{llllllllllll} 19.0 & 27.0 & 30.0 & 24.3 & 33.2 & 27.5 & 24.2 & 18.0 & 16.2 & 8.3 & 1.0 & 0.0 \end{array} $$ (i) Convert this sequence of numbers to a sequence of symbols \(\mathrm{A}\) and \(\mathrm{B}\), where A indicates a value above the median and B a value below the median. (ii) Test the sequence for randomness about the median. Use \(\alpha=0.05\).

Short Answer

Expert verified
The sequence is not random regarding the median, given the significant Z-value.

Step by step solution

01

Find the Median of the Data

First, arrange the data in ascending order: \(0.0, 1.0, 8.3, 16.2, 18.0, 19.0, 24.2, 24.3, 27.0, 27.5, 30.0, 33.2\). There are 12 data points. The median will be the average of the 6th and 7th values, which are \(19.0\) and \(24.2\). Calculate the median as \(\frac{19.0 + 24.2}{2} = 21.6\). Thus, the median is 21.6.
02

Convert to Symbols A and B

For each value, determine if it is above or below the median \(21.6\). Values greater than 21.6 are replaced with 'A', and values less than or equal to 21.6 are replaced with 'B'. The converted sequence is: B, A, A, A, A, A, B, B, B, B, B, B.
03

Calculate the Runs in the Sequence

A run is a sequence of consecutive identical symbols. For the sequence: B, A, A, A, A, A, B, B, B, B, B, B, there are 2 runs of 'A' and 2 runs of 'B'. So, there are 2 runs in total: \(5A, 7B\).
04

Check Randomness of the Sequence

Check if the number of runs is consistent with the randomness hypothesis at \(\alpha = 0.05\). Use the formula for the expected number of runs: \(E(R) = \frac{2n_1n_2}{n} + 1\), where \(n_1=6\) (number of B's), \(n_2=6\) (number of A's), and \(n=12\). Find the variance \(\sigma^2(R) = \frac{2n_1n_2(2n_1n_2 - n)}{n^2(n-1)}\). Substituting the values: \(E(R) = \frac{2(6)(6)}{12} + 1 = 7\) and \(\sigma^2(R) = \frac{2(6)(6)(72-12)}{12^2 \cdot 11} = \frac{360}{1584} \approx 0.227\).
05

Decision on Randomness

To decide if the sequence is random, compare the actual number of runs (2) to the expected runs \(E(R) = 7\). Use the Z-test for runs: \(Z = \frac{R - E(R)}{\sqrt{\sigma^2(R)}}\), \(Z = \frac{2 - 7}{\sqrt{0.227}} \approx -10.38\). The critical Z-value for \(\alpha = 0.05\) is \(\pm 1.96\). Since \(|Z| > 1.96\), the sequence is not random concerning the median.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with 91Ó°ÊÓ!

Key Concepts

These are the key concepts you need to understand to accurately answer the question.

Median
In statistics, the median is a crucial measure of central tendency. It represents the middle value in a data set when the values are ordered in ascending or descending order.
Unlike the mean, the median is less affected by extreme values or outliers, making it a more robust indicator of central location, especially in skewed distributions.
When calculating the median, if the data set has an odd number of values, the median is simply the middle number. However, if there is an even number of values, the median is the average of the two middle numbers.
For example, with the data set:
  • 0.0, 1.0, 8.3, 16.2, 18.0, 19.0, 24.2, 24.3, 27.0, 27.5, 30.0, 33.2
The median will be the average of the 6th and 7th values because there are 12 entries, which are 19.0 and 24.2.
Thus, the median is calculated as \(\frac{19.0 + 24.2}{2} = 21.6\). This median provides a cut-off point for splitting the data into two halves, which is useful for subsequent analysis like testing for randomness.
Randomness test
A randomness test is used to determine whether a sequence of data points can be considered random. It's essential in various fields like quality control and statistical analysis to ensure that no underlying pattern is skewing results.
In this exercise, we're testing the randomness of a data sequence concerning its median. By converting the data to symbols \(A\) and \(B\), where \(A\) indicates values above the median and \(B\) for values below, we can visually inspect the sequence for runs.
A run is a consecutive sequence of the same symbol, and the number of runs is calculated to test for randomness. The formula for the expected number of runs \(E(R)\) is \(\frac{2n_1n_2}{n} + 1\), where \(n_1\) and \(n_2\) are the number of "B's" and "A's," respectively, and \(n\) is the total number of data points.
For randomness, the actual number of runs in the sequence is compared to this expected value, and statistical tests such as the Z-test are applied to evaluate the hypothesis of randomness.
Data sequence
In statistical analysis, a data sequence refers to an ordered list of data points. This ordered set can be used to analyze trends, patterns, and randomness within the data.
For instance, the given data sequence in the example:
  • 19.0, 27.0, 30.0, 24.3, 33.2, 27.5, 24.2, 18.0, 16.2, 8.3, 1.0, 0.0
demonstrates the percentage of clay content at different soil depths. Analyzing this sequence involves arranging the data in order and determining its properties such as the median.
Once the data is ordered \(0.0, 1.0, 8.3, 16.2, 18.0, 19.0, 24.2, 24.3, 27.0, 27.5, 30.0, 33.2\), it becomes easier to convert the sequence into symbols based on whether values fall above or below the median.
This conversion simplifies further statistical analyses, like assessing randomness or identifying runs within the sequence.
Z-test
The Z-test is a statistical method used to determine whether there is a significant difference between an observed value and an expected value. It is a type of hypothesis test that allows researchers to infer whether deviations are due to chance or indicate genuine effects.
In the context of testing randomness, the Z-test helps compare the actual number of runs in a sequence with the number of runs expected if the sequence were random. The formula for the Z-score in this situation is \( Z = \frac{R - E(R)}{\sqrt{\sigma^2(R)}} \), where \( R \) is the observed number of runs, \( E(R) \) is the expected number of runs, and \( \sigma^2(R) \) is the variance of the number of runs.
If the computed Z-value is beyond the critical value at a chosen significance level (for example, \(\alpha = 0.05\)), the null hypothesis of randomness is rejected. In this exercise, a Z-value of \(-10.38\) far exceeds the critical Z-value of \(\pm 1.96\), indicating the sequence is not random regarding the median.

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

An army psychologist gave a random sample of seven soldiers a test to measure sense of humor and another test to measure aggressiveness. Higher scores mean greater sense of humor or more aggressiveness. $$ \begin{array}{l|rrrrrrr} \hline \text { Soldier } & 1 & 2 & 3 & 4 & 5 & 6 & 7 \\ \hline \text { Score on humor test } & 60 & 85 & 78 & 90 & 93 & 45 & 51 \\ \text { Score on aggressiveness test } & 78 & 42 & 68 & 53 & 62 & 50 & 76 \\ \hline \end{array} $$ (i) Ranking the data with rank 1 for highest score on a test, make a table of ranks to be used in a Spearman rank correlation test. (ii) Using a \(0.05\) level of significance, test the claim that rank in humor has a monotone-decreasing relation to rank in aggressiveness.

Are yields for organic farming different from conventional farming yields? Independent random samples from method A (organic farming) and method B (conventional farming) gave the following information about yield of sweet corn (in tons/acre) (Reference: Agricultural Statistics, U.S. Department of Agriculture). $$ \begin{array}{l|llllllllllll} \hline \text { Method A } & 6.88 & 6.86 & 7.12 & 5.91 & 6.80 & 6.92 & 6.25 & 6.98 & 7.21 & 7.33 & 5.85 & 6.72 \\ \hline \text { Method B } & 5.71 & 6.93 & 7.05 & 7.15 & 6.79 & 6.87 & 6.45 & 7.34 & 5.68 & 6.78 & 6.95 & \\ \hline \end{array} $$ Use a \(5 \%\) level of significance to test the claim that there is no difference between the yield distributions.

Many economists and financial experts claim that the price level of a stock or bond is not random; rather, the price changes tend to follow a random sequence over time. The following data represent annual percentage returns on Vanguard Total Stock Index for a sequence of recent years. This fund represents nearly all publicly traded U.S. stocks. $$ \begin{array}{rrrrrrrrr} 10.4 & 10.6 & -0.2 & 35.8 & 21.0 & 31.0 & 23.3 & 23.8 & -10.6 \\ -11.0 & -21.0 & 12.8 & & & & & & \end{array} $$ (i) Convert this sequence of numbers to a sequence of symbols \(\mathrm{A}\) and \(\mathrm{B}\), where A indicates a value above the median and B a value below the median. (ii) Test the sequence for randomness about the median. Use \(\alpha=0.05\).

When applying the rank-sum test, do you need independent or dependent samples?

To compare two elementary schools regarding teaching of reading skills, 12 sets of identical twins were used. In each case, one child was selected at random and sent to school A, and his or her twin was sent to school B. Near the end of fifth grade, an achievement test was given to each child. The results follow: $$ \begin{array}{l|cccccc} \hline \text { Twin Pair } & 1 & 2 & 3 & 4 & 5 & 6 \\ \hline \text { School A } & 177 & 150 & 112 & 95 & 120 & 117 \\ \hline \text { School B } & 86 & 135 & 115 & 110 & 116 & 84 \\ \hline \end{array} $$ $$ \begin{array}{l|cccccc} \hline \text { Twin Pair } & 7 & 8 & 9 & 10 & 11 & 12 \\ \hline \text { School A } & 86 & 111 & 110 & 142 & 125 & 89 \\ \hline \text { School B } & 93 & 77 & 96 & 130 & 147 & 101 \\ \hline \end{array} $$ Use a \(0.05\) level of significance to test the hypothesis that the two schools have the same effectiveness in teaching reading skills against the alternate hypothesis that the schools are not equally effective.

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.