Problem 9 Sand and clay studies were condu... [FREE SOLUTION]

91影视

Understandable Statistics : Concepts and Methods

Charles Henry Brase

$Math Studyset 91影视 Explanations$ Math

10 Edition

Chapter 11: Problem 9

Sand and clay studies were conducted at the West Side Field Station of the University of California (Reference: Professor D. R. Nielsen, University of California, Davis). Twelve consecutive depths, each about $15 \mathrm{~cm}$ deep, were studied and the following percentages of clay in the soil were recorded. $$ \begin{array}{llllllllllll} 19.0 & 27.0 & 30.0 & 24.3 & 33.2 & 27.5 & 24.2 & 18.0 & 16.2 & 8.3 & 1.0 & 0.0 \end{array} $$ (i) Convert this sequence of numbers to a sequence of symbols $\mathrm{A}$ and $\mathrm{B}$, where A indicates a value above the median and B a value below the median. (ii) Test the sequence for randomness about the median. Use $\alpha=0.05$.

Short Answer

Expert verified

The sequence is not random regarding the median, given the significant Z-value.

Step by step solution

Find the Median of the Data

First, arrange the data in ascending order: $0.0, 1.0, 8.3, 16.2, 18.0, 19.0, 24.2, 24.3, 27.0, 27.5, 30.0, 33.2$. There are 12 data points. The median will be the average of the 6th and 7th values, which are $19.0$ and $24.2$. Calculate the median as $\frac{19.0 + 24.2}{2} = 21.6$. Thus, the median is 21.6.

Convert to Symbols A and B

For each value, determine if it is above or below the median $21.6$. Values greater than 21.6 are replaced with 'A', and values less than or equal to 21.6 are replaced with 'B'. The converted sequence is: B, A, A, A, A, A, B, B, B, B, B, B.

Calculate the Runs in the Sequence

A run is a sequence of consecutive identical symbols. For the sequence: B, A, A, A, A, A, B, B, B, B, B, B, there are 2 runs of 'A' and 2 runs of 'B'. So, there are 2 runs in total: $5A, 7B$.

Check Randomness of the Sequence

Check if the number of runs is consistent with the randomness hypothesis at $\alpha = 0.05$. Use the formula for the expected number of runs: $E(R) = \frac{2n_1n_2}{n} + 1$, where $n_1=6$ (number of B's), $n_2=6$ (number of A's), and $n=12$. Find the variance $\sigma^2(R) = \frac{2n_1n_2(2n_1n_2 - n)}{n^2(n-1)}$. Substituting the values: $E(R) = \frac{2(6)(6)}{12} + 1 = 7$ and $\sigma^2(R) = \frac{2(6)(6)(72-12)}{12^2 \cdot 11} = \frac{360}{1584} \approx 0.227$.

Decision on Randomness

To decide if the sequence is random, compare the actual number of runs (2) to the expected runs $E(R) = 7$. Use the Z-test for runs: $Z = \frac{R - E(R)}{\sqrt{\sigma^2(R)}}$, $Z = \frac{2 - 7}{\sqrt{0.227}} \approx -10.38$. The critical Z-value for $\alpha = 0.05$ is $\pm 1.96$. Since $|Z| > 1.96$, the sequence is not random concerning the median.

Unlock Step-by-Step Solutions & Ace Your Exams!

Full Textbook Solutions
Get detailed explanations and key concepts
Unlimited Al creation
Al flashcards, explanations, exams and more...
Ads-free access
To over 500 millions flashcards
Money-back guarantee
We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with 91影视!

Key Concepts

These are the key concepts you need to understand to accurately answer the question.

Median

In statistics, the median is a crucial measure of central tendency. It represents the middle value in a data set when the values are ordered in ascending or descending order.
Unlike the mean, the median is less affected by extreme values or outliers, making it a more robust indicator of central location, especially in skewed distributions.
When calculating the median, if the data set has an odd number of values, the median is simply the middle number. However, if there is an even number of values, the median is the average of the two middle numbers.
For example, with the data set:

0.0, 1.0, 8.3, 16.2, 18.0, 19.0, 24.2, 24.3, 27.0, 27.5, 30.0, 33.2

The median will be the average of the 6th and 7th values because there are 12 entries, which are 19.0 and 24.2.
Thus, the median is calculated as $\frac{19.0 + 24.2}{2} = 21.6$. This median provides a cut-off point for splitting the data into two halves, which is useful for subsequent analysis like testing for randomness.

Randomness test

A randomness test is used to determine whether a sequence of data points can be considered random. It's essential in various fields like quality control and statistical analysis to ensure that no underlying pattern is skewing results.
In this exercise, we're testing the randomness of a data sequence concerning its median. By converting the data to symbols $A$ and $B$, where $A$ indicates values above the median and $B$ for values below, we can visually inspect the sequence for runs.
A run is a consecutive sequence of the same symbol, and the number of runs is calculated to test for randomness. The formula for the expected number of runs $E(R)$ is $\frac{2n_1n_2}{n} + 1$, where $n_1$ and $n_2$ are the number of "B's" and "A's," respectively, and $n$ is the total number of data points.
For randomness, the actual number of runs in the sequence is compared to this expected value, and statistical tests such as the Z-test are applied to evaluate the hypothesis of randomness.

Data sequence

In statistical analysis, a data sequence refers to an ordered list of data points. This ordered set can be used to analyze trends, patterns, and randomness within the data.
For instance, the given data sequence in the example:

19.0, 27.0, 30.0, 24.3, 33.2, 27.5, 24.2, 18.0, 16.2, 8.3, 1.0, 0.0

demonstrates the percentage of clay content at different soil depths. Analyzing this sequence involves arranging the data in order and determining its properties such as the median.
Once the data is ordered $0.0, 1.0, 8.3, 16.2, 18.0, 19.0, 24.2, 24.3, 27.0, 27.5, 30.0, 33.2$, it becomes easier to convert the sequence into symbols based on whether values fall above or below the median.
This conversion simplifies further statistical analyses, like assessing randomness or identifying runs within the sequence.

Z-test

The Z-test is a statistical method used to determine whether there is a significant difference between an observed value and an expected value. It is a type of hypothesis test that allows researchers to infer whether deviations are due to chance or indicate genuine effects.
In the context of testing randomness, the Z-test helps compare the actual number of runs in a sequence with the number of runs expected if the sequence were random. The formula for the Z-score in this situation is $ Z = \frac{R - E(R)}{\sqrt{\sigma^2(R)}} $, where $ R $ is the observed number of runs, $ E(R) $ is the expected number of runs, and $ \sigma^2(R) $ is the variance of the number of runs.
If the computed Z-value is beyond the critical value at a chosen significance level (for example, $\alpha = 0.05$), the null hypothesis of randomness is rejected. In this exercise, a Z-value of $-10.38$ far exceeds the critical Z-value of $\pm 1.96$, indicating the sequence is not random regarding the median.

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

91影视

Short Answer

Step by step solution

Find the Median of the Data

Convert to Symbols A and B

Calculate the Runs in the Sequence

Check Randomness of the Sequence

Decision on Randomness

Key Concepts

Median

Randomness test

Data sequence

Z-test

One App. One Place for Learning.

Most popular questions from this chapter

Recommended explanations on Math Textbooks

Calculus

Discrete Mathematics

Decision Maths

Geometry

Theoretical and Mathematical Physics

Statistics

Study anywhere. Anytime. Across all devices.