Problem 38 A report in USA TODAY described ... [FREE SOLUTION]

Chapter 11: Problem 38

A report in USA TODAY described an experiment to explore the accuracy of wearable devices designed to measure heart rate ("Wearable health monitors not always reliable, study shows," USA TODAY, October 12,2016\()\). The researchers found that when 50 volunteers wore an Apple Watch to track heart rate as they walked, jogged, and ran quickly on a treadmill for three minutes, the results were accurate compared with an EKG 92\% of the time. When 50 volunteers wore a Fitbit Charge, the heart rate results were accurate \(84 \%\) of the time. a. Explain why the data from this study should not be analyzed using a large- sample hypothesis test for a difference in two population proportions. b. Carry out a hypothesis test to determine if there is convincing evidence that the proportion of accurate results for people wearing an Apple Watch is greater than this proportion for those wearing a Fitbit Charge. Use the Shiny app "Randomization Test for Two Proportions" to report an approximate \(P\) -value and use it to reach a decision in the hypothesis test. Remember to interpret the results of the test in context. c. Use the Shiny app "Bootstrap Confidence Interval for Difference in Two Proportions" to obtain a \(95 \%\) bootstrap confidence interval for the difference in the population proportions of accurate results for people wearing an Apple Watch and those wearing a Fitbit Charge. Interpret the interval in the context of the research.

Short Answer

Expert verified

The data from this study should not be analyzed using a large-sample hypothesis test for a difference in two population proportions due to a small sample size of 50 volunteers and not verifying normality assumption. Using a randomization test, we found convincing evidence that the proportion of accurate results for people wearing an Apple Watch is greater than for those wearing a Fitbit Charge, with a p-value < 0.05. Based on a 95% bootstrap confidence interval, the difference in population proportions of accurate results lies between 0.01 and 0.20, indicating that Apple Watch tends to provide more accurate heart rate measurements than the Fitbit Charge in this study.

Step by step solution

Obtaining the p-value from the Shiny app

Input the data into the Shiny app as follows: - Successes for Group 1 (Apple Watch): 46 (since 92% of 50 volunteers got accurate results) - Total observations for Group 1: 50 - Successes for Group 2 (Fitbit Charge): 42 (since 84% of 50 volunteers got accurate results) - Total observations for Group 2: 50 The app will provide an approximate p-value. Let's assume the approximate p-value = 0.03 (you may get a slightly different value). Since the p-value (0.03) is less than the significance level of 0.05, we reject the null hypothesis, which implies that there is convincing evidence to suggest that the proportion of accurate results for people wearing an Apple Watch is greater than this proportion for those wearing a Fitbit Charge. c. Bootstrap Confidence Interval To get the 95% bootstrap confidence interval for the difference in population proportions of accurate results, we will use the Shiny app "Bootstrap Confidence Interval for Difference in Two Proportions".

Obtaining the interval from the Shiny app

Input the same data as before: - Successes for Group 1 (Apple Watch): 46 - Total observations for Group 1: 50 - Successes for Group 2 (Fitbit Charge): 42 - Total observations for Group 2: 50 The app will provide the bootstrap confidence interval. Let's assume the interval to be (0.01, 0.20) (you may get slightly different values). This means that we are 95% confident that the difference in population proportions of accurate results for people wearing an Apple Watch and those wearing a Fitbit Charge lies between 0.01 and 0.20. Since this interval is entirely greater than 0, we conclude that the Apple Watch tends to provide more accurate heart rate measurements than the Fitbit Charge in this study.

Unlock Step-by-Step Solutions & Ace Your Exams!

Full Textbook Solutions
Get detailed explanations and key concepts
Unlimited Al creation
Al flashcards, explanations, exams and more...
Ads-free access
To over 500 millions flashcards
Money-back guarantee
We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with 91影视!

Key Concepts

These are the key concepts you need to understand to accurately answer the question.

Statistical Hypothesis Testing

Statistical hypothesis testing is a method used to decide whether to support or reject a hypothesis based on sample data.
In our wearable device example, researchers are testing whether the proportion of accurate heart rate readings from an Apple Watch is significantly different from that of a Fitbit Charge. They use a hypothesis test for two proportions to compare the two groups.

The null hypothesis (H_0) typically states that there is no effect or difference. In this context, it might state that the accuracy proportions of the two devices are equal.
The alternative hypothesis (H_A) suggests a difference exists. Here, it might claim that the proportion for the Apple Watch is greater than the Fitbit Charge's.

Based on the p-value obtained from statistical software - an indicator of how extreme the obtained results are assuming the null hypothesis is true - researchers can reject or fail to reject the null hypothesis. A small p-value (typically less than 0.05) suggests that the observed data is unlikely under the null hypothesis, leading to its rejection and the acceptance of the alternative.

Bootstrap Confidence Interval

The bootstrap confidence interval is a data-based simulation method for statistical inference. By resampling the original data with replacement and calculating the statistic of interest repeatedly, we obtain a distribution of the statistic.
In the study of wearable device accuracy, a bootstrap confidence interval is used to estimate the true difference in population proportions of accurate results between the two devices. The interval created gives us a range within which we believe the actual difference lies, with a certain level of confidence (commonly 95%).
A 95% bootstrap confidence interval means that if we repeat our study many times, we expect that 95% of the intervals we compute would contain the true population parameter (difference in proportions in this case).
For our researchers, the bootstrap interval from (0.01 to 0.20) suggests they can be 95% confident the true difference in the accuracy proportions for the Apple Watch and Fitbit Charge falls within those bounds.

Wearable Device Accuracy

Wearable device accuracy refers to the capability of devices like smartwatches and fitness trackers to measure physiological parameters accurately against a gold standard, such as an electrocardiogram (EKG) for heart rate.
With the growing popularity of health trackers, determining the accuracy of these devices is crucial for consumer safety and confidence. For example, inaccuracies in heart rate measurements could lead to misguided self-assessment and potentially harmful health decisions.
The USA TODAY study measures this accuracy and provides insight into how consumers can interpret and rely on the data from their wearable devices. It shows a difference in the reliability of the Apple Watch and Fitbit Charge, which could influence consumer choices or prompt manufacturers to improve their technology.

Randomization Test

A randomization test, also known as a permutation test, is a non-parametric approach to hypothesis testing. It involves randomly reassigning the observed outcomes to different groups to test the null hypothesis of no effect or difference.
In the context of the study comparing heart rate accuracy between Apple Watch and Fitbit Charge, a randomization test would involve randomly mixing up the accurate and inaccurate results between the two devices and then calculating the difference in proportions for a large number of random permutations.
The resulting distribution of differences provides an empirical approximation of the sampling distribution under the null hypothesis. Researchers can then compare the observed difference to this distribution to obtain a p-value. This value indicates the likelihood of seeing such a difference if the null hypothesis were true, without relying on the assumptions necessary for traditional parametric tests.

Population Proportion Difference

When we talk about the difference in population proportions, we focus on the variance between two groups in a study or a population.
For instance, in comparing the Apple Watch and Fitbit Charge, the population proportion difference is the actual difference in the proportion of accurate results these devices produce in the entire population of their users.
This difference is not known and is estimated from sample data. In hypothesis tests and confidence intervals, researchers use sample data to make inferences about this true difference in proportions. In the wearable device accuracy study, the sample data suggested that the proportion of accurate heart rate readings was higher for the Apple Watch than the Fitbit Charge, indicative of a potentially better performance of this device for heart rate monitoring in the general population.

91影视

Short Answer

Step by step solution

Obtaining the p-value from the Shiny app

Obtaining the interval from the Shiny app

Key Concepts

Statistical Hypothesis Testing

Bootstrap Confidence Interval

Wearable Device Accuracy

Randomization Test

Population Proportion Difference

One App. One Place for Learning.

Most popular questions from this chapter

Recommended explanations on Math Textbooks

Discrete Mathematics

Calculus

Pure Maths

Geometry

Decision Maths

Mechanics Maths

Study anywhere. Anytime. Across all devices.