/*! This file is auto-generated */ .wp-block-button__link{color:#fff;background-color:#32373c;border-radius:9999px;box-shadow:none;text-decoration:none;padding:calc(.667em + 2px) calc(1.333em + 2px);font-size:1.125em}.wp-block-file__button{background:#32373c;color:#fff;text-decoration:none} Problem 83 An article in the Los Angeles Ti... [FREE SOLUTION] | 91Ó°ÊÓ

91Ó°ÊÓ

An article in the Los Angeles Times (Dec. 3, 1993) reports that 1 in 200 people carry the defective gene that causes inherited colon cancer. In a sample of 1000 individuals, what is the approximate distribution of the number who carry this gene? Use this distribution to calculate the approximate probability that a. Between 5 and 8 (inclusive) carry the gene. b. At least 8 carry the gene.

Short Answer

Expert verified
a) P(5 ≤ X ≤ 8) ≈ 0.503; b) P(X ≥ 8) ≈ 0.106.

Step by step solution

01

Define the Random Variable

We define the random variable \(X\) as the number of individuals in a sample of 1000 who carry the defective gene. Thus, we want to find the distribution of \(X\).
02

Identify the Distribution Type

Since each individual either carries the gene or not, this scenario fits a binomial distribution model where the probability of success \(p = \frac{1}{200}\) and the number of trials \(n = 1000\). Thus, \(X \sim \text{Binomial}(1000, \frac{1}{200})\).
03

Approximate with Normal Distribution

For large \(n\), the binomial distribution \(X\) can be approximated by a normal distribution if \(np\) and \(n(1-p)\) are greater than 5. Here, \(np = 5\) and \(n(1-p) = 995\), so we can use a normal approximation: \(X \sim N(5, \sqrt{np(1-p)})\). Compute the standard deviation: \(\sigma = \sqrt{1000 \times \frac{1}{200} \times \frac{199}{200}} \approx 2.2\).
04

Calculate Probability for Part (a)

To find \(P(5 \leq X \leq 8)\), convert the binomial range to a normal distribution using continuity correction. Calculate \(P(4.5 \leq X \leq 8.5)\) in the normal approximation: \[ P(4.5 < X < 8.5) \approx \frac{1}{2} \left[ 1 + \, \text{erf} \left(\frac{x - \mu}{\sigma \sqrt{2}}\right) \right] \text{where} \mu = 5, \sigma = 2.2 \].
05

Calculate Probability for Part (b)

To find \(P(X \geq 8)\), convert the boundary to the normal approximation using continuity correction: calculate \(P(X \geq 7.5)\). Use the cumulative distribution function: \[ P(X \geq 7.5) = 1 - P(X < 7.5) \]. Use the normal approximation to find \(P(X < 7.5)\) and subtract from 1.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with 91Ó°ÊÓ!

Key Concepts

These are the key concepts you need to understand to accurately answer the question.

Normal Approximation
When working with a binomial distribution, calculating probabilities directly can be challenging, especially when the number of trials is large. This is where the normal approximation becomes useful. The idea is to approximate the binomial distribution with a normal distribution because it simplifies calculations and allows us to use techniques developed for normal distributions.

For a binomial distribution with parameters \( n \) (number of trials) and \( p \) (probability of success), we can use a normal distribution to approximate it if \( np \) and \( n(1-p) \) are both greater than 5.
  • The mean (\( \mu \)) of the normal approximation is the same as the binomial mean: \( np \).
  • The standard deviation (\( \sigma \)) is calculated by \( \sqrt{np(1-p)} \).
This allows the binomial distribution \( B(n, p) \) to be approximated by the normal distribution \( N(np, \sqrt{np(1-p)}) \).

In our exercise, since \( np = 5 \) and \( n(1-p) = 995 \), we meet this condition, which makes the normal approximation valid.
Continuity Correction
In the context of the normal approximation to the binomial distribution, a continuity correction is used to improve the accuracy of calculation. The continuity correction accounts for the fact that a binomial distribution is discrete (individual, separate pieces) while a normal distribution is continuous (smooth and flowing everywhere).

To apply the continuity correction, we adjust our range of interest by 0.5 units. This ensures that the approximated area under the curve is more aligned with the discrete nature of the original binomial distribution.
  • If calculating a probability like \( P(a \leq X \leq b) \), you adjust it to \( P(a - 0.5 \leq X \leq b + 0.5) \).
  • For probabilities such as \( P(X \geq k) \), adjust to \( P(X \geq k - 0.5) \).
  • For \( P(X < k) \), adjust to \( P(X < k + 0.5) \).
So in exercise part (a), \( P(5 \leq X \leq 8) \) adjusts to \( P(4.5 \leq X \leq 8.5) \), and for part (b), \( P(X \geq 8) \) adjusts to \( P(X \geq 7.5) \), making our approximation more precise.
Cumulative Distribution Function
The cumulative distribution function (CDF) helps determine the probability that a random variable is less than or equal to a specific value. For a continuous probability distribution, the CDF is expressed as \( F(x) = P(X \leq x) \). This function is extremely helpful in finding probabilities in our exercise when dealing with normal approximations.

Once we apply the continuity correction and convert our problem to a normal distribution setup, we utilize the CDF to find these probabilities.
  • For part (a) of the exercise, we calculate \( P(4.5 < X < 8.5) \), which involves finding \( F(8.5) - F(4.5) \).
  • For part (b), \( P(X \geq 7.5) = 1 - P(X < 7.5) \), which uses the CDF to find \( F(7.5) \) and subtracting it from 1.
Computing these CDF values for a normal distribution often involves looking up standard normal distribution tables or using computational tools to find the required probabilities.
Random Variable
A random variable is a concept that helps describe numerical outcomes of random processes. In probability and statistics, it can be thought of as a function that assigns a real number to each outcome of a statistical experiment. In our gene-carrying example, the random variable \( X \) represents the number of people in a sample of 1000 who carry a specific defective gene.

Understanding the nature of a random variable is crucial because it sets the stage for identifying the distribution type and computing probabilities based on that distribution.
  • Discrete random variables take on specific, isolated values, like our variable \( X \) which counts people.
  • Continuous random variables can assume any value within a given range and are typically associated with measurements.
Therefore, by defining \( X \) in the context of this exercise as the number of gene carriers among 1000 individuals, it becomes possible to apply the binomial distribution, and subsequently, approximate it using a normal distribution for probability calculations.

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

A trial has just resulted in a hung jury because eight members of the jury were in favor of a guilty verdict and the other four were for acquittal. If the jurors leave the jury room in random order and each of the first four leaving the room is accosted by a reporter in quest of an interview, what is the pmf of \(X=\) the number of jurors favoring acquittal among those interviewed? How many of those favoring acquittal do you expect to be interviewed?

Consider a disease whose presence can be identified by carrying out a blood test. Let \(p\) denote the probability that a randomly selected individual has the disease. Suppose \(n\) individuals are independently selected for testing. One way to proceed is to carry out a separate test on each of the \(n\) blood samples. A potentially more economical approach, group testing, was introduced during World War II to identify syphilitic men among army inductees. First, take a part of each blood sample, combine these specimens, and carry out a single test. If no one has the disease, the result will be negative, and only the one test is required. If at least one individual is diseased, the test on the combined sample will yield a positive result, in which case the \(n\) individual tests are then carried out. If \(p=.1\) and \(n=3\), what is the expected number of tests using this procedure? What is the expected number when \(n=5\) ? [The article "Random Multiple-Access Communication and Group Testing" (IEEE Trans. on Commun., 1984: 769-774) applied these ideas to a communication system in which the dichotomy was active/idle user rather than diseased/nondiseased.]

The College Board reports that \(2 \%\) of the 2 million high school students who take the SAT each year receive special accommodations because of documented disabilities (Los Angeles Times, July 16, 2002). Consider a random sample of 25 students who have recently taken the test. a. What is the probability that exactly 1 received a special accommodation? b. What is the probability that at least 1 received a special accommodation? c. What is the probability that at least 2 received a special accommodation? d. What is the probability that the number among the 25 who received a special accommodation is within 2 standard deviations of the number you would expect to be accommodated? e. Suppose that a student who does not receive a special accommodation is allowed 3 hours for the exam, whereas an accommodated student is allowed \(4.5\) hours. What would you expect the average time allowed the 25 selected students to be?

a. Show that \(b(x ; n, 1-p)=b(n-x ; n, p)\). b. Show that \(B(x ; n, 1-p)=1-B(n-x-1 ; n, p)\). [Hint: At most \(x S\) 's is equivalent to at least \((n-x) F\) 's.] c. What do parts (a) and (b) imply about the necessity of including values of \(p\) greater than \(.5\) in Appendix Table A.1?

Suppose that you read through this year's issues of the \(N e w\) York Times and record each number that appears in a news article-the income of a CEO, the number of cases of wine produced by a winery, the total charitable contribution of a politician during the previous tax year, the age of a celebrity, and so on. Now focus on the leading digit of each number, which could be \(1,2, \ldots, 8\), or 9 . Your first thought might be that the leading digit \(X\) of a randomly selected number would be equally likely to be one of the nine possibilities (a discrete uniform distribution). However, much empirical evidence as well as some theoretical arguments suggest an alternative probability distribution called Benford's law: \(p(x)=P(1\) st digit is \(x)=\log _{10}(1+1 / x) x=1,2, \ldots, 9\) a. Compute the individual probabilities and compare to the corresponding discrete uniform distribution. b. Obtain the cdf of \(X\). c. Using the cdf, what is the probability that the leading digit is at most 3 ? At least 5 ? [Note: Benford's law is the basis for some auditing procedures used to detect fraud in financial reporting-for example, by the Internal Revenue Service.]

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.