Problem 1
Aw, nuts! A company claims that each batch of its deluxe mixed nuts contains \(52 \%\) cashews, \(27 \%\) almonds, \(13 \%\) macadamia nuts, and \(8 \%\) brazil nuts. To test this claim, a quality-control inspector takes a random sample of 150 nuts from the latest batch. The one-way table below displays the sample data. $$ \begin{array}{lcccc} \hline \text { Nut: } & \text { Cashew } & \text { Almond } & \text { Macadamia } & \text { Brazil } \\ \text { Count: } & 83 & 29 & 20 & 18 \\ \hline \end{array} $$ (a) State appropriate hypotheses for performing a test of the company's claim. (b) Calculate the expected counts for each type of nut. Show your work.
Problem 2
Roulette Casinos are required to verify that their games operate as advertised. American roulette wheels have 38 slots -18 red, 18 black, and 2 green. In one casino, managers record data from a random sample of 200 spins of one of their American roulette wheels. The one-way table below displays the results. $$ \begin{array}{lccc} \hline \text { Color: } & \text { Red } & \text { Black } & \text { Green } \\\ \text { Count: } & 85 & 99 & 16 \\ \hline \end{array} $$ (a) State appropriate hypotheses for testing whether these data give convincing evidence that the distribution of outcomes on this wheel is not what it should be. (b) Calculate the expected counts for each color. Show your work.
Problem 9
No chi-square A school's principal wants to know if students spend about the same amount of time on homework each night of the week. She asks a random sample of 50 students to keep track of their homework time for a week. The following table displays the average amount of time (in minutes) students reported per night: $$ \begin{array}{lccccccc} \hline \text { Night: } & \text { Sunday } & \text { Monday } & \text { Tuesday } & \text { Wednesday } & \text { Thursday } & \text { Friday } & \text { Saturday } \\ \text { Average } & 130 & 108 & 115 & 104 & 99 & 37 & 62 \\ \text { time: } & & & & & & & \\ \hline \end{array} $$ Explain carefully why it would not be appropriate to perform a chi-square test for goodness of fit using these data.
Problem 11
Benford's law Faked numbers in tax returns, invoices, or expense account claims often display patterns that aren't present in legitimate records. Some patterns are obvious and easily avoided by a clever crook. Others are more subtle. It is a striking fact that the first digits of numbers in legitimate records often follow a model known as Benford's law. \({ }^{3}\) Call the first digit of a randomly chosen record \(X\) for short. Benford's law gives this probability model for \(X\) (note that a first digit can't be 0 ): $$ \begin{array}{lccccccccc} \hline \text { First digit: } & 1 & 2 & 3 & 4 & 5 & 6 & 7 & 8 & 9 \\ \text { Probability: } & 0.301 & 0.176 & 0.125 & 0.097 & 0.079 & 0.067 & 0.058 & 0.051 & 0.046 \\ \hline \end{array} $$ A forensic accountant who is familiar with Benford's law inspects a random sample of 250 invoices from a company that is accused of committing fraud. The table below displays the sample data. $$ \begin{array}{lcrrrrrrrr} \hline \text { First digit: } & 1 & 2 & 3 & 4 & 5 & 6 & 7 & 8 & 9 \\ \text { Count: } & 61 & 50 & 43 & 34 & 25 & 16 & 7 & 8 & 6 \\ \hline \end{array} $$ (a) Are these data inconsistent with Benford's law? Carry out an appropriate test at the \(\alpha=0.05\) level to support your answer. If you find a significant result, perform a follow-up analysis. (b) Describe a Type I error and a Type II error in this setting, and give a possible consequence of each. Which do you think is more serious?
Problem 12
Housing According to the Census Bureau, the distribution by ethnic background of the New York City population in a recent year was Hispanic: \(28 \%\) Black: \(24 \% \quad\) White: \(35 \%\) Asian: \(12 \%\) Others: \(1 \%\) The manager of a large housing complex in the city wonders whether the distribution by race of the complex's residents is consistent with the population distribution. To find out, she records data from a random sample of 800 residents. The table below displays the sample data. \({ }^{4}\) $$ \begin{array}{lccccc} \hline \text { Race: } & \text { Hispanic } & \text { Black } & \text { White } & \text { Asian } & \text { 0ther } \\ \text { Count: } & 212 & 202 & 270 & 94 & 22 \\ \hline \end{array} $$ Are these data significantly different from the city's distribution by race? Carry out an appropriate test at the \(\alpha=0.05\) level to support your answer. If you find a significant result, perform a follow-up analysis.
Problem 13
Skittles Statistics teacher Jason Molesky contacted Mars, Inc., to ask about the color distribution for Skittles candies. Here is an excerpt from the response he received: "The original flavor blend for the SKITTLES BITE SIZE CANDIES is lemon, lime, orange, strawberry and grape. They were chosen as a result of consumer preference tests we conducted. The flavor blend is 20 percent of each flavor." (a) State appropriate hypotheses for a significance test of the company's claim. (b) Find the expected counts for a bag of Skittles with 60 candies. (c) How large a \(\chi^{2}\) statistic would you need to have significant evidence against the company's claim at the \(\alpha=0.05\) level? At the \(\alpha=0.01\) level? (d) Create a set of observed counts for a bag with 60 candies that gives a \(P\) -value between 0.01 and \(0.05 .\) Show the calculation of your chi-square statistic.
Problem 14
Is your random number generator working? Use your calculator's RandInt function to generate 200 digits from 0 to 9 and store them in a list. (a) State appropriate hypotheses for a chi-square test for goodness of fit to determine whether your calculator's random number generator gives each digit an equal chance to be generated. (b) Carry out a test at the \(\alpha=0.05\) significance level. For parts (c) and (d), assume that the students' random number generators are all working properly. (c) What is the probability that a student who does this exercise will make a Type I error? (d) Suppose that 25 students in an AP Statistics class independently do this exercise for homework. Find the probability that at least one of them makes a Type I error.
Problem 19
Multiple choice: Select the best answer for Exercises 19 to 22 Exercises 19 to 21 refer to the following setting. The manager of a high school cafeteria is planning to offer several new types of food for student lunches in the following school year. She wants to know if each type of food will be equally popular so she can start ordering supplies and making other plans. To find out, she selects a random sample of 100 students and asks them, "Which type of food do you prefer: Asian food, Mexican food, pizza, or hamburgers?" Here are her data: $$ \begin{array}{lcccc} \hline \text { Type of Food: } & \text { Asian } & \text { Mexican } & \text { Pizza } & \text { Hamburgers } \\ \text { Count: } & 18 & 22 & 39 & 21 \\ \hline \end{array} $$ An appropriate null hypothesis to test whether the food choices are equally popular is (a) \(H_{0}: \mu=25,\) where \(\mu=\) the mean number of students that prefer each type of food. (b) \(H_{0}: p=0.25,\) where \(p=\) the proportion of all students who prefer Asian food. (c) \(H_{0}: n_{A}=n_{M}=n_{P}=n_{H}=25,\) where \(n_{A}\) is the number of students in the school who would choose Asian food, and so on. (d) \(H_{0}: p_{A}=p_{M}=p_{P}=p_{H}=0.25,\) where \(p_{A}\) is the proportion of students in the school who would choose Asian food, and so on. (e) \(\quad H_{0}: \hat{p}_{\mathrm{A}}=\hat{p}_{M}=\hat{p}_{P}=\hat{p}_{H}=0.25,\) where \(\hat{p}_{\mathrm{A}}\) is the pro- portion of students in the sample who chose Asian food, and so on.
Problem 20
Refer to the following setting. The manager of a high school cafeteria is planning to offer several new types of food for student lunches in the following school year. She wants to know if each type of food will be equally popular so she can start ordering supplies and making other plans. To find out, she selects a random sample of 100 students and asks them, "Which type of food do you prefer: Asian food, Mexican food, pizza, or hamburgers?" Here are her data: $$ \begin{array}{lcccc} \hline \text { Type of Food: } & \text { Asian } & \text { Mexican } & \text { Pizza } & \text { Hamburgers } \\ \text { Count: } & 18 & 22 & 39 & 21 \\ \hline \end{array} $$ (a) \(\frac{(18-25)^{2}}{25}+\frac{(22-25)^{2}}{25}+\frac{(39-25)^{2}}{25}+\frac{(21-25)^{2}}{25}\) (b) \(\frac{(25-18)^{2}}{18}+\frac{(25-22)^{2}}{22}+\frac{(25-39)^{2}}{39}+\frac{(25-21)^{2}}{21}\) (c) \(\frac{(18-25)}{25}+\frac{(22-25)}{25}+\frac{(39-25)}{25}+\frac{(21-25)}{25}\) (d) \(\frac{(18-25)^{2}}{100}+\frac{(22-25)^{2}}{100}+\frac{(39-25)^{2}}{100}+\frac{(21-25)^{2}}{100}\) (e) \(\frac{(0.18-0.25)^{2}}{0.25}+\frac{(0.22-0.25)^{2}}{0.25}+\frac{(0.39-0.25)^{2}}{0.25}\) \(+\frac{(0.21-0.25)^{2}}{0.25}\) The chi-square statistic is
Problem 23
Exercises 23 through 25 refer to the following setting. Do students who read more books for pleasure tend to earn higher grades in English? The boxplots below show data from a simple random sample of 79 students at a large high school. Students were classified as light readers if they read fewer than 3 books for pleasure per year. Otherwise, they were classified as heavy readers. Each student's average English grade for the previous two marking periods was converted to a GPA scale where \(A+=4.3\), \(A=4.0, A-=3.7, B+=3.3,\) and so on. Reading and grades (1.3) Write a few sentences comparing the distributions of English grades for light and heavy readers.