Problem 8 The report "State of the News Me... [FREE SOLUTION]

Chapter 3: Problem 8

The report "State of the News Media $2015^{n}$ (Pew Research Center, April 29,2015 ) published the accompanying circulation numbers for 15 news magazines (such as Time and The New Yorker) for 2014: $$ \begin{array}{rrrrr} 3,284,012 & 1,469,223 & 1,214,590 & 1,046,977 & 993,043 \\ 931,228 & 905,755 & 843,914 & 783,353 & 574,370 \\ 483,360 & 412,062 & 147,808 & 119,297 & 41,518 \end{array} $$ Explain why the average may not be the best measure of a typical value for this data set.

Short Answer

Expert verified

In this data set, there is a substantial difference between the highest circulation number and the rest of the data points, which suggests a possible skewness in the distribution due to high-end outliers. The extremely high circulation number (3,284,012) significantly increases the average (approximately to 1,029,589), but a majority of the circulation numbers are much smaller than the average. This fact shows that the average value does not represent a "typical" circulation number, as it is quite larger than most of the data points. Therefore, the average is not the best measure of a typical value for this data set, and measures like median or mode could better represent the "typical" value in this case.

Step by step solution

Calculate the average of the given data set

To calculate the average, we should sum up all the circulation numbers and divide the total by the number of data points. Here, we have 15 data points. \[Average = \frac{\sum{circulation\,numbers}}{number\,of\,data\,points} \] \[Average = \frac{3,284,012 + 1,469,223 + 1,214,590 + 1,046,977 + 993,043 + 931,228 + 905,755 + 843,914 + 783,353 + 574,370 + 483,360 + 412,062 + 147,808 + 119,297 + 41,518}{15}\] \[Average \approx 1,029,589\]

Analyze the distribution of the circulation numbers

We can observe the following circulation numbers distribution (sorted in a decreasing order): 3,284,012; 1,469,223; 1,214,590; 1,046,977; 993,043; 931,228; 905,755; 843,914; 783,353; 574,370; 483,360; 412,062; 147,808; 119,297; 41,518 There is a substantial difference between the highest circulation number (3,284,012) and the rest of the data points. This fact suggests a possible skewness in the distribution, with a few high-end outliers possibly affecting the average.

Explain how the skewness affects the average

In our data set, the extremely high circulation number (3,284,012) affects the average, increasing it significantly (approximately to 1,029,589). If we look at the data, a majority of the circulation numbers are much smaller than the average. Therefore, the average value does not represent a "typical" circulation number in this case, as it is significantly larger than most of the data points. As a result, for this data set, the average is not the best measure of a typical value. Instead, measures like median or mode could better represent the "typical" value in this instance.

Unlock Step-by-Step Solutions & Ace Your Exams!

Full Textbook Solutions
Get detailed explanations and key concepts
Unlimited Al creation
Al flashcards, explanations, exams and more...
Ads-free access
To over 500 millions flashcards
Money-back guarantee
We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with 91影视!

Key Concepts

These are the key concepts you need to understand to accurately answer the question.

Measure of Central Tendency

When we talk about a measure of central tendency, we refer to a single value that represents the center point or typical value of a dataset. It aims to provide an accurate description of the entire data set with just one number. The three main measures of central tendency are the mean (average), median, and mode.

The mean is calculated by adding up all the numbers and then dividing by the count of numbers. However, the mean can be heavily influenced by outliers, which are values significantly higher or lower than the rest. In a balanced dataset, the mean is an excellent measure as it incorporates every value.

The median is the middle value when you order the data from least to greatest, and it is not affected by outliers. This makes it particularly useful in skewed distributions. It provides a more accurate 'typical' value in such cases.

The mode is the most frequently occurring value in the dataset. It can be useful in datasets with a high frequency of a particular value.

Average vs Median

The difference between the average (mean) and median can be significant, especially if the data is skewed. Average is calculated by summing all the numbers in a data set and dividing by the count of numbers, which can result in an unrepresentative measure if there are extreme numbers in the set (i.e., outliers). For instance, in a neighborhood with mostly modest houses but a couple of mansions, the average home price will be much higher because of the mansions' influence, but that average wouldn't reflect what most people pay for a house.

The median, being the middle value, effectively cuts the dataset in half, ensuring that half the values are below it and half above. This makes the median a better measure when there's a skewed distribution, as it is less affected by extremely high or low values. It more accurately reflects the 'typical' value that most data points are closer to.

For the given exercise data, with such a broad spread in circulation numbers and a clear gap between the leading figures and the majority, median provides a more accurate representation of a typical news magazine's circulation.

Skewed Distribution

A skewed distribution occurs when the data points cluster more on one side of the scale than the other, creating a long tail in the opposite direction of the cluster. If the tail is to the right, it's called a positive skew, and if it's to the left, it's a negative skew.

In a positively skewed distribution, the mean will be greater than the median, because the few high values pull the mean upwards. Conversely, in a negatively skewed distribution, the mean will be less than the median, influenced downward by the low values. Therefore, analysts prefer the median in skewed distributions because it's a better representation of where the middle of the data truly lies.

Returning to our exercise data, the presence of a single very high circulation number creates a positively skewed distribution. Using the mean here gives a false impression that the typical magazine has a higher circulation than what's true for most of the data set. That is why, for such data, other central tendency measures like median or mode are usually preferred.

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

91影视

Short Answer

Step by step solution

Calculate the average of the given data set

Analyze the distribution of the circulation numbers

Explain how the skewness affects the average

Key Concepts

Measure of Central Tendency

Average vs Median

Skewed Distribution

One App. One Place for Learning.

Most popular questions from this chapter

Recommended explanations on Math Textbooks

Calculus

Geometry

Logic and Functions

Probability and Statistics

Discrete Mathematics

Applied Mathematics

Study anywhere. Anytime. Across all devices.