Problem 7

Entropy can be viewed as a measure of the lack of information you have about a system. Claude Shannon [112] realized, back in the 1940s, that communication over telephone wires amounts to reducing the listener's uncertainty about the sender's message, and introduced a definition of information entropy. Most natural languages (voice, written English) are highly redundant; the number of intelligible fifty-letter sentences is far fewer than \(26^{50}\), and the number of ten-second phone conversations is far smaller than the number of sound signals that could be generated with frequencies up to \(20{,}000\,\mathrm{Hz}\).\({}^{44}\) Shannon, knowing statistical mechanics, defined the entropy of an ensemble of messages: if there are \(N\) possible messages that can be sent in one package, and message \(m\) is being transmitted with probability \(p_m\), then Shannon's entropy is $$ S_I = -k_S \sum_{m=1}^{N} p_m \log p_m \tag{5.38} $$ where, instead of Boltzmann's constant, Shannon picked \(k_S = 1/\log 2\).

This immediately suggests a theory for signal compression. If you can recode the alphabet so that common letters and common sequences of letters are abbreviated, while infrequent combinations are spelled out in lengthy fashion, you can dramatically reduce the channel capacity needed to send the data. (This is lossless compression, like zip and gz and gif.)

An obscure language A'bç! for long-distance communication has only three sounds: a hoot represented by A, a slap represented by B, and a click represented by C. In a typical message, hoots and slaps occur equally often \((p = 1/4)\), but clicks are twice as common \((p = 1/2)\). Assume the messages are otherwise random.

(a) What is the Shannon entropy in this language? More specifically, what is the Shannon entropy rate (entropy per sound, or letter, transmitted)?

(b) Show that a communication channel transmitting bits (ones and zeros) can transmit no more than one unit of Shannon entropy per bit. (Hint: this should follow by showing that, for \(N = 2^n\) messages, equation 5.38 is maximized by \(p_m = 1/N\). You needn't prove it's a global maximum: check that it is a local extremum. You'll need either a Lagrange multiplier or will need to explicitly set \(p_N = 1 - \sum_{m=1}^{N-1} p_m\).)

(c) In general, argue that the Shannon entropy gives the minimum number of bits needed to transmit the ensemble of messages. (Hint: compare the Shannon entropy of the \(N\) original messages with the Shannon entropy of the \(N\) (shorter) encoded messages.) Calculate the minimum number of bits per letter on average needed to transmit messages for the particular case of an A'bç! communication channel.

(d) Find a compression scheme (a rule that converts an A'bç! message to zeros and ones, that can be inverted to give back the original message) that is optimal, in the sense that it saturates the bound you derived in part (b). (Hint: look for a scheme for encoding the message that compresses one letter at a time. Not all letters need to compress to the same number of bits.)

Shannon also developed a measure of the channel capacity of a noisy wire, and discussed error correction codes...

Short Answer

(a) The Shannon entropy rate is 1.5 bits per sound. (b) A channel transmitting bits carries at most one unit of Shannon entropy per bit. (c) The minimum is therefore 1.5 bits per letter on average. (d) Encoding A as '10', B as '11', and C as '0' achieves this optimum.

Step by step solution

01

Determine Probabilities

For the language A'bç!, there are three sounds: A (hoot), B (slap), and C (click). Since A and B occur equally often while C is twice as common, the probabilities are \( p_A = p_B = 1/4 \) and \( p_C = 1/2 \).
02

Shannon Entropy Formula

The Shannon entropy \( S_I \) is \[ S_I = -k_S \sum_{m=1}^N p_m \log p_m = -\sum_{m=1}^N p_m \log_2 p_m, \] since the choice \( k_S = 1/\log 2 \) converts natural logarithms into base-2 logarithms. We substitute the probabilities \( p_A = 1/4 \), \( p_B = 1/4 \), and \( p_C = 1/2 \).
03

Calculate Shannon Entropy for A'bç!

Calculating each term: \[ S_I = -\left( \tfrac{1}{4} \log_2 \tfrac{1}{4} + \tfrac{1}{4} \log_2 \tfrac{1}{4} + \tfrac{1}{2} \log_2 \tfrac{1}{2} \right) = -\left( \tfrac{1}{4}(-2) + \tfrac{1}{4}(-2) + \tfrac{1}{2}(-1) \right) = 1.5 \text{ bits}. \] Thus the Shannon entropy rate for this language is 1.5 bits per sound (letter) transmitted.
04

Maximum Entropy for Binary Channel

A channel sending \( n \) bits can carry \( N = 2^n \) distinct messages. Extremize the entropy subject to the normalization constraint \( \sum_m p_m = 1 \), either with a Lagrange multiplier or by explicitly setting \( p_N = 1 - \sum_{m=1}^{N-1} p_m \); the extremum is at equal probabilities \( p_m = 1/N \), giving \( S_I = \log_2 N = n \) bits. The channel therefore transmits at most one unit of Shannon entropy per bit. For a single bit, \( p_0 = p_1 = 1/2 \) and \( S_I = -2 \left( \tfrac{1}{2} \log_2 \tfrac{1}{2} \right) = 1 \text{ bit} \).
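For completeness, a minimal sketch of the Lagrange-multiplier extremization (\(\lambda\) enforces the normalization \(\sum_m p_m = 1\)): \[ \mathcal{L} = -k_S \sum_{m=1}^{N} p_m \log p_m - \lambda \Big( \sum_{m=1}^{N} p_m - 1 \Big), \qquad \frac{\partial \mathcal{L}}{\partial p_m} = -k_S \left( \log p_m + 1 \right) - \lambda = 0. \] The stationarity condition is the same for every \(m\), so all the \(p_m\) are equal, and normalization gives \(p_m = 1/N\). Then \(S_I = k_S \log N = \log_2 N = n\) bits for \(N = 2^n\) messages carried by \(n\) bits: at most one unit of Shannon entropy per bit, as claimed.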
05

Compare Entropy of Encoded Messages

The Shannon entropy is the minimum average number of bits needed to transmit the ensemble: an invertible encoding preserves the probabilities \( p_m \) and hence the entropy of the ensemble, while part (b) shows each transmitted bit carries at most one unit of entropy, so the average encoded length cannot fall below \( S_I \). The average length of any letter-by-letter scheme is \[ \text{average bits/letter} = \sum_m p_m \times (\text{codeword length of letter } m). \] Given the entropy computed in step 3, no encoding of A'bç! can use fewer than 1.5 bits per letter on average.
06

Optimal Compression Scheme

Translate the sounds into binary so as to match the entropy rate: give the most frequent sound (C) the shortest codeword and the rarer sounds (A, B) longer ones. Encode A as '10', B as '11', and C as '0'. No codeword is a prefix of another, so the bit stream decodes unambiguously back to the original message, and the average length is \( \tfrac{1}{4} \cdot 2 + \tfrac{1}{4} \cdot 2 + \tfrac{1}{2} \cdot 1 = 1.5 \) bits per letter, saturating the bound from part (b).
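As a concrete check, here is a minimal sketch of this encoder and decoder in Python (the helper names and test message are our own, not from the text):

```python
# Sketch: the prefix code A -> '10', B -> '11', C -> '0' is invertible
# and averages 1.5 bits per letter, saturating the Shannon bound.
CODE = {"A": "10", "B": "11", "C": "0"}
PROB = {"A": 0.25, "B": 0.25, "C": 0.5}

def encode(message):
    """Concatenate the codeword for each letter."""
    return "".join(CODE[letter] for letter in message)

def decode(bits):
    """Greedy decoding works because no codeword is a prefix of another."""
    inverse = {codeword: letter for letter, codeword in CODE.items()}
    letters, buffer = [], ""
    for bit in bits:
        buffer += bit
        if buffer in inverse:        # a complete codeword has been read
            letters.append(inverse[buffer])
            buffer = ""
    assert buffer == "", "trailing bits did not form a codeword"
    return "".join(letters)

message = "CACBCC"
assert decode(encode(message)) == message            # the code is invertible
print(sum(PROB[s] * len(CODE[s]) for s in CODE))     # 1.5 bits per letter
```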


Key Concepts

These are the key concepts you need to understand to accurately answer the question.

Information Theory
Information theory is a field of study that focuses on quantifying, storing, and communicating information. It was developed by Claude Shannon in the 1940s to address problems related to communication over channels like telephone wires. Shannon introduced the concept of entropy as a way to measure the uncertainty or surprise associated with a set of messages. In information theory:
  • Entropy quantifies the amount of information or uncertainty in a message.
  • Higher entropy means more uncertainty or disorder.
  • It helps in reducing redundancy and optimizing data transmission.
For example, in a language with redundant symbols, understanding entropy allows us to streamline communication by reducing unnecessary repetition. This is the foundation of modern communication systems and data compression techniques.
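As a quick illustration (a minimal sketch; the function name is ours, not from the text), Shannon's formula can be evaluated directly:

```python
import math

def shannon_entropy(probs):
    """S_I = -sum_m p_m log2 p_m, in bits; p = 0 terms contribute nothing."""
    return -sum(p * math.log2(p) for p in probs if p > 0)

print(shannon_entropy([0.25, 0.25, 0.5]))   # 1.5 bits: the A'bç! rate
print(shannon_entropy([0.5, 0.5]))          # 1.0 bit: a fair binary channel
```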
Signal Compression
Signal compression is a method used to reduce the amount of data needed to represent a signal without losing vital information. This is particularly important in contexts where data storage or bandwidth is limited. Signal compression is built on the principles of entropy from information theory:
  • Lossless compression means reducing file sizes without losing any original data, like in ZIP files.
  • Common sounds or letters are encoded using fewer bits, while rarer ones use more bits.
  • This technique maximizes the efficiency of data transmission, saving both time and resources.
For instance, in the exercise, the A'bç! language can be optimally compressed by assigning shorter binary codes to frequent sounds. This approach reduces the overall channel capacity required to relay information efficiently.
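The standard construction of such a code is Huffman's algorithm, which repeatedly merges the two least probable symbols. A minimal sketch (not from the textbook; the names are our own):

```python
import heapq

def huffman_code(probs):
    """Build a prefix code from {symbol: probability} by repeatedly
    merging the two least probable entries into one subtree."""
    # Heap entries: (probability, tiebreaker, {symbol: codeword so far}).
    heap = [(p, i, {sym: ""}) for i, (sym, p) in enumerate(sorted(probs.items()))]
    heapq.heapify(heap)
    count = len(heap)
    while len(heap) > 1:
        p1, _, left = heapq.heappop(heap)
        p2, _, right = heapq.heappop(heap)
        # Prepend '0' to one subtree's codewords and '1' to the other's.
        merged = {sym: "0" + code for sym, code in left.items()}
        merged.update({sym: "1" + code for sym, code in right.items()})
        heapq.heappush(heap, (p1 + p2, count, merged))
        count += 1
    return heap[0][2]

print(huffman_code({"A": 0.25, "B": 0.25, "C": 0.5}))
# {'C': '0', 'A': '10', 'B': '11'} -- optimal for the A'bç! alphabet
# (up to swapping 0s and 1s within each merge)
```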
Probability Distribution
A probability distribution is a mathematical function that provides the probabilities of occurrence of different possible outcomes in an experiment. In the context of information theory and Shannon entropy, understanding the probability distribution of a system's possible states is key to calculating its entropy.
  • The probability of each sound in the A'bç! language reflects how often it occurs.
  • The probabilities for sounds A, B, and C are 1/4, 1/4, and 1/2, respectively.
  • The sum of all probabilities in a proper distribution is always equal to 1.
In signal processing, these probabilities are used to determine the best way to encode the data so that it requires the fewest bits on average. By calculating the entropy using these probabilities, we can establish the minimum number of bits required to send messages.
Binary Encoding
Binary encoding is the process of representing data using the binary number system, which has only two digits: 0 and 1. In digital communications, binary encoding is crucial, as it forms the foundation of how information is shared over digital communication channels.
  • Data is broken down into a sequence of bits (binary digits), which represent the original information.
  • An optimal encoding scheme ensures that frequently occurring symbols use fewer bits, such as using '0' for sound C in the A'bç! language.
  • This approach can be adapted and optimized based on the computed entropy and probability distribution of data.
Binary encoding doesn't just simplify storage and transmission; it also ensures that the maximum channel capacity is utilized efficiently. By translating sounds or signals into binary codes, systems can transmit messages more swiftly and accurately, adhering to the limits established by Shannon's entropy and channel capacity principles.


Most popular questions from this chapter

Our refrigerator is about \(2\,\mathrm{m} \times 1\,\mathrm{m} \times 1\,\mathrm{m}\), and has insulation about \(3\,\mathrm{cm}\) thick. The insulation is probably polyurethane, which has a thermal conductivity of about \(0.02\,\mathrm{W/(m\,K)}\). Assume that the refrigerator interior is at \(270\,\mathrm{K}\), and the room is at \(300\,\mathrm{K}\). (a) How many watts of energy leak from our refrigerator through this insulation? Our refrigerator runs at \(120\,\mathrm{V}\), and draws a maximum of \(4.75\) amps. The compressor motor turns on every once in a while for a few minutes. (b) Suppose (i) we don't open the refrigerator door, (ii) the thermal losses are dominated by the leakage through the foam and not through the seals around the doors, and (iii) the refrigerator runs as a perfectly efficient Carnot cycle. How much power on average will our refrigerator need to operate? What fraction of the time will the motor run?
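A back-of-the-envelope sketch of this arithmetic in code (assuming, per the problem, that the leak is pure conduction through the six foam faces and that the refrigerator achieves the Carnot coefficient of performance):

```python
# Numbers from the problem statement.
k, d = 0.02, 0.03              # conductivity W/(m K), foam thickness m
T_in, T_out = 270.0, 300.0     # K
area = 2 * (2*1 + 2*1 + 1*1)   # m^2: surface of a 2 m x 1 m x 1 m box

# (a) Conductive heat leak: Q_dot = k A (T_out - T_in) / d.
q_leak = k * area * (T_out - T_in) / d
print(q_leak)                  # 200 W

# (b) Carnot refrigerator: COP = T_c / (T_h - T_c), so W_dot = Q_dot / COP.
cop = T_in / (T_out - T_in)    # = 9
w_avg = q_leak / cop
print(w_avg)                   # ~22 W average electrical power
print(w_avg / (120 * 4.75))    # fraction of time the ~570 W motor runs (~4%)
```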

(Math, Complexity) (With Myers [75].) Let's consider a dynamical system, given by a mapping from the unit interval \((0,1)\) into itself:\({}^{47}\) $$ f(x) = 4\mu x(1-x) \tag{4.11} $$ where the time evolution is given by iterating the map: $$ x_0, x_1, x_2, \ldots = x_0, f(x_0), f(f(x_0)), \ldots $$ In particular, for \(\mu = 1\) it precisely folds the unit interval in half, and stretches it (non-uniformly) to cover the original domain. The mathematics community lumps together continuous dynamical evolution laws and discrete mappings as both being dynamical systems. You can motivate the relationship using Poincaré sections (figure 4.4), which connect a continuous recirculating dynamical system to the once-return map. The mapping (4.11) is not invertible, so it is not directly given by a Poincaré section of a smooth differential equation,\({}^{48}\) but the general stretching and folding exhibited by our map is often seen in driven physical systems without conservation laws. In this exercise, we will focus on values of \(\mu\) near one, where the motion is mostly chaotic. Chaos is sometimes defined as motion where the final position depends sensitively on the initial conditions. Two trajectories, starting a distance \(\epsilon\) apart, will typically drift apart in time as \(\epsilon e^{\lambda t}\), where \(\lambda\) is the Lyapunov exponent for the chaotic dynamics. Start with \(\mu = 0.9\) and two nearby points \(x_0\) and \(y_0 = x_0 + \epsilon\) somewhere between zero and one. Investigate the two trajectories \(x_0, f(x_0), f(f(x_0)), \ldots, f^{[n]}(x_0)\) and \(y_0, f(y_0), \ldots\) How fast do they separate? Why do they stop separating? Estimate the Lyapunov exponent. (Hint: \(\epsilon\) can be a few times the precision of the machine (around \(10^{-17}\) for double-precision arithmetic), so long as you are not near the maximum value of \(f\) at \(x_0 = 0.5\).) Many Hamiltonian systems are also chaotic. Two configurations of classical atoms or billiard balls, with initial positions and velocities that are almost identical, will rapidly diverge as the collisions magnify small initial deviations in angle and velocity into large ones. It is this chaos that stretches, folds, and kneads phase space (as in the Poincaré cat map of exercise 5.4) that is at root our explanation that entropy increases.\({}^{49}\)
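A minimal numerical sketch of the suggested experiment (the starting point \(x_0 = 0.3\) and separation \(\epsilon = 10^{-10}\) are illustrative choices):

```python
import math

mu = 0.9                          # 4*mu = 3.6, in the chaotic band

def f(x):
    return 4 * mu * x * (1 - x)

eps0 = 1e-10
x, y = 0.3, 0.3 + eps0            # two nearby starting points
for n in range(1, 151):
    x, y = f(x), f(y)
    if n % 25 == 0:
        sep = abs(y - x)
        # If sep ~ eps0 * exp(lambda * n), then lambda ~ ln(sep / eps0) / n.
        print(n, sep, math.log(sep / eps0) / n)
# The separation grows roughly exponentially, then saturates once it becomes
# comparable to the attractor size O(1): the trajectories stop separating
# because both remain confined to the unit interval.
```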

Entropy is a measure of your ignorance about a system: it is a measure of the lack of information. It has important implications in communication technologies: messages passed across the Ethernet communicate information, reducing the information entropy for the receiver. Shannon [112] worked out the use of entropy ideas in communications, focusing on problems where different messages have different probabilities. We'll focus on the simpler problem where all \(N\) messages are equally likely. Shannon defines the information entropy of an unread message as being \(\log_2 N = k_S \log N\), where \(k_S = 1/(\log_e 2)\) is analogous to Boltzmann's constant, and changes from log-base-\(e\) to log-base-2 (more convenient for computers, which think in base two). Your grandparent has sent you an e-mail message. From the header of the message, you know it contains 1000 characters. You know each character is made of 8 bits, which allows \(2^8 = 256\) different letters or symbols per character. Assuming all possible messages from your grandparent are equally likely (a typical message would then look like G*me!8V[beep]...), how many different messages \(N\) could there be? This (unrealistic) assumption gives an upper bound for the information entropy \(S_{\max}\). (a) What is \(S_{\max}\) for the unread message? Your grandparent writes rather dull messages: they all fall into the same pattern. They have a total of 16 equally likely messages.\({}^{43}\) After you read the message, you forget the details of the wording anyhow, and only remember these key points of information. (b) What is the actual information entropy change \(\Delta S_{\text{Shannon}}\) you undergo when reading the message? If your grandparent writes one message per month, what is the minimum number of 8-bit characters per year that it would take to send your grandparent's messages? (You may lump multiple messages into a single character.) (Hints: \(\Delta S_{\text{Shannon}}\) is the change in entropy from before you read the message to after you read which of the 16 messages it was. The length of 1000 is not important for this part.) Remark: This is an extreme form of data compression, like that used in gif images, zip files (Windows), and gz files (Unix). We are asking for the number of characters per year for an optimally compressed signal.
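The arithmetic, sketched in code (all numbers are from the problem statement):

```python
import math

# (a) Upper bound: 1000 characters, 256 equally likely symbols each.
s_max = 1000 * math.log2(256)
print(s_max)                   # 8000 bits

# (b) 16 equally likely messages: reading one resolves log2(16) bits.
ds = math.log2(16)             # 4 bits per message
bits_per_year = 12 * ds        # one message per month
print(bits_per_year / 8)       # minimum 8-bit characters per year: 6
```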

Freeman Dyson [28] discusses how living things might evolve to cope with the cooling and dimming we expect during the heat death of the universe. Normally one speaks of living things as beings that consume energy to survive and proliferate. This is of course not correct: energy is conserved, and cannot be consumed. Living beings intercept entropy flows: they use low-entropy sources of energy (e.g., high-temperature solar radiation for plants, candy bars for us) and emit high-entropy forms of the same energy (body heat). Dyson ignores the survival and proliferation issues; he's interested in getting a lot of thinking in before the universe ends. He presumes that an intelligent being generates a fixed entropy \(\Delta S\) per thought.\({}^{38}\) (This correspondence of information with entropy is a standard idea from computer science: see exercises 5.6 and 5.7.) Energy needed per thought. Assume that the being draws heat \(Q\) from a hot reservoir at \(T_1\) and radiates it away to a cold reservoir at \(T_2\). (a) What is the minimum energy \(Q\) needed per thought, in terms of \(\Delta S\) and \(T_2\)? You may take \(T_1\) very large. Related formulæ: \(\Delta S = Q_2/T_2 - Q_1/T_1\); First Law: \(Q_1 - Q_2 = W\) (energy is conserved). Time needed per thought to radiate energy. Dyson shows, using theory not important here, that the power radiated by our intelligent-being-as-entropy-producer is no larger than \(C T_2^3\), a constant times the cube of the cold temperature.\({}^{39}\) (b) Write an expression for the maximum rate of thoughts per unit time \(dH/dt\) (the inverse of the time \(\Delta t\) per thought), in terms of \(\Delta S\), \(C\), and \(T_2\). Number of thoughts for an ecologically efficient being. Our universe is expanding: the radius \(R\) grows roughly linearly in time \(t\). The microwave background radiation has a characteristic temperature \(\Theta(t) \sim R^{-1}\) which is getting lower as the universe expands: this redshift is due to the Doppler effect. An ecologically efficient being would naturally try to use as little heat as possible, and so wants to choose \(T_2\) as small as possible. It cannot radiate heat at a temperature below \(T_2 = \Theta(t) = A/t\). (c) How many thoughts \(H\) can an ecologically efficient being have between now and time infinity, in terms of \(\Delta S\), \(C\), \(A\), and the current time \(t_0\)? Time without end: Greedy beings. Dyson would like his beings to be able to think an infinite number of thoughts before the universe ends, but consume a finite amount of energy. He proposes that his beings need to be profligate in order to get their thoughts in before the world ends: he proposes that they radiate at a temperature \(T_2(t) \sim t^{-3/8}\) which falls with time, but not as fast as \(\Theta(t) \sim t^{-1}\). (d) Show that with Dyson's cooling schedule, the total number of thoughts \(H\) is infinite, but the total energy consumed \(U\) is finite. We should note that there are many refinements of Dyson's ideas. There are potential difficulties that may arise, such as quantum limits to cooling or proton decay: how will we make bodies out of electrons and neutrinos? And there are different challenges depending on the expected future of the universe: a big crunch, for example, where the universe collapses back on itself, demands that we adapt to heat and pressure (but an infinite number of thoughts appears to remain possible before the end).
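A sketch of the reasoning for parts (a)-(c), under the problem's stated assumptions (taking \(T_1\) very large, so the hot-reservoir entropy term drops out and the heat drawn equals the heat radiated): per thought, $$ \Delta S = \frac{Q_2}{T_2} - \frac{Q_1}{T_1} \approx \frac{Q}{T_2}, \qquad \text{so} \quad Q_{\min} = T_2\, \Delta S. $$ Since the radiated power is at most \(C T_2^3\) and each thought must radiate \(Q_{\min}\), $$ \frac{dH}{dt} \le \frac{C T_2^3}{T_2\, \Delta S} = \frac{C T_2^2}{\Delta S}, $$ and with \(T_2 = \Theta(t) = A/t\), $$ H \le \int_{t_0}^{\infty} \frac{C A^2}{\Delta S\, t^2}\, dt = \frac{C A^2}{\Delta S\, t_0}, $$ which is finite: an ecologically efficient being can only think a bounded number of thoughts.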

We saw that entropy technically doesn't increase for a closed system, for any Hamiltonian, either classical or quantum. However, we can show that entropy increases for most of the coarse-grained effective theories that we use in practice: when we integrate out degrees of freedom, we provide a means for the information about the initial condition to be destroyed. Here you'll show that entropy increases for the diffusion equation. Diffusion Equation Entropy. Let \(\rho(x, t)\) obey the one-dimensional diffusion equation \(\partial \rho / \partial t = D\, \partial^2 \rho / \partial x^2\). Assume that the density \(\rho\) and all its gradients die away rapidly at \(x = \pm\infty\).\({}^{42}\) Derive a formula for the time derivative of the entropy \(S = -k_B \int \rho(x) \log \rho(x)\, dx\) and show that it strictly increases in time. (Hint: integrate by parts. You should get an integral of a positive definite quantity.)
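A quick numerical check of this result (a minimal sketch: explicit finite differences on a grid wide enough that the profile keeps dying away at the edges; grid parameters are illustrative):

```python
import numpy as np

D, dx, dt = 1.0, 0.1, 0.004          # dt < dx^2 / (2 D) for stability
x = np.arange(-10, 10, dx)
rho = np.exp(-x**2)                   # initial profile, decaying at the edges
rho /= rho.sum() * dx                 # normalize: integral of rho dx = 1

def entropy(rho):
    """S = -integral rho log rho dx, with k_B = 1."""
    return -np.sum(rho * np.log(rho)) * dx

for step in range(1001):
    if step % 250 == 0:
        print(step, entropy(rho))     # S increases monotonically
    # Centered second difference; the periodic wrap from np.roll is harmless
    # while rho is negligible at the boundaries.
    lap = (np.roll(rho, 1) - 2 * rho + np.roll(rho, -1)) / dx**2
    rho = rho + dt * D * lap
```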
