Shannon–Fano–Elias coding

In information theory, Shannon–Fano–Elias coding is a precursor to arithmetic coding, in which probabilities are used to determine codewords.^[1]

Algorithm description

Given a discrete random variable X of ordered values to be encoded, let $p (x)$ be the probability for any x in X. Define a function

\bar{F} (x) = \sum_{x_{i} < x} p (x_{i}) + \frac{1}{2} p (x)

Algorithm:

For each x in X,

Let Z be the binary expansion of

\bar{F} (x)

.

Choose the length of the encoding of x,

L (x)

, to be the integer

⌈ \log_{2} \frac{1}{p (x)} ⌉ + 1

Choose the encoding of x,

c o d e (x)

, be the first

L (x)

most significant bits after the decimal point of Z.

Example

Let X = {A, B, C, D}, with probabilities p = {1/3, 1/4, 1/6, 1/4}.

For A

\bar{F} (A) = \frac{1}{2} p (A) = \frac{1}{2} \cdot \frac{1}{3} = 0.1666 \dots

In binary, Z(A) = 0.0010101010...

L (A) = ⌈ \log_{2} \frac{1}{\frac{1}{3}} ⌉ + 1 = 𝟑

code(A) is 001

For B

\bar{F} (B) = p (A) + \frac{1}{2} p (B) = \frac{1}{3} + \frac{1}{2} \cdot \frac{1}{4} = 0.4583333 \dots

In binary, Z(B) = 0.01110101010101...

L (B) = ⌈ \log_{2} \frac{1}{\frac{1}{4}} ⌉ + 1 = 𝟑

code(B) is 011

For C

\bar{F} (C) = p (A) + p (B) + \frac{1}{2} p (C) = \frac{1}{3} + \frac{1}{4} + \frac{1}{2} \cdot \frac{1}{6} = 0.66666 \dots

In binary, Z(C) = 0.101010101010...

L (C) = ⌈ \log_{2} \frac{1}{\frac{1}{6}} ⌉ + 1 = 𝟒

code(C) is 1010

For D

\bar{F} (D) = p (A) + p (B) + p (C) + \frac{1}{2} p (D) = \frac{1}{3} + \frac{1}{4} + \frac{1}{6} + \frac{1}{2} \cdot \frac{1}{4} = 0.875

In binary, Z(D) = 0.111

L (D) = ⌈ \log_{2} \frac{1}{\frac{1}{4}} ⌉ + 1 = 𝟑

code(D) is 111

Algorithm analysis

Prefix code

Shannon–Fano–Elias coding produces a binary prefix code, allowing for direct decoding.

Let bcode(x) be the rational number formed by adding a decimal point before a binary code. For example, if code(C) = 1010 then bcode(C) = 0.1010. For all x, if no y exists such that

bcode (x) \leq bcode (y) < bcode (x) + 2^{- L (x)}

then all the codes form a prefix code.

By comparing F to the CDF of X, this property may be demonstrated graphically for Shannon–Fano–Elias coding.

By definition of L it follows that

2^{- L (x)} \leq \frac{1}{2} p (x)

And because the bits after L(y) are truncated from F(y) to form code(y), it follows that

\bar{F} (y) - bcode (y) \leq 2^{- L (y)}

thus bcode(y) must be no less than CDF(x).

So the above graph demonstrates that the $bcode (y) - bcode (x) > p (x) \geq 2^{- L (x)}$ , therefore the prefix property holds.

Code length

The average code length is $L C (X) = \sum_{x \in X} p (x) L (x) = \sum_{x \in X} p (x) (⌈ \log_{2} \frac{1}{p (x)} ⌉ + 1)$ .
Thus for H(X), the entropy of the random variable X,

H (X) + 1 \leq L C (X) < H (X) + 2

Shannon Fano Elias codes from 1 to 2 extra bits per symbol from X than entropy, so the code is not used in practice.

References

↑ T. M. Cover and Joy A. Thomas (2006). Elements of information theory (2nd ed.). John Wiley and Sons. pp. 127–128. ISBN 978-0-471-24195-9. https://books.google.com/books?id=0QuawYmc2pIC&pg=PA127.

0.00

(0 votes)

Original source: https://en.wikipedia.org/wiki/Shannon–Fano–Elias coding. Read more

[1] T. M. Cover and Joy A. Thomas (2006). Elements of information theory (2nd ed.). John Wiley and Sons. pp. 127–128. ISBN 978-0-471-24195-9. https://books.google.com/books?id=0QuawYmc2pIC&pg=PA127.

[1]

Anonymous

Search

Shannon–Fano–Elias coding

Namespaces

More

Page actions

Contents