Revision as of 23:33, 25 October 2021

Internal

Huffman Codes

Overview

A binary code is a function that maps each character of an alphabet Σ to a binary string.

Fixed Length Code

A fixed length binary code is a binary code that maps each character of the alphabet Σ to a fixed length code. Arbitrary characters have the same length representation in the binary code. ASCII is an example of a fixed length binary code. However, in case when some characters are more likely to appear than others, a schema that allocates shorter codes to characters that are likely to appear more frequently tends to encode information using fewer bits. This encoding schemas are called variable length codes.

Variable Length Code

A variable length binary code is a binary code that maps each character of the alphabet Σ to a variable length code, usually in relation to the frequency the character is used in certain patterns of communication. The more frequent characters are associated with shorter variable length codes. Huffman codes are an example of variable length binary code.

A problem with variable length codes is that they may introduce ambiguity that makes decoding problematic. For example, in the case of the alphabet Σ = {A, B, C, D}, if we use the encoding {0, 1, 01, 10}, there is not enough information to decode 001. It could be decoded as AAB or as AC, because without further precautions, it is not clear when one symbol ends and another symbol begins.

Prefix-Free Codes

Prefix-free codes solve the ambiguity of decoding variable length code-encoded content.

A prefix-free code is defined as follows: for every pair i,j of characters in the alphabet Σ, neither of the encodings f(i) and f(j) is a prefix of the other.

In the previous example, the {0, 1, 01, 10} encoding of the alphabet Σ = {A, B, C, D} is not prefix free, as A and C encodings have the same prefix, and also B and D. However, {0, 10, 110, 111} is a prefix free encoding.

Revision as of 23:31, 25 October 2021 (view source) Ovidiu (talk \| contribs) (→‎Prefix-Free Codes) ← Older edit		Revision as of 23:33, 25 October 2021 (view source) Ovidiu (talk \| contribs) (→‎Prefix-Free Codes) Newer edit →
Line 14:		Line 14:
	A prefix-free code is defined as follows: for every pair i,j of characters in the alphabet Σ, neither of the encodings f(i) and f(j) is a prefix of the other.		A prefix-free code is defined as follows: for every pair i,j of characters in the alphabet Σ, neither of the encodings f(i) and f(j) is a prefix of the other.

	In the previous example, the {} encoding of the alphabet Σ = {A, B, C, D}		In the previous example, the {0, 1, 01, 10} encoding of the alphabet Σ = {A, B, C, D} is not prefix free, as A and C encodings have the same prefix, and also B and D. However, {0, 10, 110, 111} is a prefix free encoding.

Binary Codes: Difference between revisions

Revision as of 23:33, 25 October 2021

Contents

Internal

Overview

Fixed Length Code

Variable Length Code

Prefix-Free Codes

Navigation menu

Binary Codes: Difference between revisions

Revision as of 23:33, 25 October 2021

Internal

Overview

Fixed Length Code

Variable Length Code

Prefix-Free Codes

Navigation menu

Search