Revision as of 20:29, 25 June 2018

External

https://en.wikipedia.org/wiki/Character_encoding

Internal

Overview

Character encoding is the process though which characters within a text document are represented by numeric codes. Depending of the character encoding used, the same text will end up with different binary representations. Common character encoding standards are ASCII and Unicode.

Character Set

Character Encoding Standards

ASCII

Unicode

Unicode supports a larger character set than ASCII.

Binary representation of a text represented in Unicode depends on the "transformation format" used. UTF stands for "Unicode Transformation Format", and the number specified after the dash in the transformation format name represents the number of bits used to represent each character.

@@ Line 19: / Line 19: @@
 Unicode supports a larger character set than [[#ASCII|ASCII]].
-Unicode Transformation Format UTF.
+Binary representation of a text represented in Unicode depends on the "transformation format" used. UTF stands for "Unicode Transformation Format", and the number specified after the dash in the transformation format name represents the number of bits used to represent each character.
 ===UTF-8===

Character Encoding: Difference between revisions

Revision as of 20:29, 25 June 2018

Contents

External

Internal

Overview

Character Set

Character Encoding Standards

ASCII

Unicode

UTF-8

UTF-16

UTF-32

Universal Character Set (UCS) ISO 10646

Western

Latin-US

Navigation menu

Character Encoding: Difference between revisions

Revision as of 20:29, 25 June 2018

External

Internal

Overview

Character Set

Character Encoding Standards

ASCII

Unicode

UTF-8

UTF-16

UTF-32

Universal Character Set (UCS) ISO 10646

Western

Latin-US

Navigation menu

Search