Character Encoding

From NovaOrdis Knowledge Base
Jump to navigation Jump to search

External

Internal

Overview

Character encoding is the process though which characters within a text document are represented by numeric codes. Depending of the character encoding used, the same text will end up with different binary representations. Common character encodings are ASCII and Unicode.

Character Set

Character Encoding Standards

ASCII

Unicode

Unicode Transformation Format UTF.

UTF-8

UTF-16

UTF-32

Universal Character Set (UCS) ISO 10646

Western

Latin-US