Revision as of 18:22, 26 June 2018

Internal

Character Encoding

Overview

Character information is maintained in Java by the primitive type char, which was designed based on the original Unicode specification that allowed only 2¹⁶ code points, so it was defined as a fixed-with 16-bit/2-byte entity. Since then, the Unicode standard has evolved to allow for characters whose representation requires more than 16 bits.

Java platform uses the UTF-16 representation in char arrays and in the String and StringBuffer classes. The Basic Multilingual Plane characters are represented as char instances, while the supplementary characters are represented as a pair of char values. For more details about supplementary character representation in Java see https://docs.oracle.com/javase/10/docs/api/java/lang/Character.html.

Also see:

char

More resources:

http://www.oracle.com/us/technologies/java/supplementary-142654.html

@@ Line 1: / Line 1: @@
+=Internal=
+* [[Character Encoding#Java_and_Unicode|Character Encoding]]
+=Overview=
 Character information is maintained in Java by the primitive type <tt>char</tt>, which was designed based on the original Unicode specification that allowed only 2<sup>16</sup> code points, so it was defined as a fixed-with 16-bit/2-byte entity. Since then, the Unicode standard has evolved to allow for characters whose representation requires [[#Unicode_Code_Points|more than 16 bits]].

Java and Unicode: Difference between revisions

Revision as of 18:22, 26 June 2018

Internal

Overview

Navigation menu

Java and Unicode: Difference between revisions

Revision as of 18:22, 26 June 2018

Internal

Overview

Navigation menu

Search