External

Internal

Overview

Character information is maintained in Java by the primitive type char, which was designed based on the original Unicode 1.0 specification that allowed only 2¹⁶ code points, so it was defined as a fixed-with 16-bit/2-byte entity. Since then, the Unicode standard has evolved to allow for characters whose representation requires more than 16 bits.

U+n Notation Support

U+n notation is supported in Java as follows:

Character Representation

Java platform uses the UTF-16 representation in char arrays and in the String, StringBuffer and StringBuilder classes.

The Basic Multilingual Plane characters are represented as char instances, while the supplementary characters are represented as a pair of char values. Java 5, which supports Unicode 4.0, introduced enhancements to correctly handle Unicode supplementary characters.

Java and Unicode

Contents

External

Internal

Overview

U+n Notation Support

Character Representation

Navigation menu

Java and Unicode

External

Internal

Overview

U+n Notation Support

Character Representation

Navigation menu

Search