|
Unicode is a character encoding scheme using 16-bit codes. It is the UCS-2 sub-set of the UCS (Universal Character Set) that has a superset of 32-bit encodings. The large code range allows for a systematic encoding of wide range of alphabets and symbols. Nearly 40 thousand characters have been defined so far. This compares to the 256 possible characters with ASCII. See the www.unicode.org site for displays of Charts of Unicode values vs characters. Internally, Java only uses Unicode for character handling. In Java a character, if not available on the keyboard, can be represented by its Unicode value given as 4 hex values preceded by the \u escape sequence, e.g. ö is given by \u00F6, è by \u00E8. Remember that Fonts determine only how the characters are displayed on the screen or printed on paper. For a given Font set, a character code will either point to a known font for that code or to a default symbol indicating an unknown character. We can see what Fonts are available with the current host by using this applet. It shows the row value X and the column value Y for \u00XY Unicode values.
|
||||||||||||
|
|||||||||||||