The unicode character set was used in the simplest forms of coding over 25 years ago and is still used today for computing in e-mails, the web, and even for fonts.
Character literals in Java are stored as UTF-16 Unicode characters. Each character takes up 16 bits of memory, allowing for representation of a wide range of characters in the Unicode character set.
UNICODE
I did it and it is this
1 byte (Unicode)
In computer memory, character are represented using predefined character set. Historically 7 bit American Standard Code for Information Interchange (ASCII) code, 8 bit American National Standards Institute (ANSI) code and Extended Binary Coded Decimal Interchange Code(EBCDIC) were used. These coding scheme represents selected characters into 7 or 8 bit binary code. These character schemes do not represent all the characters in all the languages in uniform format. At present Unicode is used to represent characters into the computer memory. Unicode provides universal and efficient character presentations and hence evolved as modern character representation scheme. Unicode scheme is maintained by a non-profit organization called Unicode consortium. Unicode is also compatible with other coding scheme like ASCII. Unicode use either 16 bits or 32 bits to represent a character. Unicode has capability represent characters from all the major languages in use currently across the world.
The Unicode Transformation Format Unicode is a character set supported across many commonly used software applications and operating systems. For example, many popular web browser, e-mail, and word processing applications support Unicode. Operating systems that support Unicode include Solaris Operating Environment, Linux, Microsoft Windows 2000, and Apple's Mac OS X. Applications that support Unicode are often capable of displaying multiple languages and scripts within the same document. In a multilingual office or business setting, Unicode's importance as a universal character set cannot be overlooked. Unicode is the only practical character set option for applications that support multilingual documents. However, applications do have several options for how they encode Unicode. An encoding is the mapping of Unicode code points to a stream of storable code units or octets. The most common encodings include the following: UTF-8 UTF-16 UTF-32 Each encoding has advantages and drawbacks. However, one encoding in particular has gained widespread acceptance. That encoding is UTF-8.
The idea is to have a single character set that can represent ALL the languages of the World, without the need to change between different character encodings.
A wyde is a unit of two-byte unsigned data, mostly used for a Unicode character.
The number of bytes used by a character varies from language to language. Java uses a 16-bit (two-byte) character so that it can represent many non-Latin characters in the Unicode character set.
24
ASCII is a set of digital codes widely used as a standard fromat in the transfer of text. Unicode is an international encoding standard for used with different languages and scripts
ASCII is a set of digital codes widely used as a standard fromat in the transfer of text. Unicode is an international encoding standard for used with different languages and scripts