CCCII (Chinese Character Code for Information Interchange, 中文資訊交換碼) is a character set developed specifically to address the problem of interchange of Chinese information. It is used mostly by libraries because the code contains various properties considered to be desirable by libraries. Modern-style library itories and/or access points for maps, prints or other artwork, microfilm, microfiche, audio tapes, CDs, LPs, video tapes and DVDs, and provide public facilities to access CD-ROM databases and the Internet. ...
CCCII is a superset of ASCII designed to conform to ISO 2022. Each Chinese character is represented by a 3-byte code in which each byte is 7-bit. Thus, the maximum number of Chinese characters representable in CCCII is 94×94×94 = 830584. In practice the number of characters encodable by CCCII would be less than this number, because variant characters are encoded in related ISO 2022 planes under CCCII, so most of the code points would have to be reserved for variants. There are 95 printable ASCII characters, numbered 32 to 126. ... ISO 2022, more formally ISO/IEC 2022, is an ISO standard (equivalent to the ECMA standard ECMA-35) specifying a technique for including multiple character sets in a single character encoding. ...
References
Some information on this page is based on the information on the CNS official web site (http://www.cns11643.gov.tw).
External links
CCCII unofficial web site (http://www.cccii.org.tw) (English version of pages available)
CNS 11643 official web site (http://www.cns11643.gov.tw) (English version of pages available) has information about the CCCII character set in the "Chinese Information Code" section
Han unification is the process used by the authors of Unicode and the Universal CharacterSet to map multiple charactersets of the CJK languages into a single set of unified characters.
While the Han root character may be the same for CJK languages, the glyphs in common use for the same characters may not be, and new characters were invented in each country.
Unicode defines abstract characters, as opposed to glyphs, which are particular visual representations of a character in a font, or graphemes, basic units of writing in a particular language.
To use this character scheme with MIME, CN-GB is used as the value for the charset parameter: Content-Type: text/plain; charset=cn-gb; charset-edition=1980 Note: The "charset-edition" is a new MIME parameter described in section 4.1 of the "Specification" part of this document.
To use this character scheme with MIME, CN-Big5 is used as the value for the charset parameter: Content-Type: text/plain; charset=cn-big5; charset-edition=1984 Note: The "charset-edition" is a new MIME parameter described in section 4.1 of the "Specification" part of this document.
A characterset extension has displayed glyphs for code points that are not assigned in the characterset, for example, vendor-specific extensions of standard charactersets.