FACTOID # 76: The fourteen unhappiest countries are all in Eastern Europe.
 
 Home   Encyclopedia   Statistics   Countries A-Z   Flags   Maps   Education   Forum   FAQ   About 
 
WHAT'S NEW
RECENT ARTICLES
More Recent Articles »
 

FACTS & STATISTICS    Simple view

  1. Select countries to view: (hold down Control key and click to select several)

     

     

    Compare:

     

     

  1. Select fact or statistic: (* = graphable)

     

     

     

  2. (OPTIONAL) Compare to statistic: (both need to be graphable)

     

     

     

  3. View result as:

     

       
(OR) SEARCH ALL encyclopedia, stats & forums:   

Encyclopedia > Unicode typefaces
Unicode
Encodings
UCS
Mapping
Bi-directional text
BOM
Han unification
Unicode and HTML
Unicode and e-mail
Unicode typefaces

Unicode typefaces (also known as UCS fonts and Unicode fonts) contains wide range of characters, letters, digits, glyphs, symbols, ideograms, logograms, etc, which are collectively mapped into Universal Character Set, also known as, UCS (which is an international standard ISO/IEC 10646), derived from many different languages, scripts from all around the world. Thus, only one font is able to display vast range of characters, from different languages. Due to technical limitations, some web browsers may not display some special characters in this article. ... This page compares Unicode encodings. ... UTF-7 (7-bit Unicode Transformation Format) is a variable-length character encoding that was proposed for representing Unicode-encoded text using a stream of ASCII characters, for example for use in Internet e-mail messages. ... UTF-8 (8-bit Unicode Transformation Format) is a variable-length character encoding for Unicode created by Ken Thompson and Rob Pike. ... CESU-8 is a variant of UTF-8 that is described in Unicode Technical Report 26. ... In computing, UTF-16 is a fixed-length (16 bits) character encoding for Unicode. ... UTF-32 and UCS-4 are alternate names for a method of encoding Unicode characters, using the fixed amount of exactly 32 bits for each Unicode code point. ... UTF-EBCDIC is an encoding of Unicode that is meant to be EBCDIC friendly so that some older EBCDIC applications can handle some Unicode data. ... The Standard Compression Scheme for Unicode (SCSU) is a Unicode Technical Standard to reduce the number of bytes needed to represent text, especially if that text uses mostly characters from a small number of Unicode blocks. ... This article or section may be confusing for some readers, and should be edited to be clearer. ... GB18030 is the registered internet name for the official character set of the Peoples Republic of China. ... The international standard ISO/IEC 10646 defines the Universal Character Set (UCS) as a character encoding. ... Unicode reserves 1,114,112 (= 220 + 216) code points, and currently assigns characters to more than 96,000 of those code points. ... Some writing systems of the world, such as Arabic and Hebrew, are written in a form known as right-to-left (RTL), in which writing begins at the right-hand side of a page and concludes at the left-hand side. ... A Byte Order Mark (BOM) is the character at code point U+FEFF (zero-width no-break space), when that character is used to denote the endianness of a string of UCS/Unicode characters encoded in UTF-16 or UTF-32 and/or as a marker to indicate that text... Han unification is the process used by the authors of Unicode and the Universal Character Set to map multiple character sets of the CJK languages into a single set of unified characters. ... The relationship between Unicode and HTML tends to be a difficult topic for many computer professionals, document authors, and web users alike. ... Many e-mail clients are now able to use Unicode. ... This article is about letter, a written message from one party to another. ... In mathematics and computer science, a numerical digit is a symbol, e. ... A glyph is a carved figure or character, incised or in relief; a carved pictograph; hence, a pictograph representing a form originally adopted for sculpture, whether carved or painted. ... A Chinese character. ... A Chinese logogram A logogram, or logograph, is a single written character which represents a word or a morpheme (a meaningful unit of language). ... Unicode reserves 1,114,112 (= 220 + 216) code points, and currently assigns characters to more than 96,000 of those code points. ... The international standard ISO/IEC 10646 defines the Universal Character Set (UCS) as a character encoding. ... The International Organization for Standardization (ISO) is an international standard-setting body composed of representatives from national standards bodies. ... The International Electrotechnical Commission (IEC) is an international standards organization dealing with electrical, electronic and related technologies. ... The Universal Character Set is a character encoding that is defined by the international standard ISO/IEC 10646. ... Writing Systems of the World today A Specimen of typeset fonts and languages, by William Caslon, letter founder; from the 1728 Cyclopaedia. ...


Current Unicode fonts do not include all UCS characters defined in the present revision of ISO 10646 standard. Their creators are updating them with more characters, which were not included previously, or, were not placed correctly, or, including their own characters, or, updating them with new characters defined in the newer standard revision of ISO 10646. Due to technical limitations, some web browsers may not display some special characters in this article. ...


The UCS has over 1.1 million code points, but only the first 65,536 (the Plane 0: Basic Multilingual Plane, or BMP) had entered into common use before 2000. See the Mapping of Unicode characters article for more information on other planes (Plane 1: SMP, Plane 2: SIP, Plane 14: SSP, Plane 15 and 16: reserved for PUA) and therein included scripts for different languages, dialects. Unicode reserves 1,114,112 (= 220 + 216) code points, and currently assigns characters to more than 96,000 of those code points. ... Unicode reserves 1,114,112 (= 220 + 216) code points, and currently assigns characters to more than 96,000 of those code points. ... Unicode reserves 1,114,112 (= 220 + 216) code points, and currently assigns characters to more than 96,000 of those code points. ... Unicode reserves 1,114,112 (= 220 + 216) code points, and currently assigns characters to more than 96,000 of those code points. ... Unicode reserves 1,114,112 (= 220 + 216) code points, and currently assigns characters to more than 96,000 of those code points. ... Unicode reserves 1,114,112 (= 220 + 216) code points, and currently assigns characters to more than 96,000 of those code points. ...


There are typgraphical ambiguities in Unicode, so that some of the unified Chinese characters will be typographically different in different regions. For example, Unicode point U+9AA8 (骨) is typographically different between simplified Chinese and traditional Chinese. This has implications for the idea that a single typeface can satisfy the needs of all locales[1].


However, Unicode is now the base character set for many new standards and protocols, and is built into the architecture of operating systems (Microsoft Windows, Apple Mac OS X, and many versions of Unix), programming languages (Perl, Python, Java, Common LISP, APL), and libraries (IBM International Components for Unicode (ICU) along with the Pango, Graphite, Scribe, Uniscribe, and ATSUI rendering engines), font formats (TrueType and OpenType) and so on. Many other standards are also getting upgraded to Unicode compliancy, day by day. Due to technical limitations, some web browsers may not display some special characters in this article. ... Microsoft Windows is a family of operating systems by Microsoft for use on personal computers, although versions of Windows designed for servers, embedded devices, and other platforms also exist. ... Apple Computer, Inc. ... Mac OS X is a proprietary operating system developed and sold by Apple Computer, Inc. ... Unix or UNIX is a computer operating system originally developed in the 1960s and 1970s by a group of AT&T Bell Labs employees including Ken Thompson, Dennis Ritchie, and Douglas McIlroy. ... Perl, also Practical Extraction and Report Language (a backronym, see below) is a dynamic procedural programming language designed by Larry Wall and first released in 1987. ... Python is an interpreted programming language created by Guido van Rossum in 1990. ... Java is an object-oriented programming language developed by James Gosling and colleagues at Sun Microsystems in the early 1990s. ... Common Lisp, commonly abbreviated CL, is a dialect of the Lisp programming language, standardised by ANSI X3. ... APL (for A Programming Language) is an array programming language based on a notation invented in 1957 by Kenneth E. Iverson while at Harvard University. ... International Components for Unicode (ICU) is an open source project of mature C/C++ and Java libraries for Unicode support, software internationalization and software globalization. ... In computing, Pango is an open source library for rendering internationalized texts integrated into GTK+ 2. ... Graphite is a programmable Unicode-compliant smart-font rendering system developed by SIL International. ... In computer programming, Qt is a cross-platform graphical widget toolkit for the development of GUI programs. ... Uniscribe is the Microsoft Windows set of services for rendering Unicode-encoded text. ... The Apple Type Services for Unicode Imaging (ATSUI) is the Mac OS set of services for rendering Unicode-encoded text. ... TrueType is an outline font standard originally developed by Apple Computer in the late 1980s as a competitor to Adobes Type 1 fonts used in PostScript. ... It has been suggested that OpenType features supported by Mac OS X be merged into this article or section. ...


Utility software such as the Character Map applet included with Windows 2000/XP, MainType (by HighLogic. Commercial, 40-day trial version is available), BabelMap (by Andrew West. Free, donation-ware.), Unicode Font Viewer (by Mike Lischke. Freeware), Quick Key (by Nathanael Jones. Opensource, free.), etc, can be used to see exactly which characters are included, inside a font file. Character Map (charmap. ... MainType is a font viewer and font manager software. ... BabelMap can display unicode characters inside different fonts, characters can be copied and pasted from its Edit Buffer into a document as well. ... Unicode Font Viewer can display unicode characters from any TrueType unicode font. ... Quick Key is primarily a flexible keyboard extension software, along with the features similar to Microsoft Windows Character Map and Alt codes character input method. ...


Unicode fonts may refer to:
Arial, Arial Unicode MS, Bitstream Cyberbit, Cardo (font), Caslon, Code2000, Charis SIL, Chrysanthi Unicode (Chryſanþi), ClearlyU, DejaVu fonts, Doulos SIL, Everson Mono, Gentium, GNU Unifont, Junicode, Lucida Grande, Lucida Sans Unicode, Microsoft Sans Serif, New Gulim, Tahoma, Times New Roman, TITUS Cyberbit Basic, Y.OzFontN. It has been suggested that this article or section be merged into Arial. ... In digital typography, Arial Unicode MS is an extended version of the OpenType font Arial. ... Bitstream Cyberbit is a commercial Unicode font designed by Bitstream. ... William Caslons 1734 Specimen Origins Caslon refers to a large family of typefaces originating from William Caslons original old style, first shown in 1734 in a now-highly-sought broadside specimen. ... Code2000 is a digital font which includes characters and symbols from a very large range of writing systems. ... The DejaVu fonts are modifications of the Bitstream Vera fonts. ... Doulos SIL is a serif typeface developed by SIL International. ... Gentium is a proprietary Unicode font designed by Victor Gaultney and copyright SIL International. ... Junicode (short for Junius-Unicode) is a free Unicode font for mediaevalists. ... Lucida Grande is a font available on Mac OS X. Like the similar Microsoft Windows font Lucida Sans Unicode, it supports the most commonly used characters defined in version 2. ... In digital typography, 's Lucida Sans Unicode OpenType font is designed to support the most commonly used characters defined in version 2. ... Tahoma is a sans-serif typeface designed by Matthew Carter for the Microsoft Corporation in 1999. ... The Times New Roman typeface, on top at 88. ... Titus Cyberbit Basic is a Unicode font designed by Bitstream and the TITUS (Thesaurus Indogermanischer Text- und Sprachmaterialien) for Unicode 4. ...

Contents


Unicode Font List

Out of many Unicode fonts, only few are listed below, which are mostly and commonly used by the (mainstream) majority of users around the world, in major platforms. Unicode font list with more fonts can be found in this (List of typefaces) article's "Unicode fonts" section. In computing, a platform describes some sort of framework, either in hardware or software, which allows software to run. ... This is a list of typefaces. ...

Unicode Fonts
Font Characters Glyphs Version Other Info
Arial 1,419 1674 3.00  
Arial Unicode MS 38,917 50,377 1.01  
Bitstream Cyberbit  ? 29,934 2.0b  
Cardo 2,195 2,882 0.98 (2004)  
Caslon  ? 3,551 16-12-2001  
Code2000 51,155 63,888 1.15  
Charis SIL 1,956 3,084 4.002  
Chryſanþi Unicode 4,818 4,383 3.1  
ClearlyU  ? 9,538 1.9  
DejaVu fonts 1,686 1,701 2.3  
Doulos SIL 1,956 3,083 4.014  
Everson Mono 4,893 4,899 4.1.3[2]  
Gentium 1,469 1,699 1.0.1 (2003)  
GNU Unifont  ? > 35,000 -  
Junicode 1,433 1,435 0.6.3  
Lucida Grande - 2,244 5.0d8e1  
Lucida Sans Unicode  ? 1,776 2.00  
Microsoft Sans Serif 2,301 2,257 1.41  
New Gulim  ? 49,284 3.10  
Tahoma 1,912 2,034 3.14  
Times New Roman 1,419 1,674 3.00  
TITUS Cyberbit Basic  ? 9,779 3.0 (2000)  
Y.OzFontN 16,777 59,678 9.13  

Arial, sometimes marketed as Arial MT, is a typeface and a computer font packaged with Microsoft Windows, other Microsoft software applications, and many PostScript computer printers. ... In digital typography, Arial Unicode MS is an extended version of the OpenType font Arial. ... Bitstream Cyberbit is a commercial Unicode font designed by Bitstream. ... William Caslons 1734 Specimen Origins Caslon refers to a large family of typefaces originating from William Caslons original old style, first shown in 1734 in a now-highly-sought broadside specimen. ... Code2000 is a digital font which includes characters and symbols from a very large range of writing systems. ... The DejaVu fonts are modifications of the Bitstream Vera fonts. ... Doulos SIL is a serif typeface developed by SIL International. ... Gentium is a proprietary Unicode font designed by Victor Gaultney and copyright SIL International. ... Junicode (short for Junius-Unicode) is a free Unicode font for mediaevalists. ... Lucida Grande is a font available on Mac OS X. Like the similar Microsoft Windows font Lucida Sans Unicode, it supports the most commonly used characters defined in version 2. ... In digital typography, 's Lucida Sans Unicode OpenType font is designed to support the most commonly used characters defined in version 2. ... Tahoma is a sans-serif typeface designed by Matthew Carter for the Microsoft Corporation in 1999. ... The Times New Roman typeface, on top at 88. ... Titus Cyberbit Basic is a Unicode font designed by Bitstream and the TITUS (Thesaurus Indogermanischer Text- und Sprachmaterialien) for Unicode 4. ...

0000-077F

 N  = Numerical digits. This number of characters are included in the font for that range.
Chaeck mark = Most or some portion out of all characters in that range are present in the font.
 X  = No characters are included in the font for that range or Unicode block.
  -  = Data not available now.

Unicode Fonts
Font ──>

Range ─┐
              
Arial Arial Unicode MS Bitstream Cyberbit Cardo Caslon Code2000 Charis SIL Chrysanthi Unicode ClearlyU DejaVu Fonts Doulos SIL Everson Mono Gentium GNU Unifont Junicode Lucida Grande Lucida Sans Unicode Microsoft Sans Serif New Gulim Tahoma Times New Roman TITUS Cyberbit Basic Y.OzFontN
Basic Latin (0000–007F) Chaeck mark 95 Chaeck mark 95 Chaeck mark 95 95 Chaeck mark Chaeck mark 96 95 95 95 Chaeck mark Chaeck mark 98 Chaeck mark Chaeck mark Chaeck mark Chaeck mark Chaeck mark Chaeck mark 95
Latin-1 Supplement (0080–00FF) Chaeck mark 96 Chaeck mark 96 Chaeck mark 96 96 Chaeck mark Chaeck mark 96 96 96 96 Chaeck mark Chaeck mark 96 Chaeck mark Chaeck mark Chaeck mark Chaeck mark Chaeck mark Chaeck mark 96
Latin Extended-A (0100–017F) Chaeck mark 128 Chaeck mark 128 Chaeck mark 128 128 Chaeck mark X 128 128 128 128 Chaeck mark Chaeck mark 128 Chaeck mark Chaeck mark Chaeck mark Chaeck mark Chaeck mark Chaeck mark 128
Latin Extended-B (0180–024F) Chaeck mark 148 Chaeck mark 52 Chaeck mark 194 194 Chaeck mark Chaeck mark 126 194 183 178 Chaeck mark Chaeck mark 183 Chaeck mark Chaeck mark X Chaeck mark Chaeck mark Chaeck mark 28
IPA Extensions (0250–02AF) X 89 Chaeck mark 96 Chaeck mark 96 96 Chaeck mark Chaeck mark 96 96 96 94 Chaeck mark Chaeck mark 96 Chaeck mark Chaeck mark X X X Chaeck mark 55
Spacing Modifier Letters (02B0–02FF) X 57 Chaeck mark 80 Chaeck mark 73 80 Chaeck mark Chaeck mark 41 80 80 56 Chaeck mark Chaeck mark 80 Chaeck mark X Chaeck mark X X Chaeck mark 16
Combining Diacritical Marks (0300–036F) X 72 Chaeck mark 112 Chaeck mark 97 104 Chaeck mark Chaeck mark 66 104 107 82 Chaeck mark Chaeck mark 106 Chaeck mark Chaeck mark X Chaeck mark X Chaeck mark 32
Greek (0370–03FF) Chaeck mark 105 Chaeck mark 124 Chaeck mark 120 14 Chaeck mark Chaeck mark 110 14 118 82 Chaeck mark X Chaeck mark 106 Chaeck mark Chaeck mark Chaeck mark Chaeck mark Chaeck mark 76
Cyrillic (0400–04FF) Chaeck mark 226 Chaeck mark 2 Chaeck mark 246 209 Chaeck mark Chaeck mark 193 209 246 80 Chaeck mark X 244 Chaeck mark Chaeck mark Chaeck mark Chaeck mark Chaeck mark Chaeck mark 66
Cyrillic Supplement (0500–052F) X 0 X X X 16 16 X X X 16 16 1 - 16 X Chaeck mark X X X X Chaeck mark X
Armenian (0530–058F) X 85 X X Chaeck mark 86 X Chaeck mark Chaeck mark 86 X 86 X Chaeck mark X X X X X X X Chaeck mark X
Hebrew (0590–05FF) Chaeck mark 82 Chaeck mark 86 Chaeck mark 82 X Chaeck mark Chaeck mark X X 82 X - X 82 Chaeck mark Chaeck mark X Chaeck mark Chaeck mark Chaeck mark X
Arabic (0600–06FF) Chaeck mark 194 Chaeck mark 10 X 185 X Chaeck mark X X X 3 X Chaeck mark X X X Chaeck mark X Chaeck mark Chaeck mark Chaeck mark X
Syriac (0700–074F) X 0 X X X 50 X X X X X X X - X X X X X X X Chaeck mark X
Arabic Supplement (0750–077F) X 0 X X X X X X X X X X X - X X X X X X X X X

The Latin alphabet, also called the Roman alphabet, is the most widely used alphabetic writing system in the world today. ... ISO 8859-1, more formally cited as ISO/IEC 8859-1 or less formally as Latin-1, is part 1 of ISO/IEC 8859, a standard character encoding defined by ISO. It encodes what it refers to as Latin alphabet no. ... The International Phonetic Alphabet (IPA) is a system of phonetic notation devised by linguists to accurately and uniquely represent each of the wide variety of sounds (phones or phonemes) used in spoken human language. ... A diacritic mark or accent mark is an additional mark added to a basic letter. ... The Cyrillic alphabet (or azbuka, from the old name of the first two letters) is an alphabet used to write six natural Slavic languages (Belarusian, Bulgarian, Macedonian, Russian, Serbian, and Ukrainian) and many other languages of the former Soviet Union, Asia and Eastern Europe. ... This article is mainly about Hebrew letters. ... The Arabic alphabet is the script used for writing in the Arabic language. ... 11th century book in Syriac Serto. ...

0780-139F

              
Range ─┘
Font  ───>
Range ─┐
              
Arial Arial Unicode MS Bitstream Cyberbit Cardo Caslon Code2000 Charis SIL Chrysanthi Unicode ClearlyU DejaVu Fonts Doulos SIL Everson Mono Gentium GNU Unifont Junicode Lucida Grande Lucida Sans Unicode Microsoft Sans Serif New Gulim Tahoma Times New Roman TITUS Cyberbit Basic Y.OzFontN
Thaana (0780–07BF) X 0 X X X 50 X X Chaeck mark X X X X - X X X X X X X Chaeck mark X
Devanagari (0900–097F) X 104 X X X 106 X Chaeck mark Chaeck mark X X X X Chaeck mark X X X X X X X Chaeck mark X
Bengali (0980–09FF) X 89 X X X 91 X Chaeck mark X X X X X - X X X X X X X X X
Gurmukhi (0A00–0A7F) X 75 X X X 77 X X X X X X X - X X X X X X X X X
Gujarati (0A80–0AFF) X 78 X X X 78 X Chaeck mark X X X X X - X X X X X X X X X
Oriya (0B00–0B7F) X 79 X X X 79 X X X X X X X - X X X X X X X X X
Tamil (0B80–0BFF) X 61 X X X 71 X X X X X X X - X X X X X X X X X
Telugu (0C00–0C7F) X 80 X X X 80 X X X X X X X - X X X X X X X X X
Kannada (0C80–0CFF) X 80 X X X 80 X X X X X X X - X X X X X X X X X
Malayalam (0D00–0D7F) X 78 X X X 78 X X X X X X X - X X X X X X X X X
Sinhala (0D80–0DFF) X 0 X X X X X X X X X X X - X X X X X X X X X
Thai (0E00–0E7F) X 87 Chaeck mark X Chaeck mark 87 X X Chaeck mark X X X X Chaeck mark X 87 X Chaeck mark X Chaeck mark X Chaeck mark X
Lao (0E80–0EFF) X 65 X X X 65 X X Chaeck mark X X X X Chaeck mark X X X X X X X X X
Tibetan (0F00–0FFF) X 168 X X X X X Chaeck mark X X X X X Chaeck mark X X X X X X X X X
Myanmar (Burma) (1000–109F) X 0 X X X 78 X X X X X X X - X X X X X X X X X
Georgian (10A0–10FF) X 78 X 1 X 81 X X Chaeck mark X X 80 X Chaeck mark X X X X X X X Chaeck mark X
Hangul Jamo (1100–11FF) X 240 X X X 240 X X X X X X X - X X X X Chaeck mark X X X X
Ethiopic(Ge'ez) (1200–137F) X 0 X X X 356 X X Chaeck mark X X X X Chaeck mark X X X X X X X Chaeck mark X
Ethiopic Supplement (1380–139F) X 0 X X X 26 X X X X X X X - X X X X X X X X X

Thaana is the writing system for the Dhivehi language spoken in the Maldives. ... Rigveda manuscript in Devanagari (early 19th century) Devanāgarī (देवनागरी — in English pronounced ) (ISCII – IS13194:1991) [1] is an abugida alphabet used to write several Indian languages, including Sanskrit, Hindi, Marathi, Kashmiri, Sindhi, Bihari, Bhili, Konkani, Bhojpuri and Nepali from Nepal. ... The Bengali script is an Abugida system of writing belonging to the Brahmic family of scripts whose use is associated with the Bangla, Assamese, Manipuri and Sylheti languages. ... The Gurmukhī (ਗੁਰਮੁਖੀ) script, derived from the Later Sharada script and standardised by Guru Angad Dev in the 16th century, was designed to write the Punjabi language. ... Excerpt from My experiments with truth - the autobiography of Mahatma Gandhi in its original Gujarati script. ... The Oriya script is used to write the Oriya language. ... The Tamil script is an abugida which has 12 vowels and 18 consonants. ... Telugu script, an abugida from the Brahmic family of scripts, is used to write the Telugu language. ... The Kannada script is an abugida of the Brahmic family, primarily to write the Kannada language, one of the Dravidian languages in India. ... The Malayalam script is an abugida of the Brahmic family, used to write the Malayalam language. ... The Sinhala script is used to write the Sinhala language. ... Om Mani Padme Hum, the primary mantra of Tibetan Buddhism written in the Tibetan script, on a rock outside the Potala Palace in Lhasa, Tibet. ... Hangul also refers to a word processing application widely used in Korea. ... The Geez language (or Giiz language) is an ancient language that developed in the Ethiopian Highlands of the Horn of Africa as the language of the peasantry. ...

13A0-1DBF

              
Range ─┘
Font  ───>
Range ─┐
              
Arial Arial Unicode MS Bitstream Cyberbit Cardo Caslon Code2000 Charis SIL Chrysanthi Unicode ClearlyU DejaVu Fonts Doulos SIL Everson Mono Gentium GNU Unifont Junicode Lucida Grande Lucida Sans Unicode Microsoft Sans Serif New Gulim Tahoma Times New Roman TITUS Cyberbit Basic Y.OzFontN
Cherokee (13A0–13FF) X 0 X X X 85 X X Chaeck mark X X 85 X - X X X X X X X X X
Unified Canadian Aboriginal Syllabics (1400–167F) X 0 X X X 630 X X Chaeck mark X X 630 X Chaeck mark X X X X X X X X X
Ogham (1680–169F) X 0 X X Chaeck mark 29 X X Chaeck mark X X 29 X Chaeck mark X X X X X X X Chaeck mark X
Runic (16A0–16FF) X 0 X 81 Chaeck mark 81 X Chaeck mark Chaeck mark X X 81 X Chaeck mark Chaeck mark X X X X X X Chaeck mark X
Tagalog (Baybayin) (1700–171F) X 0 X X X X X X X X X X X - X X X X X X X X X
Hanunoo (1720–173F) X 0 X X X 2 X X X X X X X - X X X X X X X X X
Buhid (1740–175F) X 0 X X X 20 X X X X X X X - X X X X X X X X X
Tagbanwa (1760–177F) X 0 X X X X X X X X X X X - X X X X X X X X X
Khmer (1780–17FF) X 0 X X X 103 X X X X X X X - X X X X X X X X X
Mongolian (1800–18AF) X 0 X X X 155 X X X X X X X - X X X X X X X X X
Limbu (1900–194F) X 0 X X X 66 X X X X X X X - X X X X X X X X X
Tai Le (1950–197F) X 0 X X X X X X X X X X X - X X X X X X X X X
Tai Lue (1980–19DF) X 0 X X X X X X X X X X X - X X X X X X X X X
Khmer Symbols (19E0–19FF) X 0 X X X X X X X X X X X - X X X X X X X X X
Buginese (1A00–1A1F) X 0 X X X 30 X X X X X X X - X X X X X X X X X
Phonetic Extensions (1D00–1D7F) X 0 X 17 X 109 128 X X 36 128 107 X - Chaeck mark 108 X X X X X Chaeck mark X
Phonetic Extensions Supplement (1D80–1DBF) X 0 X X X X 64 X X 37 64 X X - X X X X X X X X X

Cherokee (Cherokee: Tsalagi) is an Iroquoian language spoken by the Cherokee people. ... Canadian aboriginal syllabic writing (often syllabics for short) is a family of writing schemes which are used to write a number of aboriginal Canadian languages from the Algonquian, Athabaskan and Inuit language families. ... Ogham (Old Irish Ogam) was an alphabet used primarily to represent Gaelic languages. ... Technical note: Due to technical limitations, some web browsers may not display some special characters in this article. ... Baybayin (sometimes called Alibata) is a pre-Hispanic Tagalog writing system that originated from the Javanese script Kavi. ... One of the indigenous scripts of the Philippines; see Baybayin. ... Buhid (ᝊᝓᝑᝒ), or Mangyan, is an indigenous Brahmic script of the Philippines, and is used today by the Mindoro people to write Tagalog, the national language of the Philippines. ... Tagbawna is one of the indigenous writing systems of the Philippines. ... The quick brown fox jumps over the lazy dog translated into Khmer. ... The Limbu (meaning: archer) are an ethnic group that belong to the Kiranti group or Kirat confederation that includes the Rai and Sunuwar. ... Tai Le is the name of a language and the script used to write that language in parts of the Yunnan Province of China, Myanmar (Burma), Laos, and possibly other countries. ... Tai Lue (or Tai Lü, Tai Le; tai51 lɯ11; Xishuangbanna Dai; Chinese: 傣仂语 Dǎilèyǔ) is one of the languages spoken by the Dai people in China. ... This article or section uses Khmer characters which may be rendered as boxes or other nonsensical symbols. ... Buginese (locally Basa Ugi, elsewhere also Bahasa Bugis, Bugis, Bugi, De) is the language spoken by about four million people mainly in the southern part of Sulawesi, Indonesia. ...

1DC0-257F

              
Range ─┘
Font  ───>
Range ─┐
              
Arial Arial Unicode MS Bitstream Cyberbit Cardo Caslon Code2000 Charis SIL Chrysanthi Unicode ClearlyU DejaVu Fonts Doulos SIL Everson Mono Gentium GNU Unifont Junicode Lucida Grande Lucida Sans Unicode Microsoft Sans Serif New Gulim Tahoma Times New Roman TITUS Cyberbit Basic Y.OzFontN
Combining Diacritical Marks Supplement (1DC0–1DFF) X 0 X 2 X X 1 X X X 1 X X - X X X X X X X X X
Latin Extended Additional (1E00–1EFF) Chaeck mark 246 Chaeck mark 88 Chaeck mark 246 246 Chaeck mark X 51 246 246 246 Chaeck mark X 246 X Chaeck mark X Chaeck mark Chaeck mark Chaeck mark 8
Greek Extended (1F00–1FFF) X 233 X 233 Chaeck mark 233 X Chaeck mark X 233 X 233 233 Chaeck mark X 233 X Chaeck mark X Chaeck mark X Chaeck mark 4
General Punctuation (2000–206F) Chaeck mark 63 Chaeck mark 65 Chaeck mark 105 73 Chaeck mark X 32 73 97 39 Chaeck mark Chaeck mark 87 Chaeck mark Chaeck mark Chaeck mark Chaeck mark Chaeck mark Chaeck mark 62
Superscripts and Subscripts (2070–209F) X 28 Chaeck mark 9 Chaeck mark 29 34 Chaeck mark X 8 34 29 28 Chaeck mark Chaeck mark 29 Chaeck mark X X X X Chaeck mark 29
Currency Symbols (20A0–20CF) Chaeck mark 13 Chaeck mark 6 Chaeck mark 22 22 Chaeck mark X 3 22 18 14 Chaeck mark X 18 Chaeck mark Chaeck mark X Chaeck mark Chaeck mark Chaeck mark 18
Combining Diacritical Marks for Symbols (20D0–20FF) X 18 Chaeck mark X Chaeck mark 27 X X X X X 27 X Chaeck mark X 2 X X X X X X 27
Letterlike Symbols (2100–214F) X 57 Chaeck mark 13 Chaeck mark 75 2 Chaeck mark Chaeck mark 8 2 74 1 Chaeck mark X 32 Chaeck mark X Chaeck mark X X Chaeck mark 75
Number Forms (2150–218F) X 48 Chaeck mark 4 Chaeck mark 49 49 Chaeck mark Chaeck mark 34 49 49 X Chaeck mark Chaeck mark 49 X X Chaeck mark X Chaeck mark 49
Arrows (2190–21FF) X 91 Chaeck mark 14 Chaeck mark 112 19 Chaeck mark Chaeck mark X 19 112 X Chaeck mark X 20 Chaeck mark X Chaeck mark X X Chaeck mark 112
Mathematical Operators (2200–22FF) Chaeck mark 242 Chaeck mark 24 Chaeck mark 246 17 Chaeck mark X 34 17 256 2 Chaeck mark Chaeck mark 18 Chaeck mark Chaeck mark Chaeck mark Chaeck mark Chaeck mark Chaeck mark 256
Miscellaneous Technical (2300–23FF) X 123 Chaeck mark 36 Chaeck mark 209 2 X X 13 2 207 X Chaeck mark X 14 X X X X X X 209
Control Pictures (2400–243F) X 37 Chaeck mark X Chaeck mark 39 X X Chaeck mark 2 X 39 X Chaeck mark X 1 Chaeck mark X X X X X 4
Optical Character Recognition (2440–245F) X 11 Chaeck mark X X 11 X X X X X 11 X Chaeck mark X X X X X X X X 11
Enclosed Alphanumerics (2460–24FF) X 139 Chaeck mark X Chaeck mark 160 X X X X X 159 X Chaeck mark Chaeck mark 1 X X Chaeck mark X X Chaeck mark 160
Box Drawing (2500–257F) Chaeck mark 128 Chaeck mark 1 Chaeck mark 128 X Chaeck mark X X X 128 X Chaeck mark X X Chaeck mark X Chaeck mark X Chaeck mark Chaeck mark 128

The term punctuation has two different linguistic meanings: in general, the act and the effect of punctuating, i. ... This article needs to be cleaned up to conform to a higher standard of quality. ... Graphic symbols are often used as a shorthand for currency names. ... An arrow is a graphical symbol like →, ←, used to point or indicate direction, being in its simplest form a line segment with a triangle affixed to one end, and in more complex forms a representation of an actual arrow. ... This article is about operators in mathematics, for other kinds of operators see operator (disambiguation). ... Optical character recognition, usually abbreviated to OCR, involves computer software designed to translate images of typewritten text (usually captured by a scanner) into machine-editable text, or to translate pictures of characters into a standard encoding scheme representing them in (ASCII or Unicode). ... Box drawing characters are widely used in text user interfaces to draw various frames and boxes. ...

2580-2DDF

              
Range ─┘
Font  ───>
Range ─┐
              
Arial Arial Unicode MS Bitstream Cyberbit Cardo Caslon Code2000 Charis SIL Chrysanthi Unicode ClearlyU DejaVu Fonts Doulos SIL Everson Mono Gentium GNU Unifont Junicode Lucida Grande Lucida Sans Unicode Microsoft Sans Serif New Gulim Tahoma Times New Roman TITUS Cyberbit Basic Y.OzFontN
Block Elements (2580–259F) X 22 Chaeck mark X Chaeck mark 32 X Chaeck mark X 32 X 32 X Chaeck mark X X Chaeck mark X X X X Chaeck mark 32
Geometric Shapes (25A0–25FF) Chaeck mark 80 Chaeck mark 8 Chaeck mark 96 2 Chaeck mark Chaeck mark 96 2 96 X Chaeck mark X 19 Chaeck mark X Chaeck mark X Chaeck mark Chaeck mark 96
Miscellaneous Symbols (2600–26FF) Chaeck mark 106 Chaeck mark 31 Chaeck mark 146 X Chaeck mark X 7 X 125 X Chaeck mark X X X X Chaeck mark X Chaeck mark Chaeck mark 146
Dingbats (2700–27BF) X 160 X 6 X 174 2 Chaeck mark X X 2 160 X Chaeck mark Chaeck mark 3 X X X X X X 174
Miscellaneous Mathematical Symbols-A (27C0–27EF) X 0 X 9 X 35 2 X X 5 2 X X - X X X X X X X X 28
Supplemental Arrows-A (27F0–27FF) X 0 X 2 X 16 X X X X X X X - X X X X X X X X 16
Braille Patterns (2800–28FF) X 0 X X X 256 X X Chaeck mark X X X X Chaeck mark X X X X X X X X 256
Supplemental Arrows-B (2900–297F) X 0 X 6 X 128 X X X X X 111 X - X X X X X X X X 128
Miscellaneous Mathematical Symbols-B (2980–29FF) X 0 X 2 X 128 X X X 1 X 62 X - X 1 X X X X X X 128
Supplemental Mathematical Operators (2A00–2AFF) X 0 X X X 256 X X X 3 X 21 X - X X X X X X X X 256
Miscellaneous Symbols and Arrows (2B00–2BFF) X 0 X X X 20 X X X 2 X X X - X X X X X X X X 14
Glagolitic (2C00–2C5F) X 0 X X X X X X X X X X X - X X X X X X X X X
Coptic (2C80–2CFF) X 0 X X X X X X X X X X X - X X X X X X X X X
Georgian Supplement (2D00–2D2F) X 0 X X X X X X X X X X X - X X X X X X X X X
Tifinagh (2D30–2D7F) X 0 X X X 55 X X X X X X X - X X X X X X X X X
Ethiopic Extended (2D80–2DDF) X 0 X X X 79 X X X X X X X - X X X X X X X X X

The Miscellaneous Symbol plane of Unicode (2600–26FF) contains various glyphs representing things from a variety of categories: Astrological, Astronomical, Chess, Dice, Ideological symbols, Musical notation, Political symbols, Recycling, Religious symbols, Trigrams, Warning Signs and Weather. ... A dingbat is an ornament or spacer used in typesetting, sometimes more formally known as a printers ornament. The term supposedly originated as onomatopoeia in old style metal-type print shops, where extra space around text or illustrations would be filled by dinging an ornament into the space... PREMIER - first The information about the historic site of Safdarjung’s tomb in Delhi, India. ... Tablet inscribed with the Glagolitic alphabet The Glagolitic alphabet or Glagolitsa is the oldest known Slavonic alphabet. ... The Coptic alphabet is an alphabet used for writing the Coptic language. ... This article needs to be cleaned up to conform to a higher standard of quality. ...

2E00-4DBF

              
Range ─┘
Font  ───>
Range ─┐
              
Arial Arial Unicode MS Bitstream Cyberbit Cardo Caslon Code2000 Charis SIL Chrysanthi Unicode ClearlyU DejaVu Fonts Doulos SIL Everson Mono Gentium GNU Unifont Junicode Lucida Grande Lucida Sans Unicode Microsoft Sans Serif New Gulim Tahoma Times New Roman TITUS Cyberbit Basic Y.OzFontN
Supplemental Punctuation (2E00–2E7F) X 0 X 24 X 26 X X X X X X X - X X X X X X X X X
CJK Radicals Supplement (2E80–2EFF) X 0 X X X 115 X X X X X X X - X X X X X X X X 128
Kangxi Radicals(Kangxi) (2F00–2FDF) X 0 X X X 214 X X X X X X X - X X X X X X X X 214
Ideographic Description Characters (2FF0–2FFF) X 0 X X X 12 X X X X X X X Chaeck mark X X X X X X X X 12
CJK Symbols and Punctuation (3000–303F) X 57 Chaeck mark 12 X 64 X Chaeck mark X X X X X - X X X X Chaeck mark X X Chaeck mark 45
Hiragana (3040–309F) X 90 Chaeck mark X Chaeck mark 90 X Chaeck mark X X X 90 X Chaeck mark X X X X Chaeck mark X X Chaeck mark 93
Katakana (30A0–30FF) X 94 Chaeck mark X Chaeck mark 94 X Chaeck mark X X X 94 X Chaeck mark X X X X Chaeck mark X X Chaeck mark 96
Bopomofo (3100–312F) X 40 Chaeck mark X X 40 X Chaeck mark X X X X X Chaeck mark X X X X X X X Chaeck mark X
Hangul Compatibility Jamo (3130–318F) X 94 Chaeck mark X X 94 X X X X X X X Chaeck mark X X X X Chaeck mark X X X 1
Kanbun (3190–319F) X 16 Chaeck mark X X 16 X Chaeck mark X X X X X - X X X X X X X X 16
Bopomofo Extended (31A0–31BF) X 0 X X X 24 X X X X X X X - X X X X X X X X X
CJK Strokes (31C0–31EF) X 0 X X X 16 X X X X X X X - X X X X X X X X X
Katakana Phonetic Extensions (31F0–31FF) X 0 X X X X X X X X X X X - X X X X X X X X 16
Enclosed CJK Letters and Months (3200–32FF) X 202 Chaeck mark X X 232 X X X X X X X - X X X X Chaeck mark X X X 174
CJK Compatibility (3300–33FF) X 249 Chaeck mark X X 161 X Chaeck mark X X X X X - X X X X Chaeck mark X X X 85
CJK Unified Ideographs Extension A (3400–4DBF) X 20,902 X X X 6,582 X X X X X X X Chaeck mark X X X X Chaeck mark X X X 176

The left part of mā, a Chinese character meaning mother, is a radical that means woman A radical (from Latin radix, meaning root) is a basic identifiable component of every Chinese character. ... The left part of mā, a Chinese character meaning mother, is a radical that means woman A radical (from Latin radix, meaning root) is a basic identifiable component of every Chinese character. ... This article needs cleanup, so as to conform to a higher standard. ... Japanese writing Kanji 漢字 Kana 仮名 Hiragana 平仮名 Katakana 片仮名 Uses Furigana 振り仮名 Okurigana 送り仮名 Rōmaji ローマ字 Hiragana ) are a Japanese syllabary, one of the four Japanese writing systems, along with katakana, kanji and rōmaji (i. ... Japanese writing Kanji 漢字 Kana 仮名 Hiragana 平仮名 Katakana 片仮名 Uses Furigana 振り仮名 Okurigana 送り仮名 Rōmaji ローマ字 Katakana (片仮名) are a Japanese syllabary, one of the four Japanese writing systems. ... Zh yīn F o (注音符號), or Symbols for Annotating Sounds, often abbreviated as Zhuyin, or known as Bopomofo (ㄅㄆㄇㄈ) for the first four syllables of these Chinese phonetic symbols, is the national phonetic system of the Republic of China (based on Taiwan) for teaching the Chinese languages, especially Standard Mandarin... Example of Kaeriten Kanbun (漢文, literally Han writing) is Chinese written for a Japanese audience. ... Zh yīn F o (注音符號), or Symbols for Annotating Sounds, often abbreviated as Zhuyin, or known as Bopomofo (ㄅㄆㄇㄈ) for the first four syllables of these Chinese phonetic symbols, is the national phonetic system of the Republic of China (based on Taiwan) for teaching the Chinese languages, especially Standard Mandarin... Stroke order refers to the way of writing Chinese characters. ...

4DC0-FE2F

              
Range ─┘
Font  ───>
Range ─┐
              
Arial Arial Unicode MS Bitstream Cyberbit Cardo Caslon Code2000 Charis SIL Chrysanthi Unicode ClearlyU DejaVu Fonts Doulos SIL Everson Mono Gentium GNU Unifont Junicode Lucida Grande Lucida Sans Unicode Microsoft Sans Serif New Gulim Tahoma Times New Roman TITUS Cyberbit Basic Y.OzFontN
Yijing Hexagram Symbols (4DC0–4DFF) X 64 X X X 64 X X X X X X X - X X X X X X X X X
CJK Unified Ideographs (Han Unification) (4E00–9FFF) X 20,924 Chaeck mark X X 20,924 X Chaeck mark X X X X X Chaeck mark X X X X Chaeck mark X X X 9,829
Yi Syllables (A000–A48F) X 0 X X X 1,165 X X X X X X X - X X X X X X X X X
Yi Radicals (A490–A4CF) X 0 X X X 55 X X X X X X X - X X X X X X X X X
Modifier Tone Letters (A700–A71F) X 0 X X X X 23 X X X 23 X X - X X X X X X X X X
Syloti Nagri (A800–A82F) X 0 X X X X X X X X X X X - X X X X X X X X X
Hangul Syllables (AC00–D7AF) X 11,172 Chaeck mark X X 11,172 X X X X X X X Chaeck mark X X X X Chaeck mark X X X X
High Surrogates (D800–DB7F) X 0 X X X X X X X X X X X - X X X X X X X X X
High Private Use Surrogates (DB80–DBFF) X 0 X X X X X X X X X X X - X X X X X X X X X
Low Surrogates (DC00–DFFF) X 0 X X X X X X X X X X X - X X X X X X X X X
Private Use Area (E000–F8FF) X 0 Chaeck mark X X X X X X X X X X - X X X X X X X Chaeck mark X
CJK Compatibility Ideographs (F900–FAFF) X 302 Chaeck mark X X 421 X X X X X X X Chaeck mark X X X X Chaeck mark X X X 98
Alphabetic Presentation Forms (FB00–FB4F) Chaeck mark 57 Chaeck mark 53 Chaeck mark 58 5 Chaeck mark X 10 5 58 5 Chaeck mark X 32 X Chaeck mark X Chaeck mark Chaeck mark Chaeck mark 2
Arabic Presentation Forms-A (FB50–FDFF) Chaeck mark 593 Chaeck mark X X 155 X Chaeck mark X X X X X Chaeck mark X X X Chaeck mark X Chaeck mark Chaeck mark Chaeck mark 25
Variation Selectors (FE00–FE0F) X 0 X X X 16 16 X X X 16 1 X - X X X X X X X X X
Vertical Forms (FE10–FE1F) X 0 X X X 0 0 X X X 0 0 X - X X X X X X X X X
Combining Half Marks (FE20–FE2F) X 4 X X X 4 X X X X X 4 X Chaeck mark X X X X X X X X X

Alternative meaning: I Ching (monk) The I Ching (Simplified Chinese: 易经; Traditional Chinese: 易經, Hanyu Pinyin: Yì Jīng; Cantonese IPA: jɪk6gɪŋ1; Cantonese Jyutping: jik6ging1; alternative romanizations include I Jing, Yi Ching, Yi King) is the oldest of the Chinese classic texts. ... Han unification is the process used by the authors of Unicode and the Universal Character Set to map multiple character sets of the CJK languages into a single set of unified characters. ... The Yi people (own name in the Cool Mountain dialect: ꆈꌠ, official transcription: Nuosu, IPA: [nɔ̄sÅ«]; Chinese: 彝族, Pinyin: Yìzú; the older name Lolo is now considered derogatory in China, though used officially in Vietnam as Lô Lô) are a modern ethnic group in China and Vietnam. ... Hangul also refers to a word processing application widely used in Korea. ... Unicode reserves 1,114,112 (= 220 + 216) code points, and currently assigns characters to more than 96,000 of those code points. ... A Chinese character. ...

FE30-FFFF

              
Range ─┘
Font  ───>
Range ─┐
              
Arial Arial Unicode MS Bitstream Cyberbit Cardo Caslon Code2000 Charis SIL Chrysanthi Unicode ClearlyU DejaVu Fonts Doulos SIL Everson Mono Gentium GNU Unifont Junicode Lucida Grande Lucida Sans Unicode Microsoft Sans Serif New Gulim Tahoma Times New Roman TITUS Cyberbit Basic Y.OzFontN
CJK Compatibility Forms (FE30–FE4F) X 28 X X X 28 X X X X X X X Chaeck mark X X X X X X X Chaeck mark 32
Small Form Variants (FE50–FE6F) X 26 Chaeck mark X X 26 X X X X X X X Chaeck mark X X X X X X X Chaeck mark X
Arabic Presentation Forms-B (FE70–FEFF) Chaeck mark 139 Chaeck mark X X 140 1 Chaeck mark X X 1 X X Chaeck mark X 1 X X X Chaeck mark Chaeck mark Chaeck mark X
Halfwidth and Fullwidth Forms (FF00–FFEF) X 223 Chaeck mark X Chaeck mark 124 X X X X X X X Chaeck mark X X X X Chaeck mark X X Chaeck mark 172
Specials (FFF0–FFFF) X 2 X 1 X 3 5 X X 1 5 5 X - X 1 X X X X X X X
           
└─ BMP

Specials album cover The Specials were a British band formed in 1977 in Coventry (see 1977 in music). ... Unicode reserves 1,114,112 (= 220 + 216) code points, and currently assigns characters to more than 96,000 of those code points. ...

10000-1D7FF

┌─ SMP 
           
              
Range ─┘
Font  ───>
Range ─┐
              
Arial Arial Unicode MS Bitstream Cyberbit Cardo Caslon Code2000 Charis SIL Chrysanthi Unicode ClearlyU DejaVu Fonts Doulos SIL Everson Mono Gentium GNU Unifont Junicode Lucida Grande Lucida Sans Unicode Microsoft Sans Serif New Gulim Tahoma Times New Roman TITUS Cyberbit Basic Y.OzFontN
Linear B Syllabary (10000–1007F) X X X 0 X X X X X X X X X - X X X X X X X X X
Linear B Ideograms (10080–100FF) X X X 2 X X X X X X X X X - X X X X X X X X X
Aegean Numbers (10100–1013F) X X X 2 X X X X X X X X X - X X X X X X X X X
Ancient Greek Numbers (10140–1018F) X X X 75 X X X X X X X X X - X X X X X X X X X
Old Italic (10300–1032F) X X X 35 X X X X X X X X X - X X X X X X X X X
Gothic (10330–1034F) X X X 27 X X X X X X X X X - X X X X X X X X X
Ugaritic (10380–1039F) X X X 0 X X X X X X X X X - X X X X X X X X X
Old Persian (103A0–103DF) X X X 0 X X X X X X X X X - X X X X X X X X X
Deseret (10400–1044F) X X X 0 X X X X X X X X X - X X X X X X X X X
Shavian (10450–1047F) X X X 0 X X X X X X X X X - X X X X X X X X X
Osmanya (10480–104AF) X X X 0 X X X X X X X X X - X X X X X X X X X
Cypriot Syllabary (10800–1083F) X X X 0 X X X X X X X X X - X X X X X X X X X
Kharoshthi (10A00–10A5F) X X X 0 X X X X X X X X X - X X X X X X X X X
Byzantine Musical Symbols (1D000–1D0FF) X X X 0 X X X X X X X X X - X X X X X X X X X
Musical Symbols (1D100–1D1FF) X X X 0 X X X X X X X X X - X X X X X X X X X
Byzantine Musical Symbols (1D000–1D0FF) X X X 0 X X X X X X X X X - X X X X X X X X X
Ancient Greek Musical Notation (1D200–1D24F) X X X 70 X X X X X X X X X - X X X X X X X X X
Tai Xuan Jing Symbols (1D300–1D35F) X X X 0 X X X X X X X X X - X X X X X X X X X
Mathematical Alphanumeric Symbols (1D400–1D7FF) X X X 13 X X 2 X X X 2 X X - X X X X X X X X X
CJK Unified Ideographs Extension B X X X X X X X X X X X X X - X X X X X X X X 339
              
Range ─┘

Font  ──>
Arial Arial Unicode MS Bitstream Cyberbit Cardo Caslon Code2000 Charis SIL Chrysanthi Unicode ClearlyU DejaVu Fonts Doulos SIL Everson Mono Gentium GNU Unifont Junicode Lucida Grande Lucida Sans Unicode Microsoft Sans Serif New Gulim Tahoma Times New Roman TITUS Cyberbit Basic Y.OzFontN
Unicode Fonts

Unicode reserves 1,114,112 (= 220 + 216) code points, and currently assigns characters to more than 96,000 of those code points. ... Linear B script sample Linear B is a script that was used for writing Mycenaean, an early form of Greek. ... Besides its original meaning, of or relating to the Goths (Gothos, Getas), a Germanic tribe and thus the Gothic language and the Gothic alphabet, the word Gothic has been used to refer to distinctly different things: From a Renaissance perspective (originally Italian, gotico, with connotations of rough, barbarous), it conveyed... The Ugaritic language is known to us only in the form of writings found in the lost city of Ugarit since its discovery by French archaeologists in 1928. ... See Aryan Language or Old Persian For more information visit: *[Ancient Iranian Languages & Literature The Circle of Ancient Iranian Studies (CAIS) ... To meet Wikipedias quality standards, this article or section may require cleanup. ... Posthumously funded by and named for Irish playwright George Bernard Shaw, the Shavian alphabet (also known as Shaw alphabet) was conceived as a way to provide simple, phonetic orthography for the English language to replace the difficulties of the conventional spelling. ... An invented script for the Somali language, now replaced both officially and in practise by the Latin alphabet. ... The Kharoṣṭhī script, also known as the Gāndhārī script, is an ancient alphabetic script used by the Gandhara culture of historic northwest India to write the Gandhari and Sanskrit languages (the Gandhara kingdom was located along the present-day border between Afghanistan and Pakistan between the Indus River and the... Mathematical alphanumeric symbols are modifications of Latin and Greek letters and decimal digits that enable mathematicians to denote different notions with different letter styles (one example is blackboard bold, or double-struck (in Unicode terminology)). Unicode now includes many such symbols (in the range U+1D400 . ...

See also

The Unicode BMP Fallback Font is a Unicode font containing a glyph for every character in the Basic multilingual plane. ... This page compares Unicode encodings. ... This is a list of typefaces. ... Typographic units are the units of measurement used in typography or typesetting. ... A type foundry is a company that produces and/or distributes typefaces. ... An excerpt of HTML code with syntax highlighting In computing, HyperText Markup Language (HTML) is a markup language designed for the creation of web pages with hypertext and other information to be displayed in a web browser. ... Calligraphy in a Latin Bible of AD 1407 on display in Malmesbury Abbey, Wiltshire, England. ... Font-management programs are a class of computer utility software that allows the user to activate or deactive fonts on a computer. ... In computing, Unicode is the international standard whose goal is to provide the means to encode the text of every document people want to store in computers. ... The term Alt codes is used to refer to a number of Unicode input methods that allow characters to be entered by typing a characters code point in concert with the Alt key. ... An IME for inputting Japanese characters in Mac OS 9 An input method editor (IME) is a program or operating system component that allows computer users to enter complex characters and symbols (such as Chinese, Japanese, Korean and Tibetan characters), using a standard Western keyboard. ... Since the Chinese language uses a logographic script—that is a script where one or two character corresponds roughly to one word or meaning—there are vastly more characters, or glyphs, than there are keys on a standard computer keyboard. ... Japanese input methods are the methods used to input Japanese characters on a computer. ... This is intended as a non-exhaustive list of input methods for UNIX platforms. ... Computers and other typing devices offer many different keyboard layouts, for people to be able to input data in different languages. ... Code page is the traditional IBM term used for a specific character encoding table: a mapping in which a sequence of bits, usually a single octet representing integer values 0 through 255, is associated with a specific character. ...

References

  1. ^ Ken Lunde, CJKV Information Processing, O'Reilly Inc, 1999. Page 128, "CJKV character form differences"
  2. ^ Version info for Everson Mono font in Macromedia Fontographer v4.1.3 2003-02-13.

External links

  • ISO/IEC JTC1/SC2/WG2, the working group in charge of ISO 10646
  • Fonts and Keyboards at Unicode.org
  • Unicode Font Guide For Free/Libre Open Source Operating Systems - a huge index of high quality free fonts
  • Alan Wood's Unicode Resources


 

COMMENTARY     


Share your thoughts, questions and commentary here
Your name
Your comments
Please enter the 5-letter protection code

Want to know more?
Search encyclopedia, statistics and forums:

 


Lesson Plans | Student Area | Student FAQ | Reviews | Press Releases |  Feeds | Contact
The Wikipedia article included on this page is licensed under the GFDL.
Images may be subject to relevant owners' copyright.
All other elements are (c) copyright NationMaster.com 2003-5. All Rights Reserved.
Usage implies agreement with terms.