Latin 1

ISO 8859-1, more formally cited as ISO/IEC 8859-1 or less formally as Latin-1, is part 1 of ISO/IEC 8859, a standard character encoding defined by ISO. It encodes what it refers to as Latin alphabet no. 1, consisting of 191 characters from the Latin script, each encoded as a single 8-bit code value. These code values can be used in almost any data interchange system to communicate in the following European languages (although many of them lack their correct quotation marks and apostrophe): Albanian, Basque, Catalan, Danish, Dutch (missing "IJ", "ij"), English, Faroese, Finnish (missing "š", "ž"), French (missing "œ", "Œ", and the rarely used "Ÿ"), German, Icelandic (missing „ and “), Irish, Italian, Norwegian, Portuguese, Rhaeto-Romanic, Scottish, Spanish, Swedish. Other languages covered include Afrikaans and Swahili. As a result, this character encoding is used throughout the Americas, Western Europe, Oceania, and much of Africa.

ISO/IEC 8859-1

ISO/IEC 8859-1 has a number of deficiencies: it omits a few French letters, a single-glyph representation of the Dutch IJ, and two Finnish letters used for transcribing some foreign names and in a few loanwords, and it lacks common glyphs such as the dagger †, typographic quotation marks, and dashes. The euro sign is not encoded either. For these reasons, ISO/IEC 8859-15 was developed as an update of ISO/IEC 8859-1 that adds the euro sign and other needed characters. (Making room for them required removing some less-used characters from ISO/IEC 8859-1, including fraction symbols and letter-free diacritics: ¤, ¦, ¨, ´, ¸, ¼, ½, and ¾.)

Since all 191 characters encoded by ISO/IEC 8859-1 are graphic and supported by most web browsers, they can be shown as glyphs in the following table. Because they would not normally be visible, the space character, the no-break space character, and the soft hyphen character are represented by abbreviations for their names (SP, NBSP, and SHY). All other characters are represented literally. The row and column headings give the hexadecimal digits that combine into the 8-bit code value; e.g., the letter L is at code point 4C (hex), or binary 01001100.

ISO/IEC 8859-1
x0 x1 x2 x3 x4 x5 x6 x7 x8 x9 xA xB xC xD xE xF
0x unused
1x unused
2x SP ! " # $ % & ' ( ) * + , - . /
3x 0 1 2 3 4 5 6 7 8 9 : ; < = > ?
4x @ A B C D E F G H I J K L M N O
5x P Q R S T U V W X Y Z [ \ ] ^ _
6x ` a b c d e f g h i j k l m n o
7x p q r s t u v w x y z { | } ~
8x unused
9x unused
Ax NBSP ¡ ¢ £ ¤ ¥ ¦ § ¨ © ª « ¬ SHY ® ¯
Bx ° ± ² ³ ´ µ ¶ · ¸ ¹ º » ¼ ½ ¾ ¿
Cx À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï
Dx Ð Ñ Ò Ó Ô Õ Ö × Ø Ù Ú Û Ü Ý Þ ß
Ex à á â ã ä å æ ç è é ê ë ì í î ï
Fx ð ñ ò ó ô õ ö ÷ ø ù ú û ü ý þ ÿ

Code values 00-1F, 7F, and 80-9F are not assigned to characters by ISO/IEC 8859-1.
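
The single-byte mapping described above, and the eight positions that ISO/IEC 8859-15 reassigns, can be checked with a short sketch. It assumes Python's built-in codecs, whose names "latin-1" and "iso8859_15" are Python's own labels rather than anything defined by the standard.

```python
# Sketch only: relies on Python's bundled codec tables for both encodings.

# Each character occupies exactly one 8-bit code value.
code = "L".encode("latin-1")[0]
hex_form = format(code, "02X")   # hexadecimal digits: row 4x, column xC
bin_form = format(code, "08b")   # the same value as 8 binary digits

# Positions in A0-FF where ISO/IEC 8859-15 assigns a different character.
changed = [
    b for b in range(0xA0, 0x100)
    if bytes([b]).decode("latin-1") != bytes([b]).decode("iso8859_15")
]
removed = "".join(chr(b) for b in changed)  # characters dropped from 8859-1
added = "".join(bytes([b]).decode("iso8859_15") for b in changed)

print(hex_form, bin_form)
print([format(b, "02X") for b in changed])
print(removed, "->", added)
```

The list comprehension compares the two code pages one byte at a time, so it makes no assumption about which positions differ.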


ISO 8859-1 vs ISO-8859-1

The IANA has approved ISO-8859-1 (note the extra hyphen), a superset of ISO/IEC 8859-1, for use on the Internet. This character map (also called a character set or code page) supplements the assignments made by ISO/IEC 8859-1 by mapping control characters to code values 00-1F, 7F, and 80-9F. It thus assigns a character to every possible 8-bit value, 256 in all. The IANA allows all of the following aliases for ISO-8859-1, which are matched case-insensitively:

  • ISO_8859-1:1987
  • ISO_8859-1
  • ISO-8859-1
  • iso-ir-100
  • csISOLatin1
  • latin1
  • l1
  • IBM819
  • CP819
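
As an illustration of alias handling, the sketch below asks Python's codec registry to resolve each name in the list. Python's alias table is its own and is only assumed here to cover the IANA names; the helper `resolve` is hypothetical, and an alias a given Python version does not know is reported as None rather than treated as an error.

```python
import codecs

# The IANA aliases listed above.
IANA_ALIASES = [
    "ISO_8859-1:1987", "ISO_8859-1", "ISO-8859-1", "iso-ir-100",
    "csISOLatin1", "latin1", "l1", "IBM819", "CP819",
]

def resolve(name):
    """Return Python's canonical codec name, or None if the alias is unknown."""
    try:
        return codecs.lookup(name).name
    except LookupError:
        return None

resolved = {alias: resolve(alias) for alias in IANA_ALIASES}
# Lookups are case-insensitive, so lowercased aliases should resolve the same way.
lowercased = {alias: resolve(alias.lower()) for alias in IANA_ALIASES}

for alias, name in resolved.items():
    print(f"{alias:16} -> {name}")
```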

The name Latin-1 is an informal alias that neither ISO nor the IANA recognizes, although some computer software accepts it. The following table shows the ISO-8859-1 character map. Control characters, the space character, the no-break space character, and the soft hyphen character are represented by 2-, 3-, or 4-letter abbreviations for their names. All other characters are represented literally.

ISO-8859-1
x0 x1 x2 x3 x4 x5 x6 x7 x8 x9 xA xB xC xD xE xF
0x NUL SOH STX ETX EOT ENQ ACK BEL BS TAB LF VT FF CR SO SI
1x DLE DC1 DC2 DC3 DC4 NAK SYN ETB CAN EM SUB ESC FS GS RS US
2x SP ! " # $ % & ' ( ) * + , - . /
3x 0 1 2 3 4 5 6 7 8 9 : ; < = > ?
4x @ A B C D E F G H I J K L M N O
5x P Q R S T U V W X Y Z [ \ ] ^ _
6x ` a b c d e f g h i j k l m n o
7x p q r s t u v w x y z { | } ~ DEL
8x PAD HOP BPH NBH IND NEL SSA ESA HTS HTJ VTS PLD PLU RI SS2 SS3
9x DCS PU1 PU2 STS CCH MW SPA EPA SOS SGCI SCI CSI ST OSC PM APC
Ax NBSP ¡ ¢ £ ¤ ¥ ¦ § ¨ © ª « ¬ SHY ® ¯
Bx ° ± ² ³ ´ µ ¶ · ¸ ¹ º » ¼ ½ ¾ ¿
Cx À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï
Dx Ð Ñ Ò Ó Ô Õ Ö × Ø Ù Ú Û Ü Ý Þ ß
Ex à á â ã ä å æ ç è é ê ë ì í î ï
Fx ð ñ ò ó ô õ ö ÷ ø ù ú û ü ý þ ÿ

There are additional parts to the ISO/IEC 8859 standard that have corresponding IANA-approved character maps, e.g. ISO/IEC 8859-10 (Latin alphabet no. 6) is very similar to character map ISO-8859-10. Each of the ISO/IEC 8859-x parts encodes characters in the same way: it covers the ASCII range (hex 20-7E, 95 characters) plus up to 96 additional characters in the A0-FF range, for a total of at most 191 characters (some parts leave a few positions unassigned). The ISO-8859-x maps each add the ISO 646 C0 "control" characters from 00-1F, a control character at 7F, and control characters in the 80-9F range, thus encompassing up to 256 characters. ISO-8859-1 is unique among these maps in that its coded characters are equivalent to the first 256 code points of Unicode. ISO-8859-1 is the standard encoding used by the X Window System on most Unix machines.
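
The arithmetic behind the 191 and 256 figures, and the correspondence with Unicode, can be sketched in a few lines. The example assumes Python's "latin-1" codec, which implements the full 256-value ISO-8859-1 map including the control characters.

```python
# Graphic repertoire of an ISO/IEC 8859-x part: 20-7E plus A0-FF.
ascii_graphic = range(0x20, 0x7F)    # 95 code values
upper_graphic = range(0xA0, 0x100)   # 96 code values
graphic_total = len(ascii_graphic) + len(upper_graphic)

# ISO-8859-1 with controls covers every byte value; each byte decodes to the
# Unicode code point with the same number.
all_bytes = bytes(range(256))
decoded = all_bytes.decode("latin-1")
matches_unicode = all(ord(ch) == b for ch, b in zip(decoded, all_bytes))

print(graphic_total, len(decoded), matches_unicode)  # 191 256 True
```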


Windows-1252

The legacy components of Microsoft Windows use, by default, an encoding that is a superset of ISO/IEC 8859-1 but differs from ISO-8859-1 by using displayable characters rather than control characters in the 80-9F range. Windows calls it ANSI generically, but the specific code page depends on the market in which the operating system was sold; in the US and Western European markets it is CP1252, with the IANA-approved name Windows-1252. The following table shows Windows-1252; it differs from ISO-8859-1 only in rows 8x and 9x:

Windows-1252 (CP1252)
x0 x1 x2 x3 x4 x5 x6 x7 x8 x9 xA xB xC xD xE xF
0x NUL SOH STX ETX EOT ENQ ACK BEL BS TAB LF VT FF CR SO SI
1x DLE DC1 DC2 DC3 DC4 NAK SYN ETB CAN EM SUB ESC FS GS RS US
2x SP ! " # $ % & ' ( ) * + , - . /
3x 0 1 2 3 4 5 6 7 8 9 : ; < = > ?
4x @ A B C D E F G H I J K L M N O
5x P Q R S T U V W X Y Z [ \ ] ^ _
6x ` a b c d e f g h i j k l m n o
7x p q r s t u v w x y z { | } ~ DEL
8x € unused ‚ ƒ „ … † ‡ ˆ ‰ Š ‹ Œ unused Ž unused
9x unused ‘ ’ “ ” • – — ˜ ™ š › œ unused ž Ÿ
Ax NBSP ¡ ¢ £ ¤ ¥ ¦ § ¨ © ª « ¬ SHY ® ¯
Bx ° ± ² ³ ´ µ ¶ · ¸ ¹ º » ¼ ½ ¾ ¿
Cx À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï
Dx Ð Ñ Ò Ó Ô Õ Ö × Ø Ù Ú Û Ü Ý Þ ß
Ex à á â ã ä å æ ç è é ê ë ì í î ï
Fx ð ñ ò ó ô õ ö ÷ ø ù ú û ü ý þ ÿ

In Windows-1252, positions 81, 8D, 8F, 90, and 9D are unused. The euro character at position 80 was not present in earlier versions of this code page.
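
The unused positions and the euro assignment can be probed with the sketch below. It assumes Python's "cp1252" codec, which rejects the unassigned byte values; other implementations may instead pass them through as control characters. The helper `decode_or_none` is hypothetical.

```python
def decode_or_none(byte_value, encoding):
    """Decode a single byte, returning None where the codec has no mapping."""
    try:
        return bytes([byte_value]).decode(encoding)
    except UnicodeDecodeError:
        return None

c1_range = range(0x80, 0xA0)

# Positions that Python's cp1252 table leaves unassigned.
unused = [format(b, "02X") for b in c1_range
          if decode_or_none(b, "cp1252") is None]

# The displayable characters Windows-1252 places in the same range.
printable = "".join(decode_or_none(b, "cp1252") or "" for b in c1_range)

euro = decode_or_none(0x80, "cp1252")        # euro sign in Windows-1252
latin1_80 = decode_or_none(0x80, "latin-1")  # a control character in ISO-8859-1

print(unused)
print(printable)
print(repr(euro), repr(latin1_80))
```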


MacRoman

Older Apple Macintosh computers use an encoding, MacRoman, that differs from ISO 8859-1 in code values 00-1F and from 7F upward. It includes most of the characters present in ISO 8859-1, though at other locations; the exceptions are the soft hyphen, the broken bar ¦, the superscripts ¹ ² ³, the fractions ¼ ½ ¾, the multiplication sign ×, and the letters Ð ð Ý ý Þ þ. In contrast, MacRoman includes multiple characters which are not in ISO 8859-1. The euro glyph replaced the previous generic currency sign ¤ at position DB. The following table shows MacRoman; it matches ISO-8859-1 only in rows 2x through 7x (apart from 7F):

MacRoman
x0 x1 x2 x3 x4 x5 x6 x7 x8 x9 xA xB xC xD xE xF
0x unused unused unused unused unused unused unused unused unused TAB LF unused unused CR unused unused
1x unused
2x SP ! " # $ % & ' ( ) * + , - . /
3x 0 1 2 3 4 5 6 7 8 9 : ; < = > ?
4x @ A B C D E F G H I J K L M N O
5x P Q R S T U V W X Y Z [ \ ] ^ _
6x ` a b c d e f g h i j k l m n o
7x p q r s t u v w x y z { | } ~ unused
8x Ä Å Ç É Ñ Ö Ü á à â ä ã å ç é è
9x ê ë í ì î ï ñ ó ò ô ö õ ú ù û ü
Ax † ° ¢ £ § • ¶ ß ® © ™ ´ ¨ ≠ Æ Ø
Bx ∞ ± ≤ ≥ ¥ µ ∂ ∑ ∏ π ∫ ª º Ω æ ø
Cx ¿ ¡ ¬ √ ƒ ≈ ∆ « » … NBSP À Ã Õ Œ œ
Dx – — “ ” ‘ ’ ÷ ◊ ÿ Ÿ ⁄ € ‹ › ﬁ ﬂ
Ex ‡ · ‚ „ ‰ Â Ê Á Ë È Í Î Ï Ì Ó Ô
Fx APPLE Ò Ú Û Ù ı ˆ ˜ ¯ ˘ ˙ ˚ ¸ ˝ ˛ ˇ

In the table above, 20 is the regular SPACE character, and CA is the NO-BREAK SPACE. F0 is a glyph depicting the Apple logo, which does not exist in Unicode. 00–08, 0B and 0C, 0E–1F and 7F are unused.
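
A few of these positions can be checked with the sketch below, which assumes Python's "mac_roman" codec as a stand-in for the table. That codec is an assumption of the example: it follows the later, euro-bearing version of the code page and represents the Apple logo with a private-use code point. The helper `in_mac_roman` is hypothetical.

```python
def in_mac_roman(ch):
    """True if Python's mac_roman codec can encode the character."""
    try:
        ch.encode("mac_roman")
        return True
    except UnicodeEncodeError:
        return False

space = b"\x20".decode("mac_roman")   # regular SPACE
nbsp = b"\xCA".decode("mac_roman")    # NO-BREAK SPACE, at A0 in ISO 8859-1
euro = b"\xDB".decode("mac_roman")    # formerly the generic currency sign
apple = b"\xF0".decode("mac_roman")   # Apple logo, no standard Unicode character

# ISO 8859-1 characters from A0-FF that the codec cannot represent.
missing = [chr(cp) for cp in range(0xA0, 0x100) if not in_mac_roman(chr(cp))]

print(repr(space), repr(nbsp), repr(euro), hex(ord(apple)))
print([format(ord(ch), "02X") for ch in missing])
```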


The distinction between ISO 8859-1, ISO-8859-1, Windows-1252, and MacRoman is a common source of confusion among computer programmers and on the internet.
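
A small sketch of how that confusion shows up in practice: the same bytes read under three of these encodings give three different strings. The byte string is a made-up example, and the codec names ("cp1252", "latin-1", "mac_roman") are Python's labels, assumed here to match the tables above.

```python
# The word "café" in typographic quotation marks, as written by a
# Windows-1252 program (hypothetical sample data).
raw = b"\x93caf\xe9\x94"

decoded = {name: raw.decode(name) for name in ("cp1252", "latin-1", "mac_roman")}

for name, text in decoded.items():
    # repr() makes invisible control characters visible as escapes.
    print(f"{name:10} {text!r}")
```

Only the Windows-1252 reading recovers the intended quotation marks. ISO-8859-1 yields two invisible control characters around the word, and MacRoman turns the quotes and the é into unrelated accented letters.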


External links

  • ISO/IEC 8859-1:1998 (http://anubis.dkuug.dk/JTC1/SC2/WG3/docs/n411.pdf) final draft of the standard (PDF)
  • Windows Codepages (http://www.microsoft.com/globaldev/reference/WinCP.asp)
  • Differences between ANSI, ISO-8859-1 and MacRoman Character Sets (http://www.alanwood.net/demos/charsetdiffs.html)
  • The Letter Database (http://www.eki.ee/letter/)
  • ASCII - ISO 8859-1 Table with HTML Entity Names (http://www.bbsinc.com/iso8859.html)

The Wikipedia article included on this page is licensed under the GFDL.
Images may be subject to relevant owners' copyright.
All other elements are (c) copyright NationMaster.com 2003-5. All Rights Reserved.
Usage implies agreement with terms.