
The HTML Document Character Set


This <table> is a reference for the HTML Document Character Set. In addition to the standard ISO-8859-1 (Latin-1) character repertoire, I have included a selection of Unicode characters. This is designed as a look-up reference for HTML authors; if you are looking for a browser check, see Ian Graham's entity test page.
(Due to the number of cells in the table, this page may take a while to render. Don't panic!)

Characters above #160 are sorted by function (alphabetic, diacritic, punctuation, and math/currency), and 63 GIFs are provided for the characters not in the union set of the ISO-8859-1 and Macintosh (U.S.) character sets.

See my ASCII-EBCDIC chart, complete with hexadecimal codes and those mysterious control characters in the 128-159 range. That's a big <table>, too (256 x 6).

If you have additional questions, see my Q & A page.


Decimal Code Char Description Entity Notes PostScript Name

ISO-8859-1 Characters (Supplementary Characters in bottom section)
&#00; - &#08;Unused
&#09; Horizontal tab  
&#10; Line feed  
&#11; - &#12;Unused
&#13; Carriage Return  
&#14; - &#31;Unused
&#32; Space  space
&#33;!Exclamation mark  exclam
&#34;"Quotation mark&quot; quotedbl
&#35;#Number sign  numbersign
&#36;$Dollar sign  dollar
&#37;%Percent sign  percent
&#38;&Ampersand&amp; ampersand
&#39;'Apostrophe  quotesingle
&#40;(Left parenthesis  parenleft
&#41;)Right parenthesis  parenright
&#42;*Asterisk  asterisk
&#43;+Plus sign  plus
&#44;,Comma  comma
&#45;-Hyphen  hyphen
&#46;.Period (fullstop)  period
&#47;/Solidus  slash
&#48; - &#57;0 - 9Decimal digits
&#58;:Colon  colon
&#59;;Semicolon  semicolon
&#60;<Less than&lt; less
&#61;=Equals sign  equal
&#62;>Greater than&gt; greater
&#63;?Question mark  question
&#64;@Commercial at  at
&#65; - &#90;A - ZUppercase letters
&#91;[Left square bracket  bracketleft
&#92;\Reverse solidus  backslash
&#93;]Right square bracket  bracketright
&#94;^Caret  asciicircum
&#95;_Horizontal bar  underscore
&#96;`Grave accent  grave
&#97; - &#122;a - zLowercase letters
&#123;{Left curly brace  braceleft
&#124;|Vertical bar  bar
&#125;}Right curly brace  braceright
&#126;~Tilde  asciitilde
&#127; - &#159;Unused (truly!)
&#160; Nonbreaking space&nbsp; nbspace, nobreakspace
&#192;ÀCapital A, grave accent&Agrave; Agrave
&#224;àSmall a, grave accent&agrave; agrave
&#193;ÁCapital A, acute accent&Aacute; Aacute
&#225;áSmall a, acute accent&aacute; aacute
&#194;ÂCapital A, circumflex accent&Acirc; Acircumflex
&#226;âSmall a, circumflex accent&acirc; acircumflex
&#195;ÃCapital A, tilde&Atilde; Atilde
&#227;ãSmall a, tilde&atilde; atilde
&#196;ÄCapital A, dieresis or umlaut mark&Auml; Adieresis
&#228;äSmall a, dieresis or umlaut mark&auml; adieresis
&#197;ÅCapital A, ring&Aring; Aring
&#229;åSmall a, ring&aring; aring
&#198;ÆCapital AE diphthong (ligature)&AElig; AE
&#230;æSmall ae diphthong (ligature)&aelig; ae
&#199;ÇCapital C, cedilla&Ccedil; Ccedilla
&#231;çSmall c, cedilla&ccedil; ccedilla
&#208;GIF: ÐCapital Eth, Icelandic&ETH;c1Eth
&#240;GIF: ðSmall eth, Icelandic&eth;c1eth
&#200;ÈCapital E, grave accent&Egrave; Egrave
&#232;èSmall e, grave accent&egrave; egrave
&#201;ÉCapital E, acute accent&Eacute; Eacute
&#233;éSmall e, acute accent&eacute; eacute
&#202;ÊCapital E, circumflex accent&Ecirc; Ecircumflex
&#234;êSmall e, circumflex accent&ecirc; ecircumflex
&#203;ËCapital E, dieresis or umlaut mark&Euml; Edieresis
&#235;ëSmall e, dieresis or umlaut mark&euml; edieresis
&#204;ÌCapital I, grave accent&Igrave; Igrave
&#236;ìSmall i, grave accent&igrave; igrave
&#205;ÍCapital I, acute accent&Iacute; Iacute
&#237;íSmall i, acute accent&iacute; iacute
&#206;ÎCapital I, circumflex accent&Icirc; Icircumflex
&#238;îSmall i, circumflex accent&icirc; icircumflex
&#207;ÏCapital I, dieresis or umlaut mark&Iuml; Idieresis
&#239;ïSmall i, dieresis or umlaut mark&iuml; idieresis
&#181;µMicro sign&micro; mu
&#209;ÑCapital N, tilde&Ntilde; Ntilde
&#241;ñSmall n, tilde&ntilde; ntilde
&#210;ÒCapital O, grave accent&Ograve; Ograve
&#242;òSmall o, grave accent&ograve; ograve
&#211;ÓCapital O, acute accent&Oacute; Oacute
&#243;óSmall o, acute accent&oacute; oacute
&#212;ÔCapital O, circumflex accent&Ocirc; Ocircumflex
&#244;ôSmall o, circumflex accent&ocirc; ocircumflex
&#213;ÕCapital O, tilde&Otilde; Otilde
&#245;õSmall o, tilde&otilde; otilde
&#214;ÖCapital O, dieresis or umlaut mark&Ouml; Odieresis
&#246;öSmall o, dieresis or umlaut mark&ouml; odieresis
&#216;ØCapital O, slash&Oslash; Oslash
&#248;øSmall o, slash&oslash; oslash
&#223;ßSmall sharp s, German (sz ligature)&szlig; germandbls
&#222;GIF: ÞCapital THORN, Icelandic&THORN;c1Thorn
&#254;GIF: þSmall thorn, Icelandic&thorn;c1thorn
&#217;ÙCapital U, grave accent&Ugrave; Ugrave
&#249;ùSmall u, grave accent&ugrave; ugrave
&#218;ÚCapital U, acute accent&Uacute; Uacute
&#250;úSmall u, acute accent&uacute; uacute
&#219;ÛCapital U, circumflex accent&Ucirc; Ucircumflex
&#251;ûSmall u, circumflex accent&ucirc; ucircumflex
&#220;ÜCapital U, dieresis or umlaut mark&Uuml; Udieresis
&#252;üSmall u, dieresis or umlaut mark&uuml; udieresis
&#221;GIF: ÝCapital Y, acute accent&Yacute;c1Yacute
&#253;GIF: ýSmall y, acute accent&yacute;c1yacute
&#255;ÿSmall y, dieresis or umlaut mark&yuml; ydieresis
&#168;¨Umlaut&uml; dieresis
&#175;¯Macron accent&macr; macron
&#180;´Acute accent&acute; acute
&#184;¸Cedilla&cedil; cedilla
&#161;¡Inverted exclamation&iexcl; exclamdown
&#191;¿Inverted question mark&iquest; questiondown
&#183;·Middle dot&middot; periodcentered
&#166;GIF: ¦Broken vertical bar&brvbar;c1brokenbar
&#171;«Left angle quote&laquo; guillemotleft
&#187;»Right angle quote&raquo; guillemotright
&#182;¶Paragraph sign&para; paragraph
&#167;§Section sign&sect; section
&#169;©Copyright&copy; copyright, copyrightserif
&#174;®Registered trademark&reg; registered, registerserif
&#185;GIF: ¹Superscript one&sup1;c1onesuperior
&#178;GIF: ²Superscript two&sup2;c1twosuperior
&#179;GIF: ³Superscript three&sup3;c1threesuperior
&#173;­Soft hyphen&shy;e1minus(?)
&#215;GIF: ×Multiply sign&times;c1multiply
&#247;÷Division sign&divide; divide
&#188;GIF: ¼Fraction one-fourth&frac14;c1onequarter
&#189;GIF: ½Fraction one-half&frac12;c1onehalf
&#190;GIF: ¾Fraction three-fourths&frac34;c1threequarters
&#170;ªFeminine ordinal&ordf; ordfeminine
&#186;ºMasculine ordinal&ordm; ordmasculine
&#172;¬Not sign&not; logicalnot
&#176;°Degree sign&deg; degree
&#177;±Plus or minus&plusmn; plusminus
&#164;¤General currency sign&curren; currency
&#162;¢Cent sign&cent; cent
&#163;£Pound sterling&pound; sterling
&#165;¥Yen sign&yen; yen
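
As a rough illustration of how the Decimal Code and Entity columns relate, here is a minimal sketch in Python (chosen only for illustration; the page itself does not depend on it). It relies on the standard library's html module. The helper name describe is hypothetical, and codepoint2name covers only the HTML 4 named entities, so names taken from ISO-8879 alone may not appear in it.

```python
import html
from html.entities import codepoint2name, name2codepoint

def describe(code):
    """Format one code point as 'decimal reference, character, named entity'."""
    char = chr(code)
    name = codepoint2name.get(code)  # None when HTML 4 defines no named entity
    entity = f"&{name};" if name else "(none)"
    return f"&#{code}; {char} {entity}"

# A decimal reference and its named entity decode to the same character.
assert html.unescape("&#233;") == html.unescape("&eacute;") == "é"

# Entity names are case-sensitive: &THORN; and &thorn; are different characters.
assert name2codepoint["THORN"] == 222
assert name2codepoint["thorn"] == 254

for code in (33, 38, 169, 233, 247):
    print(describe(code))
```

Looking a row up by code rather than by name keeps the sketch usable for the many rows in the table that have no entity at all.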

Supplementary Characters (Unicode UCS-2 is "charset=UNICODE-1-1")
&#916;GIF: Δdelta&Delta;e0Delta
&#402;GIF: ƒflorin&fnof;e0florin
&#937;GIF: Ωomega&Omega;e0Omega
&#338;GIF: ŒOE ligature&OElig;e0OE
&#339;GIF: œoe ligature&oelig;e0oe
&#352;GIF: ŠScaron&Scaron;e0Scaron
&#353;GIF: šscaron&scaron;e0scaron
&#376;GIF: ŸYdieresis&Yuml;e0Ydieresis
&#305;GIF: ıdotless i&inodot;e0dotlessi
&#710;GIF: ˆcircumflex&circ;e0circumflex
&#711;GIF: ˇcaron&caron;e0caron
&#728;GIF: ˘breve&breve;e0breve
&#729;GIF: ˙dot accent&dot;e0dotaccent
&#730;GIF: ˚ring&ring;e0ring
&#731;GIF: ˛ogonek&ogon;e0ogonek
&#732;GIF: ˜tilde&tilde;e0tilde
&#733;GIF: ˝double acute accent&dblac;e0hungarumlaut
&#8211;GIF: –en dash&ndash;e1endash
&#8212;GIF: —em dash&mdash;e1emdash
&#8224;GIF: †dagger&dagger;e0dagger
&#8225;GIF: ‡double dagger&Dagger;e0daggerdbl
&#8226;GIF: •bullet&bull;e0bullet
&#8230;GIF: …ellipsis&hellip;e0ellipsis
&#8216;GIF: ‘quote left&lsquo;e0quoteleft
&#8217;GIF: ’quote right&rsquo;e0quoteright
&#8218;GIF: ‚quote single base&lsquor;e0quotesinglbase
&#8220;GIF: “quote double left&ldquo;e0quotedblleft
&#8221;GIF: ”quote double right&rdquo;e0quotedblright
&#8222;GIF: „quote double base&ldquor;e0quotedblbase
&#8249;GIF: ‹guille single left&lsaquo;e?guilsinglleft
&#8250;GIF: ›guille single right&rsaquo;e?guilsinglright
&#8482;GIF: ™trademark, TM&trade;e1trademark, trademarkserif
&#8480;GIF: ℠service mark, SM   
&#8471;GIF: ℗sound recording copyright, (P)   
&#8730;GIF: √radical&radic;e0radical
&#8734;GIF: ∞infinity&infin;e0infinity
&#8747;GIF: ∫integral&int;e0integral
&#8706;GIF: ∂partial differential&part;e0partialdiff
&#8773;GIF: ≅approximately equal&ap;e0approxequal
&#8800;GIF: ≠not equal&ne;e0notequal
&#8804;GIF: ≤less than or equal&le;e0lessequal
&#8805;GIF: ≥greater than or equal&ge;e0greaterequal
&#931;GIF: Σsummation&sum;e0summation
&#8240;GIF: ‰per thousand (mille)&permil;e0perthousand
&#8260;GIF: ⁄fraction separator  fraction
&#8719; product&prod;e0product
&#960; pi&pi;e0pi
&#9674; lozenge (diamond)&loz;e0lozenge
&#8355; franc  franc
&#63743; Apple logo  apple
&#8984;GIF: ⌘command key   
&#8997;GIF: ⌥option key   
&#9774;GIF: ☮peace symbol   
&#9775;GIF: ☯yin yang   

Notes
c1 Character not in charset="macintosh" (x-mac-roman, code page 10000)
c2 Character not in charset="windows-1252" (code page 1252)
c3 Character not in charset="yyyyyy" (What is Unix's charset name?)
e? Entity proposed for ISO-8879 but not yet approved
e0 Entity from ISO-8879 (not yet recognized by HTML agents)
e1 Entity not recognized by Netscape 1.1
e2 Entity not recognized by NCSA Mosaic 2.0 (I need info...)
e3 Entity not recognized by Lynx 2.4 (I need info...)
h2 Introduced in HTML 2.0 (RFC 1866)
h3 Proposed for HTML 3.0
hW Introduced in HTML 3.2
h4 Introduced in HTML 4.0/4.01
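
The c1 and c2 notes boil down to one question: can a given character be encoded in a given charset? The following is a hedged sketch of that check in Python. The helper names in_charset and missing_from are hypothetical, and mac_roman and cp1252 are Python's own codec labels for those code pages. Python's codec tables reflect later revisions of these charsets, so the result may not match the c1 marks in the table exactly.

```python
def in_charset(char, charset):
    """Return True if `char` can be encoded in `charset` (a Python codec name)."""
    try:
        char.encode(charset)
        return True
    except UnicodeEncodeError:
        return False

def missing_from(charset, start=160, stop=255):
    """List the characters in the code range start..stop that `charset` lacks."""
    return [chr(c) for c in range(start, stop + 1)
            if not in_charset(chr(c), charset)]

# Icelandic Eth carries a c1 note above: absent from Mac Roman, present in 1252.
print(in_charset("Ð", "mac_roman"), in_charset("Ð", "cp1252"))

# Every upper-half ISO-8859-1 character that Python's Mac Roman codec rejects.
print(missing_from("mac_roman"))
```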
The supplementary characters use Unicode decimal values. Although most current browsers do not yet support Unicode, this is the direction in which HTML is heading.
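
To see what "Unicode decimal values" means in practice, this small Python sketch converts a decimal code from the supplementary table to the usual U+XXXX hexadecimal notation and looks up its Unicode name. The function name code_point_info is hypothetical, and the names returned depend on the Unicode version bundled with the Python in use.

```python
import unicodedata

def code_point_info(decimal):
    """Show a decimal reference next to its U+hex form and Unicode name."""
    char = chr(decimal)
    # Private-use code points (such as 63743, the Apple logo row) have no name.
    name = unicodedata.name(char, "(unnamed)")
    return f"&#{decimal}; = U+{decimal:04X} {name}"

for decimal in (916, 8211, 8482, 63743):
    print(code_point_info(decimal))
```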

Since Unicode is designed for information exchange rather than typography, there is only one glyph per character (although the same character may appear in more than one code page).
Unicode is not a font technology. Adobe has developed CID fonts (see the tech notes) to handle large character sets, Bitstream has developed the Cyberbit font (8,500 Unicode characters), and Apple's QuickDraw GX font technology supports multiple glyphs per character (such as swash capitals, superscripts and subscripts, true small caps, true ligatures, etc.) under MacOS. In Mac OS 8.5, Apple introduced a new API to support layout and drawing of Unicode text called ATSUI (Apple Text Services for Unicode Imaging). Since many Windows TrueType fonts, and all OpenType fonts, are set up to be used for Unicode exclusively, they're a good fit for use by ATSUI. Apple includes this API in the core of Mac OS X, which supports multiple encodings. For more information about fonts on the Web, see the W3C page Fonts and the Web.



<URL:http://www.natural-innovations.com/wa/doc-charset.html> (page info)

natural-innovations.com (c) 1995-2012 Walter Ian Kaye