Description Code Entity name
=================================== ============ ==============
quotation mark " --> " " --> "
ampersand & --> & & --> &
less-than sign < --> < < --> <
greater-than sign > --> > > --> >
Description Char Code Entity name
=================================== ==== ============ ==============
euro sign &128; -->
undefined &129; -->
single low-9 quotation mark &130; -->
latin small letter f with hook &131; -->
double low-9 quotation mark &132; -->
horizontal ellipsis &133; -->
dagger &134; -->
double dagger &135; -->
modifier letter circumflex accent &136; -->
per mille sign &137; -->
latin capital letter s with caron &138; -->
single left-pointing angle quote mark &139; -->
latin capital ligature oe &140; -->
undefined &141; -->
latin capital letter z with caron &142; -->
undefined &143; -->
undefined &144; -->
left single quotation mark &145; -->
right single quotation mark &146; -->
left double quotation mark &147; -->
right double quotation mark &148; -->
bullet &149; -->
en dash &150; -->
em dash &151; -->
small tilde &152; -->
trade mark sign &153; -->
latin small letter s with caron &154; -->
single right-pointing angle quote mark &155; -->
latin small ligature oe &156; -->
undefined &157; -->
latin small letter z with caron &158; -->
latin capital letter y with diaeresis &159; -->
non-breaking space   --> -->
inverted exclamation ¡ --> ¡ ¡ --> ¡
cent sign ¢ --> ¢ ¢ --> ¢
pound sterling £ --> £ £ --> £
general currency sign ¤ --> ¤ ¤ --> ¤
yen sign ¥ --> ¥ ¥ --> ¥
broken vertical bar ¦ --> ¦ ¦ --> ¦
Non-standard &brkbar; --> &brkbar;
section sign § --> § § --> §
umlaut (dieresis) ¨ --> ¨ ¨ --> ¨
Non-standard ¨ --> ¨
copyright © --> © © --> ©
feminine ordinal ª --> ª ª --> ª
left angle quote, guillemotleft « --> « « --> «
not sign ¬ --> ¬ ¬ --> ¬
soft hyphen ­ --> ­ -->
registered trademark ® --> ® ® --> ®
macron accent ¯ --> ¯ ¯ --> ¯
Non-standard &hibar; --> &hibar;
degree sign ° --> ° ° --> °
plus or minus ± --> ± ± --> ±
superscript two ² --> ² ² --> ²
superscript three ³ --> ³ ³ --> ³
acute accent ´ --> ´ ´ --> ´
micro sign µ --> µ µ --> µ
paragraph sign ¶ --> ¶ ¶ --> ¶
middle dot · --> · · --> ·
cedilla ¸ --> ¸ ¸ --> ¸
superscript one ¹ --> ¹ ¹ --> ¹
masculine ordinal º --> º º --> º
right angle quote, guillemotright » --> » » --> »
fraction one-fourth ¼ --> ¼ ¼ --> ¼
fraction one-half ½ --> ½ ½ --> ½
fraction three-fourths ¾ --> ¾ ¾ --> ¾
inverted question mark ¿ --> ¿ ¿ --> ¿
capital A, grave accent À --> À À --> À
capital A, acute accent Á --> Á Á --> Á
capital A, circumflex accent  -->   --> Â
capital A, tilde à --> à à --> Ã
capital A, dieresis or umlaut mark Ä --> Ä Ä --> Ä
capital A, ring Å --> Å Å --> Å
capital AE diphthong (ligature) Æ --> Æ Æ --> Æ
capital C, cedilla Ç --> Ç Ç --> Ç
capital E, grave accent È --> È È --> È
capital E, acute accent É --> É É --> É
capital E, circumflex accent Ê --> Ê Ê --> Ê
capital E, dieresis or umlaut mark Ë --> Ë Ë --> Ë
capital I, grave accent Ì --> Ì Ì --> Ì
capital I, acute accent Í --> Í Í --> Í
capital I, circumflex accent Î --> Î Î --> Î
capital I, dieresis or umlaut mark Ï --> Ï Ï --> Ï
capital Eth, Icelandic Ð --> Ð Ð --> Ð
Non-standard Đ --> Đ
capital N, tilde Ñ --> Ñ Ñ --> Ñ
capital O, grave accent Ò --> Ò Ò --> Ò
capital O, acute accent Ó --> Ó Ó --> Ó
capital O, circumflex accent Ô --> Ô Ô --> Ô
capital O, tilde Õ --> Õ Õ --> Õ
capital O, dieresis or umlaut mark Ö --> Ö Ö --> Ö
multiply sign × --> × × --> ×
capital O, slash Ø --> Ø Ø --> Ø
capital U, grave accent Ù --> Ù Ù --> Ù
capital U, acute accent Ú --> Ú Ú --> Ú
capital U, circumflex accent Û --> Û Û --> Û
capital U, dieresis or umlaut mark Ü --> Ü Ü --> Ü
capital Y, acute accent Ý --> Ý Ý --> Ý
capital THORN, Icelandic Þ --> Þ Þ --> Þ
small sharp s, German (sz ligature) ß --> ß ß --> ß
small a, grave accent à --> à à --> à
small a, acute accent á --> á á --> á
small a, circumflex accent â --> â â --> â
small a, tilde ã --> ã ã --> ã
small a, dieresis or umlaut mark ä --> ä ä --> ä
small a, ring å --> å å --> å
small ae diphthong (ligature) æ --> æ æ --> æ
small c, cedilla ç --> ç ç --> ç
small e, grave accent è --> è è --> è
small e, acute accent é --> é é --> é
small e, circumflex accent ê --> ê ê --> ê
small e, dieresis or umlaut mark ë --> ë ë --> ë
small i, grave accent ì --> ì ì --> ì
small i, acute accent í --> í í --> í
small i, circumflex accent î --> î î --> î
small i, dieresis or umlaut mark ï --> ï ï --> ï
small eth, Icelandic ð --> ð ð --> ð
small n, tilde ñ --> ñ ñ --> ñ
small o, grave accent ò --> ò ò --> ò
small o, acute accent ó --> ó ó --> ó
small o, circumflex accent ô --> ô ô --> ô
small o, tilde õ --> õ õ --> õ
small o, dieresis or umlaut mark ö --> ö ö --> ö
division sign ÷ --> ÷ ÷ --> ÷
small o, slash ø --> ø ø --> ø
small u, grave accent ù --> ù ù --> ù
small u, acute accent ú --> ú ú --> ú
small u, circumflex accent û --> û û --> û
small u, dieresis or umlaut mark ü --> ü ü --> ü
small y, acute accent ý --> ý ý --> ý
small thorn, Icelandic þ --> þ þ --> þ
small y, dieresis or umlaut mark ÿ --> ÿ ÿ --> ÿ
-->
"what your browser does with it"
-->
"what your browser does with it"
÷ -->
÷" in the 4th column, this means your browser
doesn't know about the entity name "divide" and just puts it
literally.
This table grew out of an overview of the "ISO
Latin-1 Character Set" overview related to the Hyper-G Text Format
(HTF).
The entity names &brkbar; and Đ
seem to be unique to HTF.
The entity name &hibar; has been supported by X Mosaic
but seems to be replaced with ¯.
The entity names ¨ and ¨ should
be equivalent.
The standards stuff:
The
HTML 2.0 Standard
includes a section on
Character Entity Sets
and an overview on the
HTML Coded Character Set
(The entity names are derived from ISO 8879).
Or have a look at the
Latin-1 Character Entities
as listed in an draft for the
HTML 3.0 specification.
The
Appendix II
of CERN's
HTML+ Discussion Document
contains a
table
(in PostScript format) of the proposed character entities for HTML+ and their
corresponding character codes for Unicode and the Adobe Latin-1 & Symbol
character sets.
Please note that there is nothing wrong with using characters of ISO Latin-1 above 127: the normal transmission protocol for the WWW, HTTP/1.0, uses the 8bit ISO latin-1 as default encoding. (Thanks to Roman Czyborra for pointing this out!)
Other information:
server/ddx/sun/Compose.list.