You are here: Test Case Description for copy-entities > Test Case character entity references in HTML 4.01

Character entity references in HTML 4.01

A complete list of supported character entities from the HTML 4.01 standard. Each entity is named, and both a "name" (of the form &xxxx;) and "code" (of the form &#nnn;) sample are attempted (sometimes the code may work when the name doesn't).

Standard ASCII Chart 0-127

Note 1: The ascii values of 0 (zero) through 32 are control sequences.

Note 2: ascii value 127 is not defined.

Entity Code Named Coded Description
Entity Code Named Coded Description
exclamation 33 "!" exclamation point
double quote 34 """ double quote
pound sign 35 "#" pound sign
dollar sign 36 "$" dollar sign
percent sign 37 "%" percent sign
ampersand 38 "&" ampersand
single quote 39 "'" close single quote
left paren 40 "(" left paren
right paren 41 ")" right paren
asterisk 42 "*" asterisk
plus sign 43 "+" plus sign
comma 44 "," comma
minus sign 45 "-" minus sign
period 46 "." period
forward slash 47 "/" forward slash
0 48 "0" zero
1 49 "1" one
2 50 "2" two
3 51 "3" three
4 52 "4" four
5 53 "5" five
6 54 "6" six
7 55 "7" seven
8 56 "8" eight
9 57 "9" nine
colon 58 ":" colon
semi-colon 59 ";" semi-colon
less than 60 "<" less than
equal sin 61 "=" equal sign
greater than 62 ">" greater than
question mark 63 "?" question mark
at sign 64 "@" at sign
A 65 "A" upper A
B 66 "B" upper B
C 67 "C" upper C
D 68 "D" upper D
E 69 "E" upper E
F 70 "F" upper F
G 71 "G" upper G
H 72 "H" upper H
I 73 "I" upper I
J 74 "J" upper J
K 75 "K" upper K
L 76 "L" upper L
M 77 "M" upper M
N 78 "N" upper N
O 79 "O" upper O
P 80 "P" upper P
Q 81 "Q" upper Q
R 82 "R" upper R
S 83 "S" upper S
T 84 "T" upper T
U 85 "U" upper U
V 86 "V" upper V
W 87 "W" upper W
X 88 "X" upper X
Y 89 "Y" upper Y
Z 90 "Z" upper Z
left square bracket 91 "[" left square bracket
backward slash 92 "\" backward slash
right square bracket 93 "]" right square bracket
caret 94 "^" caret
underscore 95 "_" underscore
open single quote 96 "`" open single quote
a 97 "a" lower a
b 98 "b" lower b
c 99 "c" lower c
d 100 "d" lower d
e 101 "e" lower e
f 102 "f" lower f
g 103 "g" lower g
h 104 "h" lower h
i 105 "i" lower i
j 106 "j" lower j
k 107 "k" lower k
l 108 "l" lower l
m 109 "m" lower m
n 110 "n" lower n
o 111 "o" lower o
p 112 "p" lower p
q 113 "q" lower q
r 114 "r" lower r
s 115 "s" lower s
t 116 "t" lower t
u 117 "u" lower u
v 118 "v" lower v
w 119 "w" lower w
x 120 "x" lower x
y 121 "y" lower y
z 122 "z" lower z
left curly bracket 123 "{" left curly bracket
or bar 124 "|" or bar
right curly bracket 125 "}" right curly bracket
tilde 126 "~" tilde

Note 2: ascii value 127 is not defined.

Latin-1 Supplement 128-159: used for control characters

Entity Code Named Coded Description
Entity Code Named Coded Description
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159

Latin-1 Supplement 160-255

Latin 1 entities (code positions 160-255) from Web Design Group (WDG)

Entity Code Named Coded Description
Entity Code Named Coded Description
nbsp 160 "&nbsp;" " " no-break space = non-breaking space
iexcl 161 "&iexcl;" "¡" inverted exclamation mark
cent 162 "&cent;" "¢" cent sign
pound 163 "&pound;" "£" pound sign
curren 164 "&curren;" "¤" currency sign
yen 165 "&yen;" "¥" yen sign = yuan sign
brvbar 166 "&brvbar;" "¦" broken bar = broken vertical bar
sect 167 "&sect;" "§" section sign
uml 168 "&uml;" "¨" diaeresis = spacing diaeresis
copy 169 "&copy;" "©" copyright sign
ordf 170 "&ordf;" "ª" feminine ordinal indicator
laquo 171 "&laquo;" "«" left-pointing double angle quotation mark= left pointing guillemet
not 172 "&not;" "¬" not sign
shy 173 "&shy;" "­" soft hyphen = discretionary hyphen
reg 174 "&reg;" "®" registered sign = registered trade mark sign
macr 175 "&macr;" "¯" macron = spacing macron = overline = APL overbar
deg 176 "&deg;" "°" degree sign
plusmn 177 "&plusmn;" "±" plus-minus sign = plus-or-minus sign
sup2 178 "&sup2;" "²" superscript two = superscript digit two = squared
sup3 179 "&sup3;" "³" superscript three = superscript digit three = cubed
acute 180 "&acute;" "´" acute accent = spacing acute
micro 181 "&micro;" "µ" micro sign
para 182 "&para;" "¶" pilcrow sign = paragraph sign
middot 183 "&middot;" "·" middle dot = Georgian comma = Greek middle dot
cedil 184 "&cedil;" "¸" cedilla = spacing cedilla
sup1 185 "&sup1;" "¹" superscript one = superscript digit one
ordm 186 "&ordm;" "º" masculine ordinal indicator
raquo 187 "&raquo;" "»" right-pointing double angle quotation mark = right pointing guillemet
frac14 188 "&frac14;" "¼" vulgar fraction one quarter fraction one quarter
frac12 189 "&frac12;" "½" vulgar fraction one half fraction one half
frac34 190 "&frac34;" "¾" vulgar fraction three quarters fraction three quarters
iquest 191 "&iquest;" "¿" inverted question mark turned question mark
Agrave 192 "&Agrave;" "À" latin capital letter A with grave latin capital letter A grave
Aacute 193 "&Aacute;" "Á" latin capital letter A with acute
Acirc 194 "&Acirc;" "Â" latin capital letter A with circumflex
Atilde 195 "&Atilde;" "Ã" latin capital letter A with tilde
Auml 196 "&Auml;" "Ä" latin capital letter A with diaeresis
Aring 197 "&Aring;" "Å" latin capital letter A with ring above latin capital letter A ring
AElig 198 "&AElig;" "Æ" latin capital letter AE latin capital ligature AE
Ccedil 199 "&Ccedil;" "Ç" latin capital letter C with cedilla
Egrave 200 "&Egrave;" "È" latin capital letter E with grave
Eacute 201 "&Eacute;" "É" latin capital letter E with acute
Ecirc 202 "&Ecirc;" "Ê" latin capital letter E with circumflex
Euml 203 "&Euml;" "Ë" latin capital letter E with diaeresis
Igrave 204 "&Igrave;" "Ì" latin capital letter I with grave
Iacute 205 "&Iacute;" "Í" latin capital letter I with acute
Icirc 206 "&Icirc;" "Î" latin capital letter I with circumflex
Iuml 207 "&Iuml;" "Ï" latin capital letter I with diaeresis
ETH 208 "&ETH;" "Ð" latin capital letter ETH
Ntilde 209 "&Ntilde;" "Ñ" latin capital letter N with tilde
Ograve 210 "&Ograve;" "Ò" latin capital letter O with grave
Oacute 211 "&Oacute;" "Ó" latin capital letter O with acute
Ocirc 212 "&Ocirc;" "Ô" latin capital letter O with circumflex
Otilde 213 "&Otilde;" "Õ" latin capital letter O with tilde
Ouml 214 "&Ouml;" "Ö" latin capital letter O with diaeresis
times 215 "&times;" "×" multiplication sign
Oslash 216 "&Oslash;" "Ø" latin capital letter O with stroke latin capital letter O slash
Ugrave 217 "&Ugrave;" "Ù" latin capital letter U with grave
Uacute 218 "&Uacute;" "Ú" latin capital letter U with acute
Ucirc 219 "&Ucirc;" "Û" latin capital letter U with circumflex
Uuml 220 "&Uuml;" "Ü" latin capital letter U with diaeresis
Yacute 221 "&Yacute;" "Ý" latin capital letter Y with acute
THORN 222 "&THORN;" "Þ" latin capital letter THORN
szlig 223 "&szlig;" "ß" latin small letter sharp s = ess-zed
agrave 224 "&agrave;" "à" latin small letter a with grave latin small letter a grave
aacute 225 "&aacute;" "á" latin small letter a with acute
acirc 226 "&acirc;" "â" latin small letter a with circumflex
atilde 227 "&atilde;" "ã" latin small letter a with tilde
auml 228 "&auml;" "ä" latin small letter a with diaeresis
aring 229 "&aring;" "å" latin small letter a with ring above latin small letter a ring
aelig 230 "&aelig;" "æ" latin small letter ae latin small ligature ae
ccedil 231 "&ccedil;" "ç" latin small letter c with cedilla
egrave 232 "&egrave;" "è" latin small letter e with grave
eacute 233 "&eacute;" "é" latin small letter e with acute
ecirc 234 "&ecirc;" "ê" latin small letter e with circumflex
euml 235 "&euml;" "ë" latin small letter e with diaeresis
igrave 236 "&igrave;" "ì" latin small letter i with grave
iacute 237 "&iacute;" "í" latin small letter i with acute
icirc 238 "&icirc;" "î" latin small letter i with circumflex
iuml 239 "&iuml;" "ï" latin small letter i with diaeresis
eth 240 "&eth;" "ð" latin small letter eth
ntilde 241 "&ntilde;" "ñ" latin small letter n with tilde
ograve 242 "&ograve;" "ò" latin small letter o with grave
oacute 243 "&oacute;" "ó" latin small letter o with acute
ocirc 244 "&ocirc;" "ô" latin small letter o with circumflex
otilde 245 "&otilde;" "õ" latin small letter o with tilde
ouml 246 "&ouml;" "ö" latin small letter o with diaeresis
divide 247 "&divide;" "÷" division sign
oslash 248 "&oslash;" "ø" latin small letter o with stroke, latin small letter o slash
ugrave 249 "&ugrave;" "ù" latin small letter u with grave
uacute 250 "amp;uacute;" "ú" latin small letter u with acute
ucirc 251 "&ucirc;" "û" latin small letter u with circumflex
uuml 252 "&uuml;" "ü" latin small letter u with diaeresis
yacute 253 "&yacute;" "ý" latin small letter y with acute
thorn 254 "&thorn;" "þ" latin small letter thorn with
yuml 255 "&yuml;" "ÿ" latin small letter y with diaeresis

Character entity references for symbols, mathematical symbols, and Greek letters

24.3 Character entity references for symbols, mathematical symbols, and Greek letters from W3C HTML 4.01 spec

Latin Extended-B

Entity Code Named Coded Description
Entity Code Named Coded Description
fnof 402 "&fnof;" "ƒ" latin small f with hook = function = florin

Greek

Note 1: there is no Sigmaf, and no U+03A2 character either.

Entities for Greek letters from Web Design Group (WDG)

Entity Code Named Coded Description
Entity Code Named Coded Description
Alpha 913 "&Alpha;" "Α" greek capital letter alpha
Beta 914 "&Beta;" "Β" greek capital letter beta
Gamma 915 "&Gamma;" "Γ" greek capital letter gamma
Delta 916 "&Delta;" "Δ" greek capital letter delta
Epsilon 917 "&Epsilon;" "Ε" greek capital letter epsilon
Zeta 918 "&Zeta;" "Ζ" greek capital letter zeta
Eta 919 "&Eta;" "Η" greek capital letter eta
Theta 920 "&Theta;" "Θ" greek capital letter theta
Iota 921 "&Iota;" "Ι" greek capital letter iota
Kappa 922 "&Kappa;" "Κ" greek capital letter kappa
Lambda 923 "&Lambda;" "Λ" greek capital letter lambda
Mu 924 "&Mu;" "Μ" greek capital letter mu
Nu 925 "&Nu;" "Ν" greek capital letter nu
Xi 926 "&Xi" "Ξ" greek capital letter xi
Omicron 927 "&Omicron;" "Ο" greek capital letter omicron
Pi 928 "&Pi;" "Π" greek capital letter pi
Rho 929 "&Rho;" "Ρ" greek capital letter rho
Sigma 931 "&Sigma;" "Σ" greek capital letter sigma [1]
Tau 932 "&Tau;" "Τ" greek capital letter tau
Upsilon 933 "&Upsilon;" "Υ" greek capital letter upsilon
Phi 934 "&Phi;" "Φ" greek capital letter phi
Chi 935 "&Chi;" "Χ" greek capital letter chi
Psi 936 "&Psi;" "Ψ" greek capital letter psi
Omega 937 "&Omega;" "Ω" greek capital letter omega
alpha 945 "&alpha;" "α" greek small letter alpha
beta 946 "&beta;" "β" greek small letter beta
gamma 947 "&gamma;" "γ" greek small letter gamma
delta 948 "&delta;" "δ" greek small letter delta
epsilon 949 "&epsilon;" "ε" greek small letter epsilon
zeta 950 "&zeta;" "ζ" greek small letter zeta
eta 951 "&eta;" "η" greek small letter eta
theta 952 "&theta;" "θ" greek small letter theta
iota 953 "&iota;" "ι" greek small letter iota
kappa 954 "&kappa;" "κ" greek small letter kappa
lambda 955 "&lambda;" "λ" greek small letter lambda
mu 956 "&mu;" "μ" greek small letter mu
nu 957 "&nu;" "ν" greek small letter nu
xi 958 "&xi" "ξ" greek small letter xi
omicron 959 "&omicron;" "ο" greek small letter omicron
pi 960 "&pi;" "π" greek small letter pi
rho 961 "&rho;" "ρ" greek small letter rho
sigmaf 962 "&sigmaf;" "ς" greek small letter final sigma
sigma 963 "&sigma;" "σ" greek small letter sigma
tau 964 "&tau;" "τ" greek small letter tau
upsilon 965 "&upsilon;" "υ" greek small letter upsilon
phi 966 "&phi;" "φ" greek small letter phi
chi 967 "&chi;" "χ" greek small letter chi
psi 968 "&psi;" "ψ" greek small letter psi
omega 969 "&omega;" "ω" greek small letter omega
thetasym 977 "&thetasym;" "ϑ" greek small letter theta symbol
upsih 978 "&upsih;" "ϒ" greek upsilon with hook symbol
piv 982 "&piv;" "ϖ" greek pi symbol

General Punctuation

Note 1: bullet is NOT the same as bullet operator.

Entity Code Named Coded Description
Entity Code Named Coded Description
bull 8226 "&bull;" "•" bullet = black small circle [1]
hellip 8230 "&hellip;" "…" horizontal ellipsis = three dot leader
prime 8242 "&prime;" "′" prime = minutes = feet
Prime 8243 "&Prime;" "″" double prime = seconds = inches
oline 8254 "&oline;" "‾" overline = spacing overscore
frasl 8260 "&frasl;" "⁄" fraction slash

Letterlike Symbols

Note 1: alef symbol is NOT the same as hebrew letter alef, U+05D0 although the same glyph could be used to depict both characters.

Entity Code Named Coded Description
Entity Code Named Coded Description
weierp 8472 "&weierp;" "℘" script capital P = power set = Weierstrass p
image 8465 "&image;" "ℑ" blackletter capital I = imaginary part
real 8476 "&real;" "ℜ" blackletter capital R = real part symbol
trade 8482 "&trade;" "™" trade mark sign
alefsym 8501 "&alefsym;" "ℵ" alef symbol = first transfinite cardinal

Arrows

Note 1: Unicode does not say that lArr is the same as the "is implied by" arrow but also does not have any other character for that function. So? lArr can be used for "is implied by" as ISOtech suggests

Note 2: Unicode does not say this is the "implies" character but does not have another character with this function so rArr can be used for "implies" as ISOtech suggests

Entity Code Named Coded Description
Entity Code Named Coded Description
larr 8592 "&larr;" "←" leftwards arrow
uarr 8593 "&uarr;" "↑" upwards arrow
rarr 8594 "&rarr;" "→" rightwards arrow
darr 8595 "&darr;" "↓" downwards arrow
harr 8596 "&harr;" "↔" left right arrow
crarr 8629 "&crarr;" "↵" downwards arrow with corner leftwards = carriage return
lArr 8656 "&lArr;" "⇐" leftwards double arrow [1]
uArr 8657 "&uArr;" "⇑" upwards double arrow
rArr 8658 "&rArr;" "⇒" rightwards double arrow [2]
dArr 8659 "&dArr;" "⇓" downwards double arrow
hArr 8660 "&hArr;" "⇔" left right double arrow

Mathematical Operators

Note 1: should there be a more memorable name than "ni"?

Note 2: prod is NOT the same character as U+03A0 "greek capital letter pi" though the same glyph might be used for both

Note 3: sum is NOT the same character as U+03A3 "greek capital letter sigma" though the same glyph might be used for both

Note 4: tilde operator is NOT the same character as the tilde although the same glyph might be used to represent both

Note 5: note that nsup, "not a superset of font encoding and is not included. Should it be, for symmetry? It is in ISOamsn

Note 6: dot operator is NOT the same character as U+00B7 middle dot

Entity Code Named Coded Description
Entity Code Named Coded Description
forall 8704 "&forall;" "∀" for all
part 8706 "&part;" "∂" partial differential
exist 8707 "&exist;" "∃" there exists
empty 8709 "&empty;" "∅" empty set = null set = diameter
nabla 8711 "&nabla;" "∇" nabla = backward difference
isin 8712 "&isin;" "∈" element of
notin 8713 "&notin;" "∉" not an element of
ni 8715 "&ni;" "∋" contains as member [1]
prod 8719 "&prod;" "∏" n-ary product = product sign [2]
sum 8721 "&sum;" "∑" n-ary sumation [3]
minus 8722 "&minus;" "−" minus sign
lowast 8727 "&lowast;" "∗" asterisk operator
radic 8730 "&radic;" "√" square root = radical sign
prop 8733 "&prop;" "∝" proportional to
infin 8734 "&infin;" "∞" infinity
ang 8736 "&ang;" "∠" angle
and 8743 "&and;" "∧" logical and = wedge
or 8744 "&or;" "∨" logical or = vee
cap 8745 "&cap;" "∩" intersection = cap
cup 8746 "&cup;" "∪" union = cup
int 8747 "&int;" "∫" integral
there4 8756 "&there4;" "∴" therefore
sim 8764 "&sim;" "∼" tilde operator = varies with = similar to [4]
cong 8773 "&cong;" "≅" approximately equal to
asymp 8776 "&asymp;" "≈" almost equal to = asymptotic to
ne 8800 "&ne;" "≠" not equal to
equiv 8801 "&equiv;" "≡" identical to
le 8804 "&le;" "≤" less-than or equal to
ge 8805 "&ge;" "≥" greater-than or equal to
sub 8834 "&sub;" "⊂" subset of
sup 8835 "&sup;" "⊃" superset of [5]
nsub 8836 "&nsub;" "⊄" not a subset of
sube 8838 "&sube;" "⊆" subset of or equal to
supe 8839 "&supe;" "⊇" superset of or equal to
oplus 8853 "&oplus;" "⊕" circled plus = direct sum
otimes 8855 "&otimes;" "⊗" circled times = vector product
perp 8869 "&perp;" "⊥" up tack = orthogonal to = perpendicular
sdot 8901 "&sdot;" "⋅" dot operator [6]

Miscellaneous Technical

Note 1: lang is NOT the same character as U+003C "less than" or U+2039 "single left-pointing angle quotation mark"

Note 2: rang is NOT the same character as U+003E "greater than" or U+203A "single right-pointing angle quotation mark"

Entity Code Named Coded Description
Entity Code Named Coded Description
lceil 8968 "&lceil;" "⌈" left ceiling = apl upstile
rceil 8969 "&rceil;" "⌉" right ceiling
lfloor 8970 "&lfloor;" "⌊" left floor = apl downstile
rfloor 8971 "&rfloor;" "⌋" right floor
lang 9001 "&lang;" "⟨" left-pointing angle bracket = bra [1]
rang 9002 "&rang;" "⟩" right-pointing angle bracket = ket [2]

Geometric Shapes

Entity Code Named Coded Description
Entity Code Named Coded Description
loz 9674 "&loz;" "◊" lozenge

Miscellaneous Symbols

Note 1: black here seems to mean filled as opposed to hollow

Entity Code Named Coded Description
Entity Code Named Coded Description
spades 9824 "&spades;" "♠" black spade suit [1]
clubs 9827 "&clubs;" "♣" black club suit = shamrock
hearts 9829 "&hearts;" "♥" black heart suit = valentine
diams 9830 "&diams;" "♦" black diamond suit

24.4 Character entity references for markup-significant and internationalization characters

Latin Extended-A

Note 1: ligature is a misnomer, this is a separate character in some languages

Entity Code Named Coded Description
Entity Code Named Coded Description
OElig 338 "&OElig;" "Œ" latin capital ligature OE
oelig 339 "&oelig;" "œ" latin small ligature oe [1]
Scaron 352 "&Scaron;" "Š" latin capital letter S with caron
scaron 353 "&scaron;" "š" latin small letter s with caron
Yuml 376 "&Yuml;" "Ÿ" latin capital letter Y with diaeresis

Spacing Modifier Letters

Entity Code Named Coded Description
Entity Code Named Coded Description
circ 710 "&circ;" "ˆ" modifier letter circumflex accent
tilde 732 "&tilde;" "˜" small tilde

General Punctuation

Note 1: lsaquo is proposed but not yet ISO standardized

Note 2: rsaquo is proposed but not yet ISO standardized

Entity Code Named Coded Description
Entity Code Named Coded Description
ensp 8194 &ensp; " " en space
emsp 8195 &emsp; " " em space
thinsp 8201 &thinsp; " " thin space
zwnj 8204 &zwnj; "‌" zero width non-joiner
zwj 8205 &zwj; "‍" zero width joiner
lrm 8206 &lrm; "‎" left-to-right mark
rlm 8207 &rlm; "‏" right-to-left mark
ndash 8211 &ndash; "–" en dash
mdash 8212 &mdash; "—" em dash
lsquo 8216 &lsquo; "‘" left single quotation mark
rsquo 8217 &rsquo; "’" right single quotation mark
sbquo 8218 &sbquo; "‚" single low-9 quotation mark
ldquo 8220 &ldquo; "“" left double quotation mark
rdquo 8221 &rdquo; "”" right double quotation mark
bdquo 8222 &bdquo; "„" double low-9 quotation mark
dagger 8224 &dagger; "†" dagger
Dagger 8225 &Dagger; "‡" double dagger
permil 8240 &permil; "‰" per mille sign
lsaquo 8249 &lsaquo; "‹" single left-pointing angle quotation mark [Note 1]
rsaquo 8250 &rsaquo; "›" single right-pointing angle quotation mark [Note 2]
Entity Code Named Coded Description
Entity Code Named Coded Description
euro 8364 &euro; "€" euro sign

(Thanks to Beth Epperson for compiling this entity list!)