Code Page 1252 characters and substitutions

This page doesn't display non-Windows platforms, and will not display completely on N4.x — this is the point of the page, as the first block characters aren't portable, despite what people might think.

Characters and codepoints
Characters as cp1252 values Unicode code-points (decimal) Characters escaped by codepoint
 ƒ 8364 129 8218 402  ƒ
8222 8230 8224 8225
ˆ Š 710 8240 352 8249 ˆ Š
Œ  Ž  338 141 381 143 Œ  Ž 
 144 8216 8217 8220 
8221 8226 8211 8212
˜ š 732 8482 353 8250 ˜ š
œ  ž Ÿ 339 157 382 376 œ  ž Ÿ

Characters appearing as an empty cell in the first block correspond to no printing character, even in cp1252, the usual Windows character to machine representation mapping. Character values in the block run 128–131 (0x80 to 0x83) in the top row, 132–135 (0x84 to 0x88) in the next, down to 156–159 (0x9C to 0x9F) in the last row.

If you want a printing character glyph from the first block in the upper left then you shouldn't use the character directly from a Windows machine, nor use an &#<decimal>; or &#x<hex>; with a value from 0x80 to 0x9f (128 to 159), but one of the named escapes shown below.

N.B. If you are targetting PocketIE, you need to use the Unicode codepoint number (decimal or hex); if you are also targetting your content at Netscape 4.0, use the decimal for these characters — so for a bullet, use &#8226; rather than &#bull; , even though it makes the code less legible.

Characters and named escape sequences
Characters escaped by name Named escape sequence, plus [not xml]
ƒ &#x20AC; [&#euro;] &#x201A; [&sbquo;] &fnof;
&#x201E; [&bdquo;] &#x2026; [&hellip;] &#x2020; [&dagger;] &#x2021; [&Dagger;]
ˆ Š &circ; &#x2030; [&permil;] &Scaron; &#x2039; [&lsaquo;]
Œ Ž &OElig; &#x017d;
&#x2018; [&lsquo;] &#x2019; [&rsquo;] &#x201C; [&ldquo;]
&#x201D; [&rdquo;] &#x2022; [&bull;] &#x2013; [&ndash;] &#x2014; [&mdash;]
˜ š &tilde; &#x2122 [&trade;] &scaron; &#x203A; [&rsaquo;]
œ ž Ÿ &oelig; &#x017e; &Yuml;

This page attempts to give some idea of where to use &ndash; and &mdash; — there are other useful characters to know to improve your typography, if you don't need to support NN4:–

You should continue to use ‘-’ (Unicode hyphen-minus) rather than the Unicode for hyphen, &#8208; (‐) or for minus, &#8722; (−) as they aren't well supported even in modern browsers. Also even when they work, as in Firefox, for my chosen fonts, the hyphen has too much space at either side to really be a proper ligature, and − (minus) is too big for my comfort, being larger than a – (ndash). Similarly ' is Unicode apostrophe and should be used as such rather than either ‘ or ′.