Windows-1252 sandbox (decode)
Convert a Windows-1252 (CP1252) byte into a Unicode code point.
Input: one byte in hex (with or without the 0x prefix). Windows-1252 encodes one character per byte.
U+20AC €
1 byte
Decimal
8364
Byte
0x80
Binary
10000000
CP1252 range
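Range 0x80 to 0x9F: the Microsoft extension. This is where Windows-1252 diverges from Latin-1 (ISO-8859-1): 27 of these 32 bytes map to printable characters (€ at 0x80, ™ at 0x99, em-dash — at 0x97, smart quotes, ...) instead of C1 controls. The remaining five bytes (0x81, 0x8D, 0x8F, 0x90, 0x9D) are unassigned.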
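The count of 27 can be checked with a short Python sketch. It assumes the `cp1252` codec bundled with CPython, which rejects the five unassigned bytes with `UnicodeDecodeError`. Other decoders may behave differently (browsers following the WHATWG Encoding Standard, for example, map those bytes to C1 controls). The names `assigned` and `unassigned` are illustrative only.

```python
# Survey the Microsoft extension range 0x80..0x9F with Python's cp1252 codec.
assigned = {}    # byte value -> decoded character
unassigned = []  # byte values the codec refuses to decode

for value in range(0x80, 0xA0):
    try:
        assigned[value] = bytes([value]).decode("cp1252")
    except UnicodeDecodeError:
        # Python's cp1252 table has no mapping for this byte.
        unassigned.append(value)

print(len(assigned))                          # 27
print([f"0x{v:02X}" for v in unassigned])     # ['0x81', '0x8D', '0x8F', '0x90', '0x9D']
print(assigned[0x80], assigned[0x97], assigned[0x99])
```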
Step-by-step breakdown
- 01 Read the value in binary
  Write the byte value as 8 bits: 10000000
- 02 Convert to a code point
  The byte is in the Microsoft extension. The CP1252 table maps it to U+20AC (decimal 8364). Classic trap: if you read this same byte as Latin-1, you get an invisible C1 control instead of a printable character.
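The two steps above can be reproduced with a minimal Python sketch. The function name `decode_cp1252_byte` and the keys of the returned dictionary are hypothetical, chosen for illustration, and the lookup relies on CPython's built-in `cp1252` codec rather than on this page's own table.

```python
def decode_cp1252_byte(text: str) -> dict:
    """Decode one CP1252 byte given as hex text, with or without a 0x prefix."""
    value = int(text.strip(), 16)  # base-16 int() accepts an optional 0x prefix
    if not 0 <= value <= 0xFF:
        raise ValueError("expected a single byte between 00 and FF")

    raw = bytes([value])
    char = raw.decode("cp1252")  # raises UnicodeDecodeError for unassigned bytes

    return {
        "binary": format(value, "08b"),          # step 01: the byte as 8 bits
        "code_point": f"U+{ord(char):04X}",      # step 02: the CP1252 mapping
        "decimal": ord(char),
        "char": char,
        # The trap: the same byte read as Latin-1 maps to itself.
        "latin1_code_point": f"U+{ord(raw.decode('latin-1')):04X}",
    }


result = decode_cp1252_byte("0x80")
print(result["binary"])             # 10000000
print(result["code_point"])         # U+20AC
print(result["decimal"])            # 8364
print(result["latin1_code_point"])  # U+0080
```

Reading byte 0x80 as Latin-1 gives U+0080, a C1 control, which is why mislabelled CP1252 text so often loses its euro signs and smart quotes.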