charset.school
Decode Windows-1252

sandbox

Windows-1252 sandbox (decode)

Convert a Windows-1252 (CP1252) byte into a Unicode code point.

Hex (with or without 0x). Windows-1252 encodes one character per byte.

U+20AC
1 byte

Decimal

8364

Byte

0x80

Binary

10000000

CP1252 range

Range 0x800x9F - the Microsoft extension. This is where Windows-1252 diverges from Latin-1: 27 printable characters ( at 0x80, at 0x99, em-dash at 0x97, smart quotes, ...) instead of C1 controls.

Step-by-step breakdown

  1. 01

    Read the value in binary

    Write the byte value as 8 bits.

    10000000
  2. 02

    Convert to a code point

    The byte is in the Microsoft extension. The CP1252 table maps it to U+20AC (decimal 8364). Classic trap: if you read this same byte as Latin-1, you get an invisible C1 control instead of a printable character.

    U+20AC
charset.school

Teaching tool. No tracking, no ads.

Developed by Florent Sorel