Windows Code Page IDs


This is a list of the code page IDs used in Microsoft Windows.

I could not find a complete reference in the Microsoft documentation or anywhere else on the net, so I decided to put this information together myself and provide it here. It may be helpful to someone. If anybody knows of a better reference, I will drop this list.

The list is rather raw, probably incomplete, and far from perfect. Please send me additions, corrections, and suggestions.


CP ID  Char Set Name (or something...)             IANA Char Set ID    Src.  Remarks
037    EBCDIC                                      IBM037              1
290    IBM Extended English Katakana               IBM290              6     EBCDIC?, SNA Server TrnsDT, =20290
300    IBM Japanese Character Sets (?)             ?                   6     EBCDIC?, SNA Server TrnsDT
437    MS-DOS United States                        IBM437              1
500    EBCDIC "500V1"                              IBM500              1
708    Arabic (ASMO 708)                           ISO-8859-6          1
709    Arabic (ASMO 449+, BCON V4)                 ASMO_449            1
710    MS-DOS Arabic (Transparent Arabic)          ?                   1
711    Arabic (Nafitha Enhanced)                   ?                   2
720    Arabic (Transparent ASMO)                   ?                   1
737    MS-DOS Greek (formerly 437G)                ?                   1
775    MS-DOS Baltic                               IBM775              1
833    IBM Hangul Extended Single-Byte (?)         ?                   6     EBCDIC?, SNA Server TrnsDT
834    Korean Host Double-byte                     ?                   6     EBCDIC?, SNA Server TrnsDT
835    IBM Traditional Chinese Character Sets (?)  ?                   6     EBCDIC?, SNA Server TrnsDT
836    Simplified Chinese Single-Byte (?)          ?                   6     EBCDIC?, SNA Server TrnsDT
837    IBM Simplified Chinese Character Sets (?)   ?                   6     EBCDIC?, SNA Server TrnsDT
850    MS-DOS Multilingual (Latin I)               IBM850              1
852    MS-DOS Slavic (Latin II)                    IBM852              1
855    IBM Cyrillic (primarily Russian)            IBM855              1     MS-DOS
857    IBM Turkish                                 IBM857              1     MS-DOS
860    MS-DOS Portuguese                           IBM860              1
861    MS-DOS Icelandic                            IBM861              1
862    MS-DOS Hebrew                               IBM862              1
863    MS-DOS Canadian-French                      IBM863              1
864    MS-DOS Arabic                               IBM864              1
865    MS-DOS Nordic                               IBM865              1
866    MS-DOS Russian, Cyrillic                    IBM866              1
869    IBM Modern Greek                            IBM869              1     MS-DOS
870    IBM EBCDIC Latin-2                          IBM870              ?
874    Thai                                        ?                   1     Windows/MS-DOS
875    IBM Greek EBCDIC                            IBM423 (?)          1
932    Japanese                                    ?                   1     MS-DOS, Shift-JIS(?)
936    Chinese (China, Singapore)                  GB_2312-80          1     MS-DOS, (Simplified Chinese)
949    Korean (Wansung)                            KS_C_5601-1987      1     MS-DOS, (Unified Hangul)
950    Chinese (Taiwan)                            Big5                1     MS-DOS, (Traditional Chinese)
1026   IBM Turkish EBCDIC                          IBM1026             3
1027   IBM Extended Lowercase English              ?                   6     EBCDIC?, SNA Server TrnsDT
1200   Universal Alphabet (Unicode)                (ISO-10646-UCS-2)   1     Little Endian
1201   Universal Alphabet (Unicode)                (ISO-10646-UCS-2)   1     Big Endian
1250   Windows Central European                    windows-1250        1
1251   Windows Cyrillic                            windows-1251        1
1252   Windows Western European/US                 windows-1252        1
1253   Windows Greek                               windows-1253        1
1254   Windows Turkish                             windows-1254        1
1255   Windows Hebrew                              windows-1255        1
1256   Windows Arabic                              windows-1256        1
1257   Windows Baltic                              windows-1257        1
1258   Windows Vietnamese                          windows-1258        1     +MS-DOS
1361   Windows Korean                              KS_C_5601-1992 (?)  1
10000  Macintosh Roman                             ?                   1
10001  Macintosh Japanese                          ?                   1
10002  Macintosh Chinese                           ?                   ?
10003  Macintosh Korean                            ?                   ?
10004  Macintosh Arabic                            ?                   ?
10005  Macintosh Hebrew                            ?                   ?
10006  Macintosh Greek 1                           ?                   1
10007  Macintosh Cyrillic                          ?                   1
10008  Macintosh ?                                 ?                   ?
10010  Macintosh Romanian                          ?                   ?
10017  Macintosh Ukrainian                         ?                   ?
10029  Macintosh Latin 2                           ?                   1
10079  Macintosh Icelandic                         ?                   1
10081  Macintosh Turkish                           ?                   1
10082  Macintosh Croatian                          ?                   ?
20105  IA5 IRV                                     DIN_66003           4
20106  IA5 (German)                                DIN_66003           4
20107  IA5 (Swedish)                               SEN_850200_B        4
20108  IA5 (Norwegian)                             NS_4551-1           4
20261  T.61                                        T.61-8bit           5     Teletex
20269  ISO-6937                                    ?                   ?
20273  IBM EBCDIC Germany                          IBM273 (?)          6
20277  IBM EBCDIC Denmark/Norway                   IBM277 (?)          6
20278  IBM EBCDIC Finland/Sweden                   IBM278 (?)          6
20280  IBM EBCDIC Italy                            IBM280 (?)          6
20284  IBM EBCDIC Latin America/Spain              IBM284 (?)          6
20285  IBM EBCDIC United Kingdom                   IBM285 (?)          6
20290  IBM EBCDIC Japanese                         IBM290              ?     Kana, =290
20297  IBM EBCDIC France                           IBM297              6
20420  IBM EBCDIC Arabic                           IBM420              6
20423  IBM EBCDIC Greek                            IBM423              6
20833  Korean (IBM EBCDIC?)                        ?                   ?
20838  IBM EBCDIC Thai                             IBM-Thai            6
20866  Russian - KOI8-R                            KOI8-R              4
20871  IBM EBCDIC Icelandic                        IBM871              6
20880  IBM EBCDIC Cyrillic                         IBM880              6
20905  IBM EBCDIC Turkish                          IBM905              6
21025  IBM EBCDIC Cyrillic                         ?                   6
21027  Japanese (?)                                ?                   ?
21866  Ukrainian - KOI8-U                          KOI8-U              ?     RFC 2319
28591  ISO 8859-1 Western                          ISO_8859-1:1987     4
28592  ISO 8859-2 Eastern Europe                   ISO_8859-2:1987     4
28593  ISO 8859-3 Turkish                          ISO_8859-3:1988     4
28594  ISO 8859-4 Baltic                           ISO_8859-4:1988     4
28595  ISO 8859-5 Cyrillic                         ISO_8859-5:1988     4
28596  ISO 8859-6 Arabic                           ISO_8859-6:1987     4
28597  ISO 8859-7 Greek                            ISO_8859-7:1987     4
28598  ISO 8859-8 Hebrew                           ISO_8859-8:1988     4
28599  ISO 8859-9                                  ISO_8859-9:1989     4
29001  ?                                           ?                   ?
50000  User Defined                                (x-user-defined)    4     (*50000)
50220  Japanese (JIS)                              ISO-2022-JP (?)     4
50221  Japanese (JIS)                              ISO-2022-JP (?)     4     Allow 1 byte kana
50222  Japanese (JIS)                              ISO-2022-JP (?)     4     Allow 1 byte kana
50225  Korean                                      ISO-2022-KR         4
50932  Japanese (autodetect)                       ?                   4
50949  Korean (autodetect)                         ?                   4
51932  Japanese (EUC)                              EUC-JP              4
51949  Korean (EUC)                                EUC-KR              4
52936  Simplified Chinese                          HZ-GB-2312          4
65000  Unicode UTF-7                               UTF-7               4     (*65000)
65001  Unicode UTF-8                               UTF-8               4     (*65001)
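
As a rough illustration of why the ID matters: the same byte decodes to different characters under different code pages. The following Python sketch assumes a small hand-picked mapping from a few of the IDs above to Python codec names. The dictionary CODEC_FOR_CODE_PAGE and the helper decode_with_code_page are hypothetical names made up for this example; they are not part of Windows, and the mapping is far from complete.

```python
# Hypothetical mapping from a few Windows code page IDs (taken from the list
# above) to Python codec names. Hand-picked for illustration, not exhaustive.
CODEC_FOR_CODE_PAGE = {
    437: "cp437",
    850: "cp850",
    1200: "utf-16-le",
    1201: "utf-16-be",
    1251: "cp1251",
    1252: "cp1252",
    20866: "koi8-r",
    28591: "iso-8859-1",
    65001: "utf-8",
}


def decode_with_code_page(data: bytes, code_page: int) -> str:
    """Decode bytes with the Python codec assumed for a code page ID."""
    try:
        codec = CODEC_FOR_CODE_PAGE[code_page]
    except KeyError:
        raise LookupError(f"no codec known for code page {code_page}") from None
    return data.decode(codec)


if __name__ == "__main__":
    raw = b"\x82"
    # The byte 0x82 is a small e with acute accent in code page 437 ...
    print(decode_with_code_page(raw, 437))
    # ... but a single low-9 quotation mark in code page 1252.
    print(decode_with_code_page(raw, 1252))
    # Code pages 1200 and 1201 differ only in byte order.
    print(decode_with_code_page(b"A\x00", 1200), decode_with_code_page(b"\x00A", 1201))
```

An unknown ID raises LookupError instead of guessing, which seems the safer choice for a list that is known to have gaps.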

*             Additional Remarks
50000         50000 seems to be a pseudo code page. It appears in the Windows registry under HKEY_CLASSES_ROOT\MIME\Database\Codepage and HKEY_CURRENT_USER\Software\Microsoft\Internet Explorer\International. A corresponding NLS file does not exist.
65000, 65001  65000 (UTF-7) and 65001 (UTF-8) are pseudo code pages. There are no corresponding NLS files. These code page IDs can only be used with the WideCharToMultiByte() and MultiByteToWideChar() API calls.
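
To make the remark about 65001 a little more concrete, here is a minimal Python sketch that passes the ID to MultiByteToWideChar() through ctypes. The helper name utf8_bytes_to_text is hypothetical. On anything other than Windows the sketch simply falls back to Python's own UTF-8 codec, so only the Windows branch actually exercises the API. Treat it as an untuned illustration, not as the recommended way to call the function.

```python
import sys

CP_UTF8 = 65001  # the pseudo code page ID for UTF-8 from the list above


def utf8_bytes_to_text(data: bytes) -> str:
    """Decode UTF-8 bytes; on Windows, go through MultiByteToWideChar()."""
    if sys.platform != "win32" or not data:
        # Portable path. Also used for empty input, because the API treats
        # a zero-length source as an error.
        return data.decode("utf-8")

    import ctypes
    from ctypes import wintypes

    convert = ctypes.windll.kernel32.MultiByteToWideChar
    convert.argtypes = [wintypes.UINT, wintypes.DWORD, ctypes.c_char_p,
                        ctypes.c_int, ctypes.c_wchar_p, ctypes.c_int]
    convert.restype = ctypes.c_int

    # First call: ask how many UTF-16 code units the result needs.
    needed = convert(CP_UTF8, 0, data, len(data), None, 0)
    if needed == 0:
        raise OSError("MultiByteToWideChar failed to size the output")

    # Second call: convert into a buffer of exactly that size.
    out = ctypes.create_unicode_buffer(needed)
    written = convert(CP_UTF8, 0, data, len(data), out, needed)
    if written == 0:
        raise OSError("MultiByteToWideChar failed to convert the input")
    return out[:written]


if __name__ == "__main__":
    print(utf8_bytes_to_text("Grüße".encode("utf-8")))
```

The two-call pattern (size query first, conversion second) avoids guessing the output buffer length.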

#  Source
1  Windows SDK Documentation, Code Page Identifiers
2  Microsoft RTF Specification
3  Nadine Kano: Developing International Software for Windows 95 and Windows NT
4  Windows SDK Documentation, Component Library
5  Microsoft Knowledge Base Article Q153233
6  Microsoft SNA Server NLS Reference


©2000 by Torsten Mohrin. Updated 2000-05-09.