IANA encoding | Java Canonical Name | Language | Comment |
UTF-8 | UTF8 | 8bit Universal character set | |
UTF-16 | UTF-16 | 16bit Universal character set | |
US-ASCII | ASCII | American Standard Code for Information Interchange | |
windows-1250 | Cp1250 | Eastern European (Albanian, Croatian, Czech, English, German, Hungarian, Latin, Polish, Romanian, Slovak, Slovenian, Serbian) | Windows encoding |
windows-1251 | Cp1251 | Eastern European (Cyrillic-based: Bulgarian, Byelorussian, Macedonian, Russian, Serbian, Ukrainian | Windows encoding |
windows-1252 | Cp1252 | Western European (Albanian, Basque, Breton, Catalan, Danish, Dutch, English, Faeroese, Finnish, French, German, Greenlandic, Icelandic, Irish Gaelic, Italian, Latin, Luxemburgish, Norwegian, Portuguese, Rhaeto-Romanic, Scottish Gaelic, Spanish, Swedish) | Windows encoding |
windows-1253 | Cp1253 | Greek | Windows encoding |
windows-1254 | Cp1254 | Turkish | Windows encoding |
windows-1255 | Cp1255 | Hebrew | Windows encoding |
windows-1256 | Cp1256 | Arabic | Windows encoding |
windows-1257 | Cp1257 | Baltic | Windows encoding |
windows-1258 | Cp1258 | Vietnamese | Windows encoding |
ISO-8859-1 | ISO8859_1 | Western European (Albanian, Basque, Breton, Catalan, Danish, Dutch, English, Faeroese, Finnish, French, German, Greenlandic, Icelandic, Irish Gaelic, Italian, Latin, Luxemburgish, Norwegian, Portuguese, Rhaeto-Romanic, Scottish Gaelic, Spanish, Swedish) | Euro Symbol is not supported |
ISO-8859-2 | ISO8859_2 | Eastern European (Albanian, Croatian, Czech, English, German, Hungarian, Latin, Polish, Romanian, Slovak, Slovenian, Serbian) | Euro Symbol is not supported |
ISO-8859-3 | ISO8859_3 | Southeastern European (Afrikaans, Catalan, Dutch, English, Esperanto, German, Italian, Maltese, Spanish, Turkish) | |
ISO-8859-4 | ISO8859_4 | Northern European (Danish, English, Estonian, Finnish, German, Greenlandic, Latin, Latvian, Lithuanian, Norwegian, Sテ。mi, Slovenian, Swedish) | |
ISO-8859-5 | ISO8859_5 | Eastern European (Cyrillic-based: Bulgarian, Byelorussian, Macedonian, Russian, Serbian, Ukrainian) | |
ISO-8859-6 | ISO8859_6 | Arabic | |
ISO-8859-7 | ISO8859_7 | Greek | |
ISO-8859-8 | ISO8859_8 | Hebrew | |
ISO-8859-9 | ISO8859_9 | Western European (Albanian, Basque, Breton, Catalan, Cornish, Danish, Dutch, English, Finnish, French, Frisian, Galician, German, Greenlandic, Irish Gaelic, Italian, Latin, Luxemburgish, Norwegian, Portuguese, Rhaeto-Romanic, Scottish Gaelic, Spanish, Swedish, Turkish) | |
ISO-8859-13 | ISO8859_13 | Baltic Rim (English, Estonian, Finnish, Latin, Latvian, Norwegian) | |
ISO-8859-15 | ISO8859_15 | Western European (Albanian, Basque, Breton, Catalan, Danish, Dutch, English, Faeroese, Finnish, French, German, Greenlandic, Icelandic, Irish Gaelic, Italian, Latin, Luxemburgish, Norwegian, Portuguese, Rhaeto-Romanic, Scottish Gaelic, Spanish, Swedish) | ISO-8859-1 with Euro symbol support |
windows-31j | MS932 | Japanese | Windows encoding |
EUC-JP | EUC_JP | Japanese | EUC encoding used on Unix platform |
Shift_JIS | SJIS | Japanese | Shift JIS, does not support MS external characters |
ISO-2022-JP | ISO2022JP | Japanese | JIS X 0201, 0208, in ISO 2022 form, this is used for e-mail |
x-mswin-936 | MS936 | Simplified Chinese | Windows encoding, This is not registered in IANA. |
GB18030 | GB18030 | Simplified Chinese | PRC standard |
x-EUC-CN | EUC_CN | Simplified Chinese | GB2312, EUC encoding |
GBK | GBK | Simplified Chinese | |
x-windows-949 | MS949 | Korean | Windows encoding, this is not registered in IANA. |
EUC-KR | EUC_KR | Korean | KS C 5601, EUC encoding |
x-windows-950 | MS950 | Traditional Chinese | Windows encoding, this is not registered in IANA |
x-MS950-HKSCS | MS950_HKSCS | Traditional Chinese with Hong Kong extensions | Windows encoding, this is not registered in IANA |
x-EUC-TW | EUC_TW | Traditional Chinese | CNS11643 (Plane 1-3), EUC encoding, this is not registered in IANA |
Big5 | Big5 | Traditional Chinese | |
Big5-HKSCS | Big5_HKSCS | Traditional Chinese | Big5 with Hong Kong extensions |
TIS-620 | TIS620 | Thai |
Advertisement
384,409
pages
Character Encoding Recommendation for Languages
Advertisement