Available Character Encodings

DeltaWalker supports the same wide range of character encodings as the Java Platform, Standard Edition (Java SE), version 1.6 or newer. A non-exhaustive list of encodings follows.

The first column in both tables contains the canonical name of each encoding, which is the name DeltaWalker uses. When an encoding has an alternative name, it is listed in the second column. The alternative names are gradually being phased out, but they are listed here because many people are familiar with them. The third column gives a brief description of each encoding.
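The relationship between canonical and alternative names can be checked against the Java runtime itself. The following is a minimal sketch, assuming a standard Java SE installation; the class and method names (`CanonicalNameDemo`, `canonicalName`) are hypothetical and are not part of DeltaWalker. It relies only on `java.nio.charset.Charset.forName`, which accepts aliases as well as canonical names.

```java
import java.nio.charset.Charset;

// Hypothetical helper, not part of DeltaWalker: resolves an encoding name
// or alias to the canonical name reported by the running JVM.
class CanonicalNameDemo {

    // Throws UnsupportedCharsetException if the JVM does not know the name.
    static String canonicalName(String nameOrAlias) {
        return Charset.forName(nameOrAlias).name();
    }

    public static void main(String[] args) {
        // A few alternative names taken from the second column of the tables.
        String[] alternativeNames = {"ASCII", "Cp1252", "ISO8859_1", "UTF8"};
        for (String alt : alternativeNames) {
            System.out.println(alt + " -> " + canonicalName(alt));
        }
    }
}
```

On a typical installation each alternative name should resolve to the canonical name shown in the first column, for example `UTF8` to `UTF-8`.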

Basic Character Encodings

Name          Alternative Name       Description
------------  ---------------------  -----------
US-ASCII      ASCII                  American Standard Code for Information Interchange
windows-1250  Cp1250                 Windows Eastern European
windows-1251  Cp1251                 Windows Cyrillic
windows-1252  Cp1252                 Windows Latin-1
windows-1253  Cp1253                 Windows Greek
windows-1254  Cp1254                 Windows Turkish
windows-1257  Cp1257                 Windows Baltic
ISO-8859-1    ISO8859_1              ISO 8859-1, Latin Alphabet No. 1
ISO-8859-2    ISO8859_2              Latin Alphabet No. 2
ISO-8859-4    ISO8859_4              Latin Alphabet No. 4
ISO-8859-5    ISO8859_5              Latin/Cyrillic Alphabet
ISO-8859-7    ISO8859_7              Latin/Greek Alphabet
ISO-8859-9    ISO8859_9              Latin Alphabet No. 5
ISO-8859-13   ISO8859_13             Latin Alphabet No. 7
ISO-8859-15   ISO8859_15             Latin Alphabet No. 9
KOI8-R        KOI8_R                 KOI8-R, Russian
UTF-8         UTF8                   Eight-bit UCS Transformation Format
UTF-16                               Sixteen-bit UCS Transformation Format, byte order identified by an optional byte-order mark
UTF-16BE      UnicodeBigUnmarked     Sixteen-bit Unicode Transformation Format, big-endian byte order
UTF-16LE      UnicodeLittleUnmarked  Sixteen-bit Unicode Transformation Format, little-endian byte order
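The difference between the three UTF-16 variants is easiest to see in the bytes they produce. The following is a minimal sketch of how the Java runtime encodes a single character with each one; the class and method names (`Utf16ByteOrderDemo`, `hexBytes`) are hypothetical, and the sketch shows the behavior of Java's own encoders, not necessarily what DeltaWalker writes when saving a file.

```java
import java.nio.charset.Charset;

// Hypothetical helper, not part of DeltaWalker: shows the bytes Java's
// encoders produce for the same text under different UTF-16 variants.
class Utf16ByteOrderDemo {

    // Encodes text with the named charset and renders the bytes as hex pairs.
    static String hexBytes(String text, String charsetName) {
        byte[] bytes = text.getBytes(Charset.forName(charsetName));
        StringBuilder sb = new StringBuilder();
        for (byte b : bytes) {
            if (sb.length() > 0) {
                sb.append(' ');
            }
            sb.append(String.format("%02X", b));
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        for (String name : new String[] {"UTF-16", "UTF-16BE", "UTF-16LE"}) {
            System.out.println(name + ": " + hexBytes("A", name));
        }
    }
}
```

When encoding, Java's `UTF-16` charset writes a big-endian byte-order mark (`FE FF`) before the text, while `UTF-16BE` and `UTF-16LE` write no mark and differ only in the order of the two bytes of each code unit.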

 

Extended Encodings

Name            Alternative Name  Description
--------------  ----------------  -----------
windows-1255    Cp1255            Windows Hebrew
windows-1256    Cp1256            Windows Arabic
windows-1258    Cp1258            Windows Vietnamese
ISO-8859-3      ISO8859_3         Latin Alphabet No. 3
ISO-8859-6      ISO8859_6         Latin/Arabic Alphabet
ISO-8859-8      ISO8859_8         Latin/Hebrew Alphabet
windows-31j     MS932             Windows Japanese
EUC-JP          EUC_JP            JIS X 0201, 0208 and 0212, EUC encoding, Japanese
x-EUC-JP-LINUX  EUC_JP_LINUX      JIS X 0201, 0208, EUC encoding, Japanese
Shift_JIS       SJIS              Shift-JIS, Japanese
ISO-2022-JP     ISO2022JP         JIS X 0201, 0208, in ISO 2022 form, Japanese
x-mswin-936     MS936             Windows Simplified Chinese
GB18030                           Simplified Chinese, PRC standard
x-EUC-CN        EUC_CN            GB2312, EUC encoding, Simplified Chinese
GBK                               GBK, Simplified Chinese
ISCII91         ISCII91           ISCII91 encoding of Indic scripts
x-windows-949   MS949             Windows Korean
EUC-KR          EUC_KR            KS C 5601, EUC encoding, Korean
ISO-2022-KR     ISO2022KR         ISO 2022 KR, Korean
x-windows-950   MS950             Windows Traditional Chinese
x-MS950-HKSCS   MS950_HKSCS       Windows Traditional Chinese with Hong Kong extensions
x-EUC-TW        EUC_TW            CNS11643 (Plane 1-3), EUC encoding, Traditional Chinese
Big5                              Big5, Traditional Chinese
Big5-HKSCS      Big5_HKSCS        Big5 with Hong Kong extensions, Traditional Chinese
TIS-620         TIS620            TIS620, Thai
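Because the lists above are non-exhaustive and the set of installed encodings can differ from one Java installation to another, it may be useful to check which names a particular Java runtime recognizes. The following is a minimal sketch using `java.nio.charset.Charset.isSupported`; the class and method names (`EncodingAvailabilityDemo`, `unsupported`) are hypothetical and are not part of DeltaWalker.

```java
import java.nio.charset.Charset;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of DeltaWalker: reports which encoding
// names the running JVM does not recognize.
class EncodingAvailabilityDemo {

    // Returns the subset of the given names that the JVM does not support.
    // Names must be syntactically legal charset names, as those in the tables are.
    static List<String> unsupported(String... names) {
        List<String> missing = new ArrayList<String>();
        for (String name : names) {
            if (!Charset.isSupported(name)) {
                missing.add(name);
            }
        }
        return missing;
    }

    public static void main(String[] args) {
        // A sample of names from the Extended Encodings table.
        List<String> missing =
                unsupported("windows-1255", "EUC-JP", "GB18030", "Big5", "TIS-620");
        if (missing.isEmpty()) {
            System.out.println("All sampled encodings are available.");
        } else {
            System.out.println("Not available in this JVM: " + missing);
        }
    }
}
```

An empty result means every sampled encoding is available in that runtime; any names that are returned are not recognized by it.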
