[Project X0213][Japanese version]

JIS X 0213 Code Mapping Tables

Mapping Tables between JIS X 0213 and Unicode

Below are mapping (conversion) tables between JIS X 0213 encodings and Unicode (ISO/IEC 10646 UCS). Feel free to copy. Note that these tables are made by volunteers and not authorized by any official organization for standardization even though they are intended to be identical to the standard mapping.

These tables are based on the tables for JIS X 0213:2000 available at "JIS-UCS 変換表" and modified according to JIS X 0213:2004. The table for Shift_JIS-2004 above is equivalent to the table available at "JIS X 0213:2004対応による変更 点".

If you find an error in these tables, please contact the author via "JIS X 0213 Mailing List" or the 相談所 in JIS X 0213 Wiki.

Mapping Tables between Single Byte Code and JIS X 0213

Brief Description of JIS X 0213 Encodings

JIS X 0213 defines two double-byte ISO2022-compliant coded character sets. Kanji Set 1 (or Plane 1) is a superset of JIS X 0208 containing kanji, hiragana, katakana, Latin, Greek, symbols, and so on. Kanji Set 2 (or Plane 2) is a newly designed coded character set containing kanji characters only. Several encoding schemes of these character sets are defined in JIS X 0213. Some of them are encoding methods using JIS X 0213 and ASCII (or JIS X 0201) together.

Shift_JIS-2004

Shift_JIS-2004 is the JIS X 0213 version of Shift_JIS encoding. It is a superset of Shift_JIS. In this encoding, JIS X 0213 double-byte code is transformed by a formula.

Shift_JISX0213 is the name of the previous version of this encoding. iconv uses the name Shift_JISX0213 for Shift_JIS-2004.

EUC-JIS-2004

EUC-JIS-2004 is the JIS X 0213 version of EUC-JP encoding. This encoding is compliant to ISO/IEC 2022 but no escape sequence is used. It is a superset of EUC-JP except for JIS X 0212 in EUC-JP (and JIS X 0212 in EUC-JP is rarely used).

0x8E (Single Shift 2) and 0x8F (Single Shift 3) are "single shift" codes; i.e., they have effect on just one following character. When representing a sequence of kanji of plane 2, 0x8F needs to appear before each single kanji.

EUC-JISX0213 is the name of the previous version of this encoding. iconv uses the name EUC-JISX0213 for EUC-JIS-2004.

ISO-2022-JP-2004

ISO-2022-JP-2004 is the JIS X 0213 version of ISO-2022-JP encoding. It is a 7-bit stateful encoding using escape sequences to switch coded character sets.

Escape sequences used in this encoding are following:

For compatibility reason, "1B 24 28 4F" (escape sequence for plane 1 of JIS X 0213:2000) can be used with a little restriction. Also, "1B 24 42" (escape sequence for JIS X 0208:1983) can be used with some restriction.

ISO-2022-JP-3 is the name of the previous version of this encoding.

JIS X 0213 7-bit code

JIS X 0213 7-bit code is a pure double-byte code; i.e., no single-byte code like ASCII is used together. It is a superset of 7-bit code of JIS X 0208.

Plane 1 and 2 of JIS X 0213 are switched by Shift Out (SO) and Shift In (SI) control characters. The initial state is plane 1. SO switches to plane 2. When SI appears after that, the state is back to plane 1. That is, this encoding is stateful.

JIS X 0213 8-bit code

JIS X 0213 8-bit code is a pure double-byte code; i.e., no single-byte code like ASCII is used together. It is a superset of 7-bit code of JIS X 0208.

Related Document


Copyright © 2006-2013 Project X0213