Asian Character Sets
The Asian character sets that we support include Chinese, Japanese, Korean, and Thai. These can be complicated. For example, the Chinese sets must allow for thousands of different characters. See , "The cp932 Character Set", for additional information about the cp932 and sjis character sets.
For answers to some common questions and problems relating support for Asian character sets in MySQL, see "MySQL 5.6 FAQ: MariaDB Chinese, Japanese, and Korean Character Sets".
big5(Big5 Traditional Chinese) collations:big5_binbig5_chinese_ci(default)
cp932(SJIS for Windows Japanese) collations:cp932_bincp932_japanese_ci(default)
eucjpms(UJIS for Windows Japanese) collations:eucjpms_bineucjpms_japanese_ci(default)
euckr(EUC-KR Korean) collations:euckr_bineuckr_korean_ci(default)
gb2312(GB2312 Simplified Chinese) collations:gb2312_bingb2312_chinese_ci(default)
gbk(GBK Simplified Chinese) collations:gbk_bingbk_chinese_ci(default)
sjis(Shift-JIS Japanese) collations:sjis_binsjis_japanese_ci(default)
tis620(TIS620 Thai) collations:tis620_bintis620_thai_ci(default)
ujis(EUC-JP Japanese) collations:ujis_binujis_japanese_ci(default)
The big5_chinese_ci collation sorts on number of strokes.
For additional information about Asian collations in MySQL, see Collation-Charts.Org (big5, cp932, eucjpms, euckr, gb2312, gbk, sjis, tis620, ujis).