- 浏览: 167422 次
- 性别:
- 来自: 广州
文章分类
最新评论
-
兰斯洛特1987:
顶!!!!谢谢分享.最近我也在研究这玩意...
Java语言的Hook实现 -
pu02203:
我把Confidant.jar, 丢进去eclipse, 里面 ...
重磅推出诛仙辅助软件第二波:Java版按键精灵 -
泣血端午:
Calculagraph 这是哪个类啊?
A星算法 -
haitaohehe:
我使用的是jstl1.0 可是在使用<c:set va ...
JSTL1.0和JSTL1.1的区别 -
micheal19840929:
学习楼主,我也测试一下~看看兼容性吧。lanlanzhilia ...
手机版飞鸽传书:无线牵
The classes java.io.InputStreamReader, java.io.OutputStreamWriter, java.lang.String, and classes in the java.nio.charset package can convert between Unicode and a number of other character encodings. The supported encodings vary between different implementations of the Java 2 platform. The java.lang package specification and the class description for java.nio.charset.Charset list the encodings that any implementation of the Java 2 platform, Standard Edition, v. 1.4.2 is required to support.
Sun's Java 2 Software Development Kit, Standard Edition, v. 1.4.2 for all platforms (SolarisTM operating environment, Linux, and Microsoft Windows) and the Java 2 Runtime Environment, Standard Edition, v. 1.4.2 for Solaris and Linux support all encodings shown on this page. Sun's Java 2 Runtime Environment, Standard Edition, v. 1.4.2 for Windows may be installed as a complete international version or as a European languages version. The J2RE installer by default installs a European languages version if it recognizes that the host operating system only supports European languages. If the installer recognizes that any other language is needed, or if the user requests support for non-European languages in a customized installation, a complete international version is installed. The European languages version only supports the encodings shown in the first table. The international version (which includes the lib/charsets.jar file) supports all encodings shown on this page.
The following tables show the encoding sets supported by J2SE 1.4.2. The canonical names used by the new java.nio APIs are in many cases not the same as those used in the java.io and java.lang APIs.
Basic Encoding Set (contained in lib/rt.jar)
Supported by java.nio, java.io and java.lang APIs
US-ASCII |
ASCII |
American Standard Code for Information Interchange |
windows-1250 |
Cp1250 |
Windows Eastern European |
windows-1251 |
Cp1251 |
Windows Cyrillic |
windows-1252 |
Cp1252 |
Windows Latin-1 |
windows-1253 |
Cp1253 |
Windows Greek |
windows-1254 |
Cp1254 |
Windows Turkish |
windows-1257 |
Cp1257 |
Windows Baltic |
ISO-8859-1 |
ISO8859_1 |
ISO 8859-1, Latin Alphabet No. 1 |
ISO-8859-2 |
ISO8859_2 |
Latin Alphabet No. 2 |
ISO-8859-4 |
ISO8859_4 |
Latin Alphabet No. 4 |
ISO-8859-5 |
ISO8859_5 |
Latin/Cyrillic Alphabet |
ISO-8859-7 |
ISO8859_7 |
Latin/Greek Alphabet |
ISO-8859-9 |
ISO8859_9 |
Latin Alphabet No. 5 |
ISO-8859-13 |
ISO8859_13 |
Latin Alphabet No. 7 |
ISO-8859-15 |
ISO8859_15 |
Latin Alphabet No. 9 |
KOI8-R |
KOI8_R |
KOI8-R, Russian |
UTF-8 |
UTF8 |
Eight-bit UCS Transformation Format |
UTF-16 |
UTF-16 |
Sixteen-bit UCS Transformation Format, byte order identified by an optional byte-order mark |
UTF-16BE |
UnicodeBigUnmarked |
Sixteen-bit Unicode Transformation Format, big-endian byte order |
UTF-16LE |
UnicodeLittleUnmarked |
Sixteen-bit Unicode Transformation Format, little-endian byte order |
Not available |
UnicodeBig |
Sixteen-bit Unicode Transformation Format, big-endian byte order, with byte-order mark |
Not available |
UnicodeLittle |
Sixteen-bit Unicode Transformation Format, little-endian byte order, with byte-order mark |
Extended Encoding Set (contained in lib/charsets.jar)
Supported by java.nio, java.io and java.lang APIs
windows-1255 |
Cp1255 |
Windows Hebrew |
windows-1256 |
Cp1256 |
Windows Arabic |
windows-1258 |
Cp1258 |
Windows Vietnamese |
ISO-8859-3 |
ISO8859_3 |
Latin Alphabet No. 3 |
ISO-8859-6 |
ISO8859_6 |
Latin/Arabic Alphabet |
ISO-8859-8 |
ISO8859_8 |
Latin/Hebrew Alphabet |
windows-31j |
MS932 |
Windows Japanese |
EUC-JP |
EUC_JP |
JISX 0201, 0208 and 0212, EUC encoding Japanese |
x-EUC-JP-LINUX |
EUC_JP_LINUX |
JISX 0201, 0208 , EUC encoding Japanese |
Shift_JIS |
SJIS |
Shift-JIS, Japanese |
ISO-2022-JP |
ISO2022JP |
JIS X 0201, 0208, in ISO 2022 form, Japanese |
x-mswin-936 |
MS936 |
Windows Simplified Chinese |
GB18030 |
GB18030 |
Simplified Chinese, PRC standard |
x-EUC-CN |
EUC_CN |
GB2312, EUC encoding, Simplified Chinese |
GBK |
GBK |
GBK, Simplified Chinese |
ISCII91 |
ISCII91 |
ISCII91 encoding of Indic scripts |
x-windows-949 |
MS949 |
Windows Korean |
EUC-KR |
EUC_KR |
KS C 5601, EUC encoding, Korean |
ISO-2022-KR |
ISO2022KR |
ISO 2022 KR, Korean |
x-windows-950 |
MS950 |
Windows Traditional Chinese |
x-MS950-HKSCS |
MS950_HKSCS |
Windows Traditional Chinese with Hong Kong extensions |
x-EUC-TW |
EUC_TW |
CNS11643 (Plane 1-3), EUC encoding, Traditional Chinese |
Big5 |
Big5 |
Big5, Traditional Chinese |
Big5-HKSCS |
Big5_HKSCS |
Big5 with Hong Kong extensions, Traditional Chinese |
TIS-620 |
TIS620 |
TIS620, Thai |
Extended Encoding Set (contained in lib/charsets.jar)
Supported by java.io and java.lang APIs
Big5_Solaris |
Big5 with seven additional Hanzi ideograph character mappings for the Solaris zh_TW.BIG5 locale |
Cp037 |
USA, Canada (Bilingual, French), Netherlands, Portugal, Brazil, Australia |
Cp273 |
IBM Austria, Germany |
Cp277 |
IBM Denmark, Norway |
Cp278 |
IBM Finland, Sweden |
Cp280 |
IBM Italy |
Cp284 |
IBM Catalan/Spain, Spanish Latin America |
Cp285 |
IBM United Kingdom, Ireland |
Cp297 |
IBM France |
Cp420 |
IBM Arabic |
Cp424 |
IBM Hebrew |
Cp437 |
MS-DOS United States, Australia, New Zealand, South Africa |
Cp500 |
EBCDIC 500V1 |
Cp737 |
PC Greek |
Cp775 |
PC Baltic |
Cp838 |
IBM Thailand extended SBCS |
Cp850 |
MS-DOS Latin-1 |
Cp852 |
MS-DOS Latin-2 |
Cp855 |
IBM Cyrillic |
Cp856 |
IBM Hebrew |
Cp857 |
IBM Turkish |
Cp858 |
Variant of Cp850 with Euro character |
Cp860 |
MS-DOS Portuguese |
Cp861 |
MS-DOS Icelandic |
Cp862 |
PC Hebrew |
Cp863 |
MS-DOS Canadian French |
Cp864 |
PC Arabic |
Cp865 |
MS-DOS Nordic |
Cp866 |
MS-DOS Russian |
Cp868 |
MS-DOS Pakistan |
Cp869 |
IBM Modern Greek |
Cp870 |
IBM Multilingual Latin-2 |
Cp871 |
IBM Iceland |
Cp874 |
IBM Thai |
Cp875 |
IBM Greek |
Cp918 |
IBM Pakistan (Urdu) |
Cp921 |
IBM Latvia, Lithuania (AIX, DOS) |
Cp922 |
IBM Estonia (AIX, DOS) |
Cp930 |
Japanese Katakana-Kanji mixed with 4370 UDC, superset of 5026 |
Cp933 |
Korean Mixed with 1880 UDC, superset of 5029 |
Cp935 |
Simplified Chinese Host mixed with 1880 UDC, superset of 5031 |
Cp937 |
Traditional Chinese Host miexed with 6204 UDC, superset of 5033 |
Cp939 |
Japanese Latin Kanji mixed with 4370 UDC, superset of 5035 |
Cp942 |
IBM OS/2 Japanese, superset of Cp932 |
Cp942C |
Variant of Cp942 |
Cp943 |
IBM OS/2 Japanese, superset of Cp932 and Shift-JIS |
Cp943C |
Variant of Cp943 |
Cp948 |
OS/2 Chinese (Taiwan) superset of 938 |
Cp949 |
PC Korean |
Cp949C |
Variant of Cp949 |
Cp950 |
PC Chinese (Hong Kong, Taiwan) |
Cp964 |
AIX Chinese (Taiwan) |
Cp970 |
AIX Korean |
Cp1006 |
IBM AIX Pakistan (Urdu) |
Cp1025 |
IBM Multilingual Cyrillic: Bulgaria, Bosnia, Herzegovinia, Macedonia (FYR) |
Cp1026 |
IBM Latin-5, Turkey |
Cp1046 |
IBM Arabic - Windows |
Cp1047 |
Latin-1 character set for EBCDIC hosts |
Cp1097 |
IBM Iran (Farsi)/Persian |
Cp1098 |
IBM Iran (Farsi)/Persian (PC) |
Cp1112 |
IBM Latvia, Lithuania |
Cp1122 |
IBM Estonia |
Cp1123 |
IBM Ukraine |
Cp1124 |
IBM AIX Ukraine |
Cp1140 |
Variant of Cp037 with Euro character |
Cp1141 |
Variant of Cp273 with Euro character |
Cp1142 |
Variant of Cp277 with Euro character |
Cp1143 |
Variant of Cp278 with Euro character |
Cp1144 |
Variant of Cp280 with Euro character |
Cp1145 |
Variant of Cp284 with Euro character |
Cp1146 |
Variant of Cp285 with Euro character |
Cp1147 |
Variant of Cp297 with Euro character |
Cp1148 |
Variant of Cp500 with Euro character |
Cp1149 |
Variant of Cp871 with Euro character |
Cp1381 |
IBM OS/2, DOS People's Republic of China (PRC) |
Cp1383 |
IBM AIX People's Republic of China (PRC) |
Cp33722 |
IBM-eucJP - Japanese (superset of 5050) |
ISO2022_CN_CNS |
CNS11643 in ISO 2022 CN form, Traditional Chinese (conversion from Unicode only) |
ISO2022_CN_GB |
GB2312 in ISO 2022 CN form, Simplified Chinese (conversion from Unicode only) |
JISAutoDetect |
Detects and converts from Shift-JIS, EUC-JP, ISO 2022 JP (conversion to Unicode only) |
MS874 |
Windows Thai |
MacArabic |
Macintosh Arabic |
MacCentralEurope |
Macintosh Latin-2 |
MacCroatian |
Macintosh Croatian |
MacCyrillic |
Macintosh Cyrillic |
MacDingbat |
Macintosh Dingbat |
MacGreek |
Macintosh Greek |
MacHebrew |
Macintosh Hebrew |
MacIceland |
Macintosh Iceland |
MacRoman |
Macintosh Roman |
MacRomania |
Macintosh Romania |
MacSymbol |
Macintosh Symbol |
MacThai |
Macintosh Thai |
MacTurkish |
Macintosh Turkish |
MacUkraine |
Macintosh Ukraine |
转自:http://java.sun.com/j2se/1.4.2/docs/guide/intl/encoding.doc.html
发表评论
-
快速开方算法比较
2010-08-18 17:34 1390快速开方算法比较: public class Test { ... -
Java父线程与子线程
2010-06-12 12:49 2761问:在父线程中New了一个子线程,想在停止父线程时也停止子线程 ... -
反射在设计模式中的应用
2010-06-08 10:52 738利用设计模式可以使我 ... -
Reader和InputStream的区别
2010-05-28 13:21 1281Reader支持16位的Unicode字符输出,InputSt ... -
[原创]多线程之队列式执行任务
2010-04-29 10:24 967前言:最近做的一个项目正好要用到多线程,而且要实现 ... -
java内存的思考与总结
2010-02-10 16:13 01. java中堆与栈 在Java程 ... -
关于switch语句
2010-02-10 16:10 814switch语句在一定程度上是有动态跳转(函数指针就是一种动态 ... -
代码优化之-优化除法
2010-02-09 13:57 2517说明:文章中的很多数据可能在新的CPU或不同的CPU或不同的系 ... -
应用JAD批量反编译CLASS文件
2010-02-09 12:31 819反编译别人的程序是不对的,但是有时候由于某种原因,不得不反编译 ... -
数组快速复制
2010-02-08 15:33 708data[0] = 0 ; int len = 1 ; i ... -
深刻理解Java编程的7个例子
2010-01-14 10:13 6441. 阅读下列代码回答问 ... -
原子变量(AtomicLong, AtomicInteger, AtomicReference)
2009-11-06 09:30 3323J2SE 5.0提供了一组atomic class来帮助我们简 ... -
重磅推出诛仙辅助软件第二波:Java版按键精灵
2009-09-16 18:43 3996前言:上次推出诛仙答题辅助软件用起来不错(想上十名榜不难,YY ... -
Java语言的Hook实现
2009-08-25 17:39 11384引言:最近在玩完美时 ...
相关推荐
6 Java的i18n问题 7 Java国际化——资源包的使用 8 国际化 9 Supported Encodings 10 native2ascii - Native-to-ASCII Converter 11 Jive 3.x UTF-8中文附件问题 12 JF3.1.3 JKB...
4 Jive的中文问题及解决办法 5 Java程序的国际化和本地化介绍 6 Java的i18n问题 7 Java国际化——资源包的使用 8 国际化 9 Supported Encodings 10 native2ascii - Native-to-...
Supported VNC encodings: raw, copyrect, rre, hextile, tight, tightPNG WebSocket SSL/TLS encryption (i.e. "wss://") support 24-bit true color and 8 bit colour mapped Supports desktop resize ...
lxml是python的一个解析库,支持HTML和XML的解析,支持XPath解析方式,下面这篇文章主要给大家介绍了关于python3解析库lxml的安装与使用的相关资料,文中通过示例代码介绍的非常详细,需要的朋友可以参考下
■Asymmetric key paddings/encodings: ISO9796d1, OAEP, and PKCS#1. ■Digests: GOST3411, MD2, MD4, MD5, RIPEMD128, RIPEMD160, RIPEMD256, RIPEMD320, SHA-1, SHA-224, SHA-256, SHA-384, SHA-512, Tiger, and...
set fileencodings=utf-8,gb2312,gbk set encoding=gbk ``` 这些设置项确保了VIM在打开、保存文件时使用GBK编码,同时也能正确地处理UTF-8编码的文件。 此外,在Ubuntu的终端使用过程中,可能会遇到使用cat命令查看...
* If users input a character that is not supported by current encoding, this character will be converted to U+XXXX. Unicode format. * Regular Expression search & replace (by using Boost.Xpressive). * ...
Variants supported 11. Further information 12. Version history _______________________________________________________________________________ 1. ARM assembly compatibility There are a few ...
Operating Systems No Longer Supported API and Feature Removals Code Cleanups Porting to Python 3.4 Changes in ‘python’ Command Behavior Changes in the Python API Changes in the C API Changed ...
sudogedit /var/lib/locales/supported.d/local ``` **步骤二:** 在文件中加入以下内容: ``` zh_CN.GBK GBK zh_CN.GB2312 GB2312 zh_CN.GB18030 GB18030 ``` **步骤三:** 运行命令更新本地化设置: ``` sudo...
- **Forwarding and Multiple Inheritance:** In Objective-C, multiple inheritance is not directly supported. However, the document discusses how message forwarding can be used in scenarios involving ...
**步骤1:** 编辑 `/var/lib/locales/supported.d/local` 文件,并在其中添加以下内容: ``` zh_CN.UTF8 UTF-8 zh_CN.GBK GBK zh_CN.GB2312 GB2312 ``` **步骤2:** 通过运行 `sudo dpkg-reconfigure --force ...
WARNING: AnonTLS is not supported on Android 6, so to use Vino, the built-in VNC server for GNOME you have to disable Vino encryption requirement with "gsettings set org.gnome.Vino require-encryption ...
Lua strings are fully 8-bit clean, so simple uses are supported (like storing and retrieving), but there's no built in support for more sophisticated uses. For a fuller story, see below. Unicode ...
¹ 在的意义上设置添加到您的Gemfile : gem 'characteristics'用法 # All supported encodingschar_info = Characteristics . create ( character )char_info . valid? # => true / falsechar_info . unicode? # =...
- **配置语言环境**:编辑`/var/lib/locales/supported.d/zh`文件,例如: ```bash zh_CN.GB18030 GB18030 zh_CN.GBK GBK zh_CN.GB2312 GB2312 zh_CN.GB18031 GB18031 zh_HK.BIG5 BIG5 zh_TW.BIG5 BIG5 ``` ...
打开终端,使用`sudo vim /var/lib/locales/supported.d/local`编辑文件,添加如下内容: ``` zh_CN.GBK GBK zh_CN.GB2312 GB2312 zh_CN.GB18030 GB18030 ``` 然后执行`sudo dpkg-reconfigure locales`使改动...
When Netatalk is being switched to UTF-8 which is supported in version 2 then it is NOT sufficient to rename the file names. There needs to be done more. See ...
documents, however this remains supported for backward compatibility reasons. ; Note that this directive does not control the , which can be ; used regardless of this directive. ; Default Value: On ;...