(Repost) Character Sets

http://a4esl.org/c/charset.html

Character Sets
charset=

Some Basic Ones

charset=big5
Chinese Traditional (Big5)
charset=euc-kr
Korean (EUC)
charset=iso-8859-1
Western Alphabet (ISO)
charset=iso-8859-2
Central European Alphabet (ISO)
charset=iso-8859-3
Latin 3 Alphabet (ISO)
charset=iso-8859-4
Baltic Alphabet (ISO)
charset=iso-8859-5
Cyrillic Alphabet (ISO)
charset=iso-8859-6
Arabic Alphabet (ISO)
charset=iso-8859-7
Greek Alphabet (ISO)
charset=iso-8859-8
Hebrew Alphabet (ISO)
charset=koi8-r
Cyrillic Alphabet (KOI8-R)
charset=shift_jis
Japanese (Shift-JIS)
charset=euc-jp
Japanese (EUC)
charset=utf-8
Universal Alphabet (UTF-8)
charset=windows-1250
Central European Alphabet (Windows)
charset=windows-1251
Cyrillic Alphabet (Windows)
charset=windows-1252
Western Alphabet (Windows)
charset=windows-1253
Greek Alphabet (Windows)
charset=windows-1254
Turkish Alphabet (Windows)
charset=windows-1255
Hebrew Alphabet (Windows)
charset=windows-1256
Arabic Alphabet (Windows)
charset=windows-1257
Baltic Alphabet (Windows)
charset=windows-1258
Vietnamese Alphabet (Windows)
charset=windows-874
Thai (Windows)
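
To illustrate where a charset= value is typically used, here is a minimal Python sketch that reads the charset parameter from a Content-Type value and decodes bytes with it. The helper names charset_from_content_type and decode_body are hypothetical, the UTF-8 fallback is an assumption, and not every label in these lists is necessarily known to Python's codec registry.

```python
from email.message import Message


def charset_from_content_type(value, default="utf-8"):
    """Return the charset parameter of a Content-Type value, lowercased."""
    msg = Message()
    msg["Content-Type"] = value
    # get_content_charset() returns None when no charset parameter is present;
    # falling back to UTF-8 here is an assumption of this sketch.
    return msg.get_content_charset() or default


def decode_body(raw, content_type):
    """Decode raw bytes using the charset named in content_type."""
    return raw.decode(charset_from_content_type(content_type))


# Round-trip a short Cyrillic string through KOI8-R.
raw = "Привет".encode("koi8-r")
text = decode_body(raw, "text/html; charset=koi8-r")
```

Charset labels are matched case-insensitively, which is why the helper returns the lowercased form.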

A Longer List

Arabic (ASMO 708)
##charset=ASMO-708
Arabic (DOS)
##charset=DOS-720
Arabic (ISO)
##charset=iso-8859-6
Arabic (Mac)
##charset=x-mac-arabic
Arabic (Windows)
##charset=windows-1256
Baltic (DOS)
##charset=ibm775
Baltic (ISO)
##charset=iso-8859-4
Baltic (Windows)
##charset=windows-1257
Central European (DOS)
##charset=ibm852
Central European (ISO)
##charset=iso-8859-2
Central European (Mac)
##charset=x-mac-ce
Central European (Windows)
##charset=windows-1250
Chinese Simplified (EUC)
##charset=EUC-CN
Chinese Simplified (GB2312)
##charset=gb2312
Chinese Simplified (HZ)
##charset=hz-gb-2312
Chinese Simplified (Mac)
##charset=x-mac-chinesesimp
Chinese Traditional (Big5)
##charset=big5
Chinese Traditional (CNS)
##charset=x-Chinese-CNS
Chinese Traditional (Eten)
##charset=x-Chinese-Eten
Chinese Traditional (Mac)
##charset=x-mac-chinesetrad
##charset=950
Cyrillic (DOS)
##charset=cp866
Cyrillic (ISO)
##charset=iso-8859-5
Cyrillic (KOI8-R)
##charset=koi8-r
Cyrillic (KOI8-U)
##charset=koi8-u
Cyrillic (Mac)
##charset=x-mac-cyrillic
Cyrillic (Windows)
##charset=windows-1251
Europa
##charset=x-Europa
German (IA5)
##charset=x-IA5-German
Greek (DOS)
##charset=ibm737
Greek (ISO)
##charset=iso-8859-7
Greek (Mac)
##charset=x-mac-greek
Greek (Windows)
##charset=windows-1253
Greek, Modern (DOS)
##charset=ibm869
Hebrew (DOS)
##charset=DOS-862
Hebrew (ISO-Logical)
##charset=iso-8859-8-i
Hebrew (ISO-Visual)
##charset=iso-8859-8
Hebrew (Mac)
##charset=x-mac-hebrew
Hebrew (Windows)
##charset=windows-1255
IBM EBCDIC (Arabic)
##charset=x-EBCDIC-Arabic
IBM EBCDIC (Cyrillic Russian)
##charset=x-EBCDIC-CyrillicRussian
IBM EBCDIC (Cyrillic Serbian-Bulgarian)
##charset=x-EBCDIC-CyrillicSerbianBulgarian
IBM EBCDIC (Denmark-Norway)
##charset=x-EBCDIC-DenmarkNorway
IBM EBCDIC (Denmark-Norway-Euro)
##charset=x-ebcdic-denmarknorway-euro
IBM EBCDIC (Finland-Sweden)
##charset=x-EBCDIC-FinlandSweden
IBM EBCDIC (Finland-Sweden-Euro)
##charset=x-ebcdic-finlandsweden-euro
IBM EBCDIC (France-Euro)
##charset=x-ebcdic-france-euro
IBM EBCDIC (Germany)
##charset=x-EBCDIC-Germany
IBM EBCDIC (Germany-Euro)
##charset=x-ebcdic-germany-euro
IBM EBCDIC (Greek Modern)
##charset=x-EBCDIC-GreekModern
IBM EBCDIC (Greek)
##charset=x-EBCDIC-Greek
IBM EBCDIC (Hebrew)
##charset=x-EBCDIC-Hebrew
IBM EBCDIC (Icelandic)
##charset=x-EBCDIC-Icelandic
IBM EBCDIC (Icelandic-Euro)
##charset=x-ebcdic-icelandic-euro
IBM EBCDIC (International-Euro)
##charset=x-ebcdic-international-euro
IBM EBCDIC (Italy)
##charset=x-EBCDIC-Italy
IBM EBCDIC (Italy-Euro)
##charset=x-ebcdic-italy-euro
IBM EBCDIC (Japanese and Japanese Katakana)
##charset=x-EBCDIC-JapaneseAndKana
IBM EBCDIC (Japanese and Japanese-Latin)
##charset=x-EBCDIC-JapaneseAndJapaneseLatin
IBM EBCDIC (Japanese and US-Canada)
##charset=x-EBCDIC-JapaneseAndUSCanada
IBM EBCDIC (Japanese katakana)
##charset=x-EBCDIC-JapaneseKatakana
IBM EBCDIC (Korean and Korean Extended)
##charset=x-EBCDIC-KoreanAndKoreanExtended
IBM EBCDIC (Korean Extended)
##charset=x-EBCDIC-KoreanExtended
IBM EBCDIC (Multilingual Latin-2)
##charset=CP870
IBM EBCDIC (Simplified Chinese)
##charset=x-EBCDIC-SimplifiedChinese
IBM EBCDIC (Spain)
##charset=x-EBCDIC-Spain
IBM EBCDIC (Spain-Euro)
##charset=x-ebcdic-spain-euro
IBM EBCDIC (Thai)
##charset=x-EBCDIC-Thai
IBM EBCDIC (Traditional Chinese)
##charset=x-EBCDIC-TraditionalChinese
IBM EBCDIC (Turkish Latin-5)
##charset=CP1026
IBM EBCDIC (Turkish)
##charset=x-EBCDIC-Turkish
IBM EBCDIC (UK)
##charset=x-EBCDIC-UK
IBM EBCDIC (UK-Euro)
##charset=x-ebcdic-uk-euro
IBM EBCDIC (US-Canada)
##charset=ebcdic-cp-us
IBM EBCDIC (US-Canada-Euro)
##charset=x-ebcdic-cp-us-euro
Icelandic (DOS)
##charset=ibm861
Icelandic (Mac)
##charset=x-mac-icelandic
ISCII Assamese
##charset=x-iscii-as
ISCII Bengali
##charset=x-iscii-be
ISCII Devanagari
##charset=x-iscii-de
ISCII Gujarati
##charset=x-iscii-gu
ISCII Kannada
##charset=x-iscii-ka
ISCII Malayalam
##charset=x-iscii-ma
ISCII Oriya
##charset=x-iscii-or
ISCII Panjabi
##charset=x-iscii-pa
ISCII Tamil
##charset=x-iscii-ta
ISCII Telugu
##charset=x-iscii-te
Japanese (EUC)
##charset=euc-jp
##charset=x-euc-jp
Japanese (JIS)
##charset=iso-2022-jp
Japanese (JIS-Allow 1 byte Kana - SO/SI)
##charset=iso-2022-jp
Japanese (JIS-Allow 1 byte Kana)
##charset=csISO2022JP
Japanese (Mac)
##charset=x-mac-japanese
Japanese (Shift-JIS)
##charset=shift_jis
Korean
##charset=ks_c_5601-1987
Korean (EUC)
##charset=euc-kr
Korean (ISO)
##charset=iso-2022-kr
Korean (Johab)
##charset=Johab
Korean (Mac)
##charset=x-mac-korean
Latin 3 (ISO)
##charset=iso-8859-3
Latin 9 (ISO)
##charset=iso-8859-15
Norwegian (IA5)
##charset=x-IA5-Norwegian
OEM United States
##charset=IBM437
Swedish (IA5)
##charset=x-IA5-Swedish
Thai (Windows)
##charset=windows-874
Turkish (DOS)
##charset=ibm857
Turkish (ISO)
##charset=iso-8859-9
Turkish (Mac)
##charset=x-mac-turkish
Turkish (Windows)
##charset=windows-1254
Unicode
##charset=unicode
Unicode (Big-Endian)
##charset=unicodeFFFE
Unicode (UTF-7)
##charset=utf-7
Unicode (UTF-8)
##charset=utf-8
US-ASCII
##charset=us-ascii
Vietnamese (Windows)
##charset=windows-1258
Western European (DOS)
##charset=ibm850
Western European (IA5)
##charset=x-IA5
Western European (ISO)
##charset=iso-8859-1
Western European (Mac)
##charset=macintosh
Western European (Windows)
##charset=windows-1252
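
Many of the labels in the longer list are vendor-specific (the x- prefixed ones in particular), so a given runtime may not recognize them. As a rough sketch, the following Python snippet separates labels that the standard codecs registry can resolve from those it cannot. The function name split_known and the sample labels are chosen only for illustration.

```python
import codecs


def split_known(labels):
    """Split charset labels into those Python can resolve and those it cannot."""
    known, unknown = {}, []
    for label in labels:
        try:
            # codecs.lookup() raises LookupError for unrecognized labels.
            known[label] = codecs.lookup(label).name
        except LookupError:
            unknown.append(label)
    return known, unknown


sample = ["utf-8", "windows-1252", "koi8-r", "x-EBCDIC-Germany"]
known, unknown = split_known(sample)
```

The values in known are Python's own canonical codec names, which can differ from the label that was looked up.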