Arabic
Arabic Windows: Arabizing Windows Applications to , Computing in Arabic, LangBox International: Linux and Unix Arabic suppo, The FarsiWeb Project
Chinese
RFC 1842 - ASCII Printable Characters-Based Chines, RFC 1922 - Chinese Character Encoding for Internet
CJKV
Character Tables by Koichi Yasuoka, CJK Character Sets and Encoding Forms, CJKV Character Set Server
Cyrillic
Cyrillic in different encodings, Fingertip Software: ISO 8859-5 table, ISO 8859-5 Latin/Cyrillic Alphabet, KOI8-R Russian Character Set, KOI8-U Ukrainian Character Set, Minority Languages of Russia on the Net, Russify Everything, Slavic Text Processing
Greek
Advanced Topics on Computing in Greek, Greek and Coptic language fonts, Greek Font Unicode Converter, ISO Character Set 8859-7, Microsoft Windows Code Page 1253, Unicode Polytonic Greek for the Web
Hangul
RFC 1557 - Korean Character Encoding for Internet
Hebrew
A Users' Guide to Yiddish on the Internet, Hebrew Cantillation Marks and their Encoding, Hebrew characters in XML and XHTML, Jonathan Rosenne's Hebrew Page, Mikledet Hebrew Virtual Keyboard, The Hebrew Alphabet, Understanding Yiddish Information Processing, Yiddish and Unix
Indic
Unicode, Computing in Indian Languages using Java, Indian Scripts Input System, Indix - Indian Language Computing Project, ISCII and ISCLAP, ISCII white paper, Kannada Localisation Initiative, Standardization and Implementations of Thai Langua, Tamil Numerals, Tamil Unicode, The Indic Computing Project
Japanese
RFC 1468 - Japanese Character Encoding for Interne, RFC 2237 - Japanese Character Encoding for Interne
Latin
Vietnamese, Estonian Standard EVS 8:1993, HTML 4.0 Latin-1 Entities, ISO Latin 1 Character Entities, Wikipedia - ISO-8859
Native American
Government of Nunavut - Pigiarniq Font, Proposal for encoding the Cherokee script, Tiro Typeworks
Unicode
Adobe: Unicode and Glyph Names, ConScript Unicode Registry, Decimal, Hexadecimal Character Codes in HTML Unico, Demystifying Unicode, F4: ASCII - Unicode Bridge, Fingertipsoft: Character Set Converter, Fontboard, Free Online Unicode Character Map, IBM developerWorks: Unicode, International Unicode Conferences
3rdpageSearch
Front end to several search engines and portals that allows you to enter queries in various character sets.
A Brief History of Character Codes
A concise history of the development of character encoding in Western and East Asian languages, including ASCII, EBCDIC, Unicode and TRON.
An Early History of Character Set Standardization
Covers the beginnings of the ASCII standards from ASCII-1963 onwards, with information on Cyrillic, Japanese, Korean, Thai and Vietnamese encoding systems, including various localized versions of EBCDIC. With tables and links to other resources.
ASCII and EBCDIC Compared
A comparison of these two basic encoding systems, with tables.
Basis Technology: Presentations and Papers
A wide range of articles on Unicode, East Asian localization and internationalization issues.
Character Set Issues beyond HTML3.2
Internationalization issues beyond HTML3.2 and ISO-8859-1. Includes information on Baltic encodings.
Characters and Encodings
A tutorial on character code issues in digital processing and transfer of text data, on the Internet or otherwise. Includes tables and a detailed listing of control codes. In English and Finnish.
Chilkat Charset Conversion Component
A character set conversion component for Unicode, Japanese, Chinese, Korean, Cyrillic, Arabic, Hebrew, Thai, Vietnamese and all Western languages.
Dan's Web Tips: Characters and Fonts
Hints and tips about character sets and fonts in web development. Includes links to related resources.
ECMA: Character Code Structure and Extension Techn
ECMA-35, which specifies the structure of 8-bit and 7-bit codes that provide for the coding of character sets. Available as a detailed PDF document.
eGrannie: ASCII-EBCDIC chart
A side-by-side comparison of ASCII and EBCDIC encoding.
EKI Letter Database
Query character sets, encoding, codepages and Unicode information in an easy-to-use web form. Held at the Institute of the Estonian Language.
GNU Aspell: Czyborra.com Mirror
Information on Latin and non-Latin encoding systems, codepages and character sets by Roman Czyborra.
HTML Document Representation
Chapter covering document character sets and encodings in HTML from the World Wide Web Consortium's HTML 4.0 Specification.
HTML Validation: Using Character Encodings
How to validate HTML documents in various character encodings.
IANA: Character Sets
The official names for character sets that may be used in the Internet and referred to in Internet documentation, held at the Internet Assigned Numbers Authority.
ISO 639 Language Names
The standard names for use in SGML and XML, including a complete list of language name codes.
LangBox International
Codetables for ISO 8859-6, ASMO 449 plus, ASMO 708 (Arabic) and ISO 8859-8 (Hebrew) and further information about the company's work in multilingual UNIX.
MS Windows characters in HTML
A review of the HTML authoring problems caused by some special characters that belong to the MS Windows character set but not to ISO Latin 1. Includes technical details and substitution tables. In English and Finnish.
ScientificPublications.com: Czyborra.com Mirror
Mirror of Roman Czyborra's work on character sets and encoding systems. In English and German.
Tips & Techniques for Foreign Content on the W
Pennsylvania State University's guide to reading and publishing different languages on the web. Includes details of various encoding systems and links.
Tutorial: Shady Characters
A tutorial that explains HTML character sets, character encodings and character references from Webreference.com.
World Wide Web Consortium
Covers code tables, Unicode, HTML and XML, discusses internationalization and localization issues relating to character sets, and links to other resources.
Xceed Binary Encoding Library
A library for Windows developers that allows applications to encode binary data and files into text and vice-versa.