Unicode is a standard “registry” of characters designed to catalogue all the world’s writing systems and serve as a superset of all previously-existing character sets used and supported by just about everyone and all their software. Get into the details on Stack Overflow or see some of the Unicode encodings on Wikipedia.
Table of Contents
- Charts and character data
- String analysis
- Shape recognition
- Conversion (programming)
- Conversion (decorative or linguistic)
- Coverage: fonts & support
- ASCII/Unicode art & Kaomoji
- Inserting characters
Charts and character dataPDF charts on unicode.org — The only complete and perpetually up-to-date reference. List of Unicode blocks on Wikipedia — Also available as a TXT file on unicode.org
UNIDATA on unicode.org — “This directory contains the final data files for the Unicode Character Database, for Version 8.0.0 of the Unicode Standard.”
CopyPasteCharacter — Contains a bunch of common characters for easy copying.
Xah Lee’s Unicode gallery — Includes a search engine, various categorical galleries, and discussion.
Variants on unicode.org — A list of variant ligatures for using alternate forms.
Unicode planes reference — Confused about what “BMP” means? Check here.
String analysisUnicode Inspector — By @timwhitlock, displays the codepoint, byte breakdown, block, symbol, name, and surrogates for each character in a string.
codepoints.net — Powerful search engine with previews in Unifont
Shape recognitionshapecatcher — Recognizes Unicode characters through a drawing field.
Kanji search on jisho.org — Find kanji characters by their parts.
Google Translate — Click the pencil icon in the input area to enable shape recognition. Great for writing short bits of text in a language you don’t know.
Conversion (programming)Xem’s EscApe utility on Github — By Maxime Euzière. Converts any Unicode string to 33 different escape sequences. New: v2 beta!
ASCII Xlate — Converts plain ASCII, binary, octal, hex, base32, base64, ASCII85, and decimal ASCII. Also calculates various hashes!
Conversion (decorative or linguistic)Unicate — Aconverts to various latin Unicode alphabets (e.g. fullwidth, math scripts, etc.).
Text converter on qaz.wtf — Similar to the above.Zalgo generator on eeemo.net — generates Zalgo text.
Convert Text on the Chrome web store — Converts the case of text as a Chrome extension. Includes Zalgo generator and fullwidth transform.
Acrostic generator on 9999yea.rs — Dubiously useful. Also converts to Unicode Math Monospace.
Abbreviator on 9999yea.rs — Saves characters in tweets by using precomposed characters.
Unitools — A good compilation of a bunch of other tools. Honorary mention.
Coverage: fonts & supportAlan Wood’s Unicode font list — Probably the most complete list of high-coverage fonts. PragmataPro — A monospaced programming font with 6,000 glyphs (and rising). CharacterMap — Analyzes glyphs from font files.
Google Noto Fonts — A set of sans and serif fonts supporting 581 languages (as of April 2016), with about 50% glyph coverage.
Preview of all codepoints in the BMP — Useful for testing coverage.Fonts for ancient scripts — Symbola 8.00 is available here.
EmojiFull Emoji Charts — all emoji with comparison pictures of implementations on various platforms.
Emoji Symbols: Background Data on unicode.org, a.k.a. (L2/09-027) — Japanese carrier background data for the original emoji import (mostly historic value).
List of emoji ZWJ ligatures on unicode.org — Note: These are not actually emoji or part of the Unicode standard. Implementation, support, and blame lies entirely with the third parties involved.
Text vs Emoji reference on unicode.org — Shows which characters (should) render as emoji or text.
ASCII/Unicode art & Kaomojicutekaomoji.com
Vim — Insert mode:
<C-X> 8 <CR> xxxx <CR>
Mac OS X — (☑︎ Unicode Hex Input)
Windows — (☑︎ Registry Key)
Unix in GTK applications —
Misc.Random Unicode character generator on jsfiddle — By @FakeUnicode on Twitter.
my unicode toys — by rebecca