rebecca turner

Unicode Resources

Created by rebecca with major help from @FakeUnicode

Unicode is a standard “registry” of characters, designed to catalogue all the world’s writing systems and to serve as a superset of all previously existing character sets. It is used and supported by just about everyone and all their software. Get into the details on Stack Overflow or see some of the Unicode encodings on Wikipedia.


Charts and character data

PDF charts on — The only complete and perpetually up-to-date reference.

List of Unicode blocks on Wikipedia — Also available as a TXT file on

UNIDATA on — “This directory contains the final data files for the Unicode Character Database, for Version 8.0.0 of the Unicode Standard.”

CopyPasteCharacter — Contains a bunch of common characters for easy copying.

Proposed new characters on

Xah Lee’s Unicode gallery — Includes a search engine, various categorical galleries, and discussion.

Variants on — A list of variant ligatures for using alternate forms.

Unicode planes reference — Confused about what “BMP” means? Check here.
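
For a quick feel of what “BMP” means in practice, here is a minimal Python sketch. It assumes only that each plane spans 0x10000 code points, so the plane number is the code point shifted right by 16 bits. The helper names `plane_of` and `is_bmp` are made up for illustration.

```python
def plane_of(char: str) -> int:
    """Return the plane number (0-16) of a single character."""
    return ord(char) >> 16  # each plane covers 0x10000 code points


def is_bmp(char: str) -> bool:
    """True if the character sits in the Basic Multilingual Plane (plane 0)."""
    return plane_of(char) == 0


# "A", SNOWMAN (U+2603) and GRINNING FACE (U+1F600)
for ch in ["A", "\u2603", "\U0001F600"]:
    print(f"U+{ord(ch):04X} plane={plane_of(ch)} bmp={is_bmp(ch)}")
```

Anything outside plane 0 needs a surrogate pair in UTF-16, which is usually why the distinction matters.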

String analysis

Unicode Inspector — By @timwhitlock, displays the codepoint, byte breakdown, block, symbol, name, and surrogates for each character in a string.

Scarfboy search and string analysis

Understanding UTF-8 on jsfiddle — By @FakeUnicode on Twitter. Displays binary/hex breakdown of strings of text.

What Unicode character is this? — By BabelStone.
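
If you want a rough local stand-in for the inspectors above, the sketch below collects the code point, name, UTF-8 bytes, and UTF-16 surrogates for each character of a string. It is not how any of the listed tools are implemented; `inspect_string` and `surrogates` are hypothetical helpers, and block lookup is left out because Python's standard library does not expose block names.

```python
import unicodedata


def surrogates(cp: int):
    """Return the UTF-16 surrogate pair for a code point above U+FFFF, else None."""
    if cp < 0x10000:
        return None
    offset = cp - 0x10000
    return (0xD800 + (offset >> 10), 0xDC00 + (offset & 0x3FF))


def inspect_string(text: str):
    """Describe every character in text as a small dict."""
    rows = []
    for ch in text:
        cp = ord(ch)
        rows.append({
            "codepoint": f"U+{cp:04X}",
            # Fall back to a placeholder for characters without a name
            "name": unicodedata.name(ch, "<unnamed>"),
            "utf8": " ".join(f"{b:02X}" for b in ch.encode("utf-8")),
            "surrogates": surrogates(cp),
        })
    return rows


for row in inspect_string("e\u00e9\U0001F600"):
    print(row)
```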


Scarfboy search and string analysis — Powerful search engine with previews in Unifont


Shape recognition

shapecatcher — Recognizes Unicode characters through a drawing field.

Kanji search on — Find kanji characters by their parts.

Handwritten kanji recognition — Like shapecatcher for kanji.

Google Translate — Click the pencil icon in the input area to enable shape recognition. Great for writing short bits of text in a language you don’t know.

Mouse input for Chinese characters

Conversion (programming)

Xem’s EscApe utility on GitHub — By Maxime Euzière. Converts any Unicode string to 33 different escape sequences. New: v2 beta!

ASCII Xlate — Converts between plain ASCII, binary, octal, hex, base32, base64, ASCII85, and decimal ASCII. Also calculates various hashes!

Guide to converting to UTF-8 in various programming languages
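
To give an idea of what these converters produce, here is a small Python sketch of three common escape styles. It is only an illustration, not the output format of any specific tool listed here, and the function names are invented for the example.

```python
import urllib.parse


def js_escape(text: str) -> str:
    # JavaScript-style escapes: one backslash-u group per UTF-16 code unit,
    # so characters outside the BMP come out as a surrogate pair.
    data = text.encode("utf-16-be")
    units = [int.from_bytes(data[i:i + 2], "big") for i in range(0, len(data), 2)]
    return "".join(f"\\u{u:04X}" for u in units)


def html_entities(text: str) -> str:
    # HTML hexadecimal numeric character references, one per code point.
    return "".join(f"&#x{ord(ch):X};" for ch in text)


def percent_encode(text: str) -> str:
    # URL percent-encoding of the UTF-8 bytes (nothing is treated as safe).
    return urllib.parse.quote(text, safe="")


sample = "\u00e9\U0001F600"
print(js_escape(sample))
print(html_entities(sample))
print(percent_encode(sample))
```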

Conversion (decorative or linguistic)

Unicate — Converts to various Latin Unicode alphabets (e.g. fullwidth, math scripts, etc.).

Text converter on — Similar to the above.

Zalgo generator on — Generates Zalgo text.

Convert Text on the Chrome web store — Converts the case of text as a Chrome extension. Includes Zalgo generator and fullwidth transform.

Strikethrough converter — By Adam Varga.

Emojify text on jsfiddle — By @FakeUnicode on Twitter. Transforms text to emoji.

Acrostic generator on — Dubiously useful. Also converts to Unicode Math Monospace.

Abbreviator on — Saves characters in tweets by using precomposed characters.

Unitools — A good compilation of a bunch of other tools. Honorary mention.

Nepali converter

Braille converter
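
Most of the decorative converters above boil down to simple code point arithmetic. As a sketch (not the code behind any of these tools), fullwidth text can be produced by shifting printable ASCII up by 0xFEE0, and strikethrough by appending U+0336 COMBINING LONG STROKE OVERLAY after each character. The function names are hypothetical.

```python
def to_fullwidth(text: str) -> str:
    """Map printable ASCII to the Fullwidth Forms block; leave the rest alone."""
    out = []
    for ch in text:
        cp = ord(ch)
        if ch == " ":
            out.append("\u3000")          # IDEOGRAPHIC SPACE
        elif 0x21 <= cp <= 0x7E:
            out.append(chr(cp + 0xFEE0))  # U+0021..U+007E -> U+FF01..U+FF5E
        else:
            out.append(ch)
    return "".join(out)


def strikethrough(text: str) -> str:
    """Append a combining long stroke overlay to every character."""
    return "".join(ch + "\u0336" for ch in text)


print(to_fullwidth("Hello, world!"))
print(strikethrough("deleted"))
```

How well the strikethrough lines up depends on the font, since the overlay is a combining mark rather than real text styling.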

Coverage: fonts & support

Alan Wood’s Unicode font list — Probably the most complete list of high-coverage fonts.

PragmataPro — A monospaced programming font with 6,000 glyphs (and rising).

CharacterMap — Analyzes glyphs from font files.

Google Noto Fonts — A set of sans and serif fonts supporting 581 languages (as of April 2016), with about 50% glyph coverage.

Unicode fonts by writing system

Preview of all codepoints in the BMP — Useful for testing coverage.

GNU Unifont

Fonts for ancient scripts — Symbola 8.00 is available here.

BabelStone’s Han font

Everson Mono font

Hanazono font

Code2000 font


Emoji

Full Emoji Charts — All emoji, with comparison pictures of implementations on various platforms.

Emoji Symbols: Background Data on, a.k.a. (L2/09-027) — Japanese carrier background data for the original emoji import (mostly historic value).

List of proposals and their associated codepoints on Wikipedia

Possible upcoming emoji on

List of emoji ZWJ ligatures on — Note: These are not actually emoji or part of the Unicode standard. Implementation, support, and blame lie entirely with the third parties involved.

Text vs Emoji reference on — Shows which characters (should) render as emoji or text.
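
The text-versus-emoji distinction and the ZWJ ligatures above both come down to ordinary code points placed next to a character. A minimal Python sketch, with made-up helper names: U+FE0E (VARIATION SELECTOR-15) requests text presentation, U+FE0F (VARIATION SELECTOR-16) requests emoji presentation, and U+200D (ZERO WIDTH JOINER) glues several emoji into one sequence. Whether any of this renders as intended depends on the platform and font.

```python
VS15 = "\uFE0E"  # VARIATION SELECTOR-15: ask for text presentation
VS16 = "\uFE0F"  # VARIATION SELECTOR-16: ask for emoji presentation
ZWJ = "\u200D"   # ZERO WIDTH JOINER


def as_text(ch: str) -> str:
    """Request the text (monochrome) form of a character."""
    return ch + VS15


def as_emoji(ch: str) -> str:
    """Request the emoji (colour) form of a character."""
    return ch + VS16


def zwj_sequence(*parts: str) -> str:
    """Join several emoji with ZWJ; support for the result varies by vendor."""
    return ZWJ.join(parts)


# BALLOT BOX WITH CHECK (U+2611) in both presentations
print(as_text("\u2611"), as_emoji("\u2611"))

# MAN + WOMAN + GIRL joined into a single "family" sequence
family = zwj_sequence("\U0001F468", "\U0001F469", "\U0001F467")
print(family, [f"U+{ord(c):04X}" for c in family])
```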

ASCII/Unicode art & Kaomoji

List of 10,000+ kaomoji

ASCII art collection on

ASCII art on

ASCII text art generator

Inserting characters

Vim — Insert mode: <C-v>uxxxx

Emacs — <C-X> 8 <CR> xxxx <CR>

Mac OS X — (☑︎ Unicode Hex Input) <⌥-xxxx>

Windows — (☑︎ Registry Key) <A-xxxx>

Unix in GTK applications — <C-S-uxxxx>
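
In all of the shortcuts above, `xxxx` stands for the character's code point written in hexadecimal. If you need to find those digits, or check what a given code produces, a tiny Python sketch like this is enough (the helper names are hypothetical, and the four-digit padding simply mirrors the `xxxx` placeholder):

```python
def char_from_hex(hex_digits: str) -> str:
    """Return the character whose code point is given as hex digits."""
    return chr(int(hex_digits, 16))


def hex_for_input(ch: str) -> str:
    """Return the hex digits to type for a character, padded to four places."""
    return f"{ord(ch):04x}"


print(char_from_hex("2603"))    # the character behind code 2603
print(hex_for_input("\u00e9"))  # digits to type for e with acute
```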


Random Unicode character generator on jsfiddle — By @FakeUnicode on Twitter.

List of Unicode arrows — Courtesy of @fabrizioschiavi of Pragmata Pro fame.

Emoji allowed in Twitter usernames

Character/byte counter

my unicode toys — by rebecca
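
As a footnote to the character/byte counter: “length” depends on what you count. The sketch below, using a hypothetical `count_units` helper, compares code points, UTF-8 bytes, and UTF-16 code units for the same string. It does not count user-perceived characters (grapheme clusters), which would need more than the standard library.

```python
def count_units(text: str) -> dict:
    """Count a string three ways: code points, UTF-8 bytes, UTF-16 code units."""
    return {
        "code_points": len(text),
        "utf8_bytes": len(text.encode("utf-8")),
        # utf-16-le avoids the byte order mark, so bytes // 2 is the unit count
        "utf16_units": len(text.encode("utf-16-le")) // 2,
    }


# "a", e with acute (U+00E9), GRINNING FACE (U+1F600)
print(count_units("a\u00e9\U0001F600"))
```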