Main Page

From CharacterDB

Jump to:navigation, search

Welcome to CharacterDB, an open and free database on the structure of Han characters.

CharacterDB is a collaborative effort for gathering semantic data on the structure of Chinese characters' appearances (glyphs) for Hanzi, Kanji and Hanja. We gather data that can be used by the cjklib project and other projects. Currently there are 74717 characters and 72378 glyphs listed in this wiki.

For all Chinese characters encoded by Unicode we collect character decomposition and stroke order data. For example we record that is constituted by , in an outer-inner fashion (). Its stroke order (actually the stroke order of its glyph 國/0) is ㇑㇕㇐㇑㇕㇐㇀㇂㇒㇔㇐ (S-HZ H S-HZ-H T XG-P-D H) with totally 11 strokes. There are some characters that include this character as component themselves: , , , , , , , , , , .

CharacterDB needs your help. There are many Chinese characters encoded in Unicode and many still lack proper data. Join in and add or correct entries!

We are in beta stage, see Todo for what yet needs to be done.

CharacterDB was presented at the Wikimania 2010. The slides are here.

Some entry points

This wiki provides several views on the data. Start with the entry points below to get accustomed with them:

Navigation
Toolbox