Handwriting recognition for Chinese characters

From TC11
Jump to: navigation, search

Datasets -> Datasets List -> Current Page

Created: 2010-04-30
Last updated: 2011-001-28

Current Version



The aim of this task is to automatically recognize a series of characters that written on paper or handwriting input device.

This is a topic that has received a lot of attention lately, including the shape normalization methods for handwritten Chinese character recognition [1], and various methods for online recognition of Chinese characters (see [2] for a review).

Evaluation Protocol

This task consists of a total number of 909,818 handwriting characters. We provide one training set (the character subsets: GB1, GB2, Letter and Digit, 832,650 samples) and one testing set (the documents subset, 77,168 samples). For evaluation, the overall recognition rate should be reported.


1. C. L. Liu, and K. Marukawa, 2005, “Pseudo two-dimensional shape normalization methods for handwritten Chinese character recognition,” Pattern Recognition, vol. 38, no. 12, pp. 2242-2255, Dec.

2. C. L. Liu, S. Jaeger, and M. Nakagawa, 2004, “Online recognition of Chinese characters: The state-of-the-art,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 26, no. 2, pp. 198-213.

Related Dataset

This page is editable only by TC11 Officers .