Difference between revisions of "Handwriting recognition for Chinese characters"

From TC11
Jump to: navigation, search
Line 27: Line 27:
 
=Related Dataset=
 
=Related Dataset=
 
* [[Harbin Institute of Technology Opening Recognition Corpus for Chinese Characters (HIT-OR3C)]]
 
* [[Harbin Institute of Technology Opening Recognition Corpus for Chinese Characters (HIT-OR3C)]]
 +
  
 
----
 
----
 
This page is editable only by [[IAPR-TC11:Reading_Systems#TC11_Officers|TC11 Officers ]].
 
This page is editable only by [[IAPR-TC11:Reading_Systems#TC11_Officers|TC11 Officers ]].

Revision as of 14:52, 1 May 2010

Created: 2010-04-30
Last updated: 2010-005-01

Description

The aim of this task is to automatically recognize a series of characters that written on paper or handwriting input device.

This is a topic that has received a lot of attention lately, including the shape normalization methods for handwritten Chinese character recognition [1], and various methods for online recognition of Chinese characters (see [2] for a review).

Evaluation Protocol

This task consists of a total number of 909,818 handwriting characters. We provide one training set (the character subsets: GB1, GB2, Letter and Digit, 832,650 samples) and one testing set (the documents subset, 77,168 samples). For evaluation, the overall recognition rate should be reported.

References

1. C. L. Liu, and K. Marukawa, 2005, “Pseudo two-dimensional shape normalization methods for handwritten Chinese character recognition,” Pattern Recognition, vol. 38, no. 12, pp. 2242-2255, Dec.

2. C. L. Liu, S. Jaeger, and M. Nakagawa, 2004, “Online recognition of Chinese characters: The state-of-the-art,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 26, no. 2, pp. 198-213.

Related Dataset



This page is editable only by TC11 Officers .