Difference between revisions of "Datasets"

From TC11
(Machine-print OCR)
(On-line: vectorial, (xt, yt))
Line 12:

=== On-line:  vectorial,  (xt, yt) ===
+ * [http://www.cedar.buffalo.edu/Linguistics/database.html CEDAR On-line Handwriting Database]
+ * [http://hwr.nici.kun.nl/unipen/ UNIPEN database] (Click on link 'CDROMs')
+ * [ftp://ftp.ics.uci.edu/pub/machine-learning-databases/pendigits/ Ethem Alpaydin's on-line digit db]
+ * [ftp://ftp.ics.uci.edu/pub/machine-learning-databases/optdigits/ Ethem Alpaydin's optical digit db]
+ * [http://www.cse.salford.ac.uk/prima/TC11//kuchibue.html Kuchibue & Nakayosi] (by Masaaki Nakagawa and Stefan Jaeger)
+       Together, these databases comprise more than 3 million Japanese characters from 283 writers.
+ * [http://www.ai.rug.nl/~lambert/unipen/icdar-03-competition/ The Informal Competition of Recognizing On-line Words (ICROW)] by the Unipen Foundation
  
 
=== Off-line:  image,  I(x,y) ===
  
 
=== Combined on-line/offline handwriting ===

Revision as of 08:47, 28 August 2009

Optical Character Recognition (OCR)

Machine-print OCR

Handwriting

On-line: vectorial, (xt, yt)

Together, the Kuchibue and Nakayosi databases comprise more than 3 million Japanese characters from 283 writers.

Off-line: image, I(x,y)

Combined on-line/offline handwriting