KAIST Scene Text Database

From TC11
Revision as of 21:02, 17 October 2012 by Dimos (talk | contribs) (Version 1.0)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

Datasets -> Datasets List -> Current Page

Created: 2011-01-11
Last updated: 2012-10-17

Contact Author

Prof. Jin Hyung Kim
Artificial Intelligence and Pattern Recognition Lab,
Computer Science Department of KAIST, KOREA
Tel: 82-42-350-3517
Email: Jkim @ kaist.ac.kr
Seonghun Lee
Artificial Intelligence and Pattern Recognition Lab,
Computer Science Department of KAIST, KOREA
Email: leesh @ ai.kaist.ac.kr


link=http://creativecommons.org/licenses/by-sa/3.0/ Creative Commons License

This work is licensed under a Creative Commons Attribution-ShareAlike License

Current Version

Example of images and ground truth information in the KAIST dataset.



Scene Text, Korean, English, Signboard, Mobile phone image, Indoor image, Outdoor image


The KAIST scene text dataset comprises 3000 images captured in different environments, including outdoors and indoors scenes under different lighting conditions (clear day, night, strong artificial lights, etc). Images were captured either by the use of a high-resolution digital camera or a low-resolution mobile phone camera. All images have been resized to 640x480.

The KAIST scene text database is categorized according to the language of the scene text captured: Korean, English (Number), and Mixed (Korean + English + Number). The scene text in the images is representative of common text in Korean streets or shops.

Related Ground Truth Data

Related Tasks


  1. Jehyun Jung, SeongHun Lee, Min Su Cho, and Jin Hyung Kim, “Touch TT: Scene Text Extractor Using Touch Screen Interface“, ETRI Journal 2011
  2. SeongHun Lee, Min Su Cho, Kyomin Jung, and Jin Hyung Kim, "Scene Text Extraction with Edge Constraint and Text Collinearity Link," 20th International Conference on Pattern Recognition (ICPR), August 2010, Istanbul, Turkey.

Submitted Files

Version 1.0

Complete Download (the directory structure of the zip file reflects the structure below) (364 MB)

This page is editable only by TC11 Officers .