Difference between revisions of "KAIST Scene Text Database"

(Version 1.0)
(Version 1.0)
 
(4 intermediate revisions by the same user not shown)
Line 21: Line 21:
 
  Email: Jkim @ kaist.ac.kr
- SeongHun Lee
+ Seonghun Lee
  Artificial Intelligence and Pattern Recognition Lab,
  Computer Science Department of KAIST, KOREA
Line 27: Line 27:
  
 
  =License=
- [[Image:CC_BY-SA.png|right|link=http://creativecommons.org/licenses/by-sa/3.0/| Creative Commons License]]
+ [[Image:CC_BY-SA.png|right|link=http://creativecommons.org/licenses/by-sa/3.0/ Creative Commons License]]

- This work is licensed under a [http://creativecommons.org/licenses/by-sa/3.0/| Creative Commons Attribution-ShareAlike License]
+ This work is licensed under a [http://creativecommons.org/licenses/by-sa/3.0/ Creative Commons Attribution-ShareAlike License]

  =Current Version=
Line 57: Line 57:
  
 
  ==Version 1.0==
+ [http://www.iapr-tc11.org/dataset/KAIST_SceneText/KAIST_all.zip Complete Download] (the directory structure of the zip file reflects the structure below) (364 MB)
  * Korean Language
  ** Digital Camera
Line 69: Line 70:
 
  ** Mobile Phone
  *** Signboard
- **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(M.K)A-shadow.zip Shadow] (645.54 KB)
+ **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(M,K)A-shadow.zip Shadow] (645.54 KB)
- **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(M.K)B-light.zip Light] (336.4 KB)
+ **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(M,K)B-light.zip Light] (336.4 KB)
- **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(M.K)C-outdoor.zip Outdoor] (2.43 MB)
+ **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(M,K)C-outdoor.zip Outdoor] (2.43 MB)
- **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(M.K)D-indoor.zip Indoor] (1.40 MB)
+ **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(M,K)D-indoor.zip Indoor] (1.40 MB)
- *** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(M.K)E-bookCover.zip Book Cover] (18.63 MB)
+ *** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(M,K)E-bookCover.zip Book Cover] (18.63 MB)
- *** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(M.K)F-others.zip Others] (430.55 KB)
+ *** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(M,K)F-others.zip Others] (430.55 KB)
  * English Language
  ** Digital Camera
Line 101: Line 102:
 
  **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.M)A-outdoor.zip Outdoor] (2.42 MB)
  **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.M)B-indoor.zip Indoor] (1.72 MB)
- *** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.M)A-bookCover.zip Book Cover] (3.21 MB)
+ *** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.M)C-bookCover.zip Book Cover] (3.21 MB)

  ----
  This page is editable only by [[IAPR-TC11:Reading_Systems#TC11_Officers|TC11 Officers ]].

Latest revision as of 12:02, 17 October 2012


Created: 2011-01-11
Last updated: 2012-10-17

Contact Author

Prof. Jin Hyung Kim
Artificial Intelligence and Pattern Recognition Lab,
Computer Science Department of KAIST, KOREA
Tel: 82-42-350-3517
Email: Jkim @ kaist.ac.kr

Seonghun Lee
Artificial Intelligence and Pattern Recognition Lab,
Computer Science Department of KAIST, KOREA
Email: leesh @ ai.kaist.ac.kr

License

This work is licensed under a Creative Commons Attribution-ShareAlike 3.0 License (http://creativecommons.org/licenses/by-sa/3.0/).

Current Version

1.0

[Figure: Example of images and ground truth information in the KAIST dataset.]

Keywords

Scene Text, Korean, English, Signboard, Mobile phone image, Indoor image, Outdoor image

Description

The KAIST scene text dataset comprises 3000 images captured in different environments, including outdoor and indoor scenes under different lighting conditions (clear day, night, strong artificial light, etc.). Images were captured with either a high-resolution digital camera or a low-resolution mobile phone camera. All images have been resized to 640x480.

The KAIST scene text database is categorized according to the language of the scene text captured: Korean, English (Number), and Mixed (Korean + English + Number). The scene text in the images is representative of common text in Korean streets or shops.

Related Ground Truth Data

Related Tasks

References

  1. Jehyun Jung, SeongHun Lee, Min Su Cho, and Jin Hyung Kim, "Touch TT: Scene Text Extractor Using Touch Screen Interface," ETRI Journal, 2011.
  2. SeongHun Lee, Min Su Cho, Kyomin Jung, and Jin Hyung Kim, "Scene Text Extraction with Edge Constraint and Text Collinearity Link," 20th International Conference on Pattern Recognition (ICPR), August 2010, Istanbul, Turkey.

Submitted Files

Version 1.0

Complete Download (the directory structure of the zip file reflects the structure below) (364 MB)
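
As a convenience for readers who want to script the download, the following is a minimal sketch, not an official tool of the dataset. It assumes the complete archive is still served at the KAIST_all.zip address given in the page source, and the local paths (`data/KAIST_all.zip`, `data/KAIST`) and helper names (`download`, `extract`, `top_level_dirs`) are hypothetical, chosen only for illustration. It uses only the Python standard library.

```python
from __future__ import annotations

import shutil
import urllib.request
import zipfile
from pathlib import Path

# URL of the "Complete Download" link as given in the page source.
# Assumption: the file is still hosted at this address.
KAIST_ALL_URL = "http://www.iapr-tc11.org/dataset/KAIST_SceneText/KAIST_all.zip"


def download(url: str, dest: Path) -> Path:
    """Stream `url` to `dest`; do nothing if `dest` already exists."""
    if not dest.exists():
        dest.parent.mkdir(parents=True, exist_ok=True)
        with urllib.request.urlopen(url) as resp, open(dest, "wb") as out:
            shutil.copyfileobj(resp, out)
    return dest


def extract(archive: Path, target: Path) -> list[str]:
    """Unpack `archive` into `target` and return the archive member names."""
    with zipfile.ZipFile(archive) as zf:
        zf.extractall(target)
        return zf.namelist()


def top_level_dirs(names: list[str]) -> list[str]:
    """Return the sorted first path components of the archive members."""
    return sorted({name.split("/", 1)[0] for name in names if "/" in name})


# Example usage (hypothetical local paths; downloads about 364 MB):
#   archive = download(KAIST_ALL_URL, Path("data/KAIST_all.zip"))
#   names = extract(archive, Path("data/KAIST"))
#   print(top_level_dirs(names))
```

Listing the top-level folders after extraction is a quick way to compare the unpacked archive against the directory structure described on this page. Note that `download` skips any file that already exists, so an interrupted download should be deleted before retrying.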


This page is editable only by TC11 Officers.