Difference between revisions of "KAIST Scene Text Database"

From TC11
(License)
(Version 1.0)
 
(12 intermediate revisions by the same user not shown)
Line 1: Line 1:
[[Datasets]] -> Current Page
+
[[Datasets]] -> [[Datasets List]] -> Current Page
  
 
{| style="width: 100%"
 
{| style="width: 100%"
Line 21: Line 21:
 
  Email: Jkim @ kaist.ac.kr
 
  Email: Jkim @ kaist.ac.kr
  
  SeongHun Lee
+
  Seonghun Lee
 
  Artificial Intelligence and Pattern Recognition Lab,
 
  Artificial Intelligence and Pattern Recognition Lab,
 
  Computer Science Department of KAIST, KOREA
 
  Computer Science Department of KAIST, KOREA
Line 27: Line 27:
  
 
=License=
 
=License=
[[Image:CC_BY-SA.png|right|link=http://creativecommons.org/licenses/by-sa/3.0/|Creative Commons License]]
+
[[Image:CC_BY-SA.png|right|link=http://creativecommons.org/licenses/by-sa/3.0/ Creative Commons License]]
  
 
This work is licensed under a [http://creativecommons.org/licenses/by-sa/3.0/ Creative Commons Attribution-ShareAlike License]
 
This work is licensed under a [http://creativecommons.org/licenses/by-sa/3.0/ Creative Commons Attribution-ShareAlike License]
  
 
=Current Version=
 
=Current Version=
[[Image:KAIST_DB_Thumb.jpg|400px|thumb|right]]
+
[[Image:KAIST_DB_Thumb.jpg|400px|thumb|right| Example of images and ground truth information in the KAIST dataset.]]
 
1.0
 
1.0
  
Line 41: Line 41:
 
The KAIST scene text dataset comprises 3000 images captured in different environments, including outdoors and indoors scenes under different lighting conditions (clear day, night, strong artificial lights, etc). Images were captured either by the use of a high-resolution digital camera or a low-resolution mobile phone camera. All images have been resized to 640x480.
 
The KAIST scene text dataset comprises 3000 images captured in different environments, including outdoors and indoors scenes under different lighting conditions (clear day, night, strong artificial lights, etc). Images were captured either by the use of a high-resolution digital camera or a low-resolution mobile phone camera. All images have been resized to 640x480.
  
The KAIST scene text database is categorized according to the language of the scene text captured: Korean, English (Number), and Complex (Korean + English + Number). The scene text in the images is representative of common text in Korean streets or shops.
+
The KAIST scene text database is categorized according to the language of the scene text captured: Korean, English (Number), and Mixed (Korean + English + Number). The scene text in the images is representative of common text in Korean streets or shops.
  
 
=Related Ground Truth Data=
 
=Related Ground Truth Data=
Line 47: Line 47:
  
 
=Related Tasks=
 
=Related Tasks=
* [[Scene Text Extraction or Detection]]
+
* [[Scene Text Localisation in the KAIST Dataset]]
 +
* [[Scene Text Segmentation in the KAIST Dataset]]
  
 
=References=
 
=References=
Line 56: Line 57:
  
 
==Version 1.0==
 
==Version 1.0==
 +
[http://www.iapr-tc11.org/dataset/KAIST_SceneText/KAIST_all.zip Complete Download] (the directory structure of the zip file reflects the structure below) (364 MB)
 
* Korean Language
 
* Korean Language
 
** Digital Camera
 
** Digital Camera
Line 77: Line 79:
 
** Digital Camera
 
** Digital Camera
 
*** Signboard
 
*** Signboard
**** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E,S)A-shadow.zip Shadow] (2.16 MB)
+
**** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E.S)A-shadow.zip Shadow] (2.16 MB)
**** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E,S)B-light.zip Light] (2.33 MB)
+
**** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E.S)B-light.zip Light] (2.33 MB)
**** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E,S)C-outdoor1.zip Outdoor 1] (8.44 MB), [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E,S)C-outdoor2.zip Outdoor 2] (10.53 MB), [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E,S)C-outdoor3.zip Outdoor 3] (2.46 MB)
+
**** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E.S)C-outdoor1.zip Outdoor 1] (8.44 MB), [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E.S)C-outdoor2.zip Outdoor 2] (10.53 MB), [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E.S)C-outdoor3.zip Outdoor 3] (2.46 MB)
**** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E,S)D-indoor.zip Indoor] (8.44 MB)
+
**** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E.S)D-indoor.zip Indoor] (8.44 MB)
**** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E,S)E-night.zip Night] (2.32 MB)
+
**** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E.S)E-night.zip Night] (2.32 MB)
*** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E,S)F-others.zip Others] (6.17 MB)
+
*** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E.S)F-others.zip Others] (6.17 MB)
 
** Mobile Phone
 
** Mobile Phone
 
*** Signboard
 
*** Signboard
**** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E,M)A-outdoor.zip Outdoor] (828.2 KB)
+
**** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E.M)A-outdoor.zip Outdoor] (828.2 KB)
**** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E,M)B-indoor.zip Indoor] (2.36 MB)
+
**** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E.M)B-indoor.zip Indoor] (2.36 MB)
*** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E,M)C-bookCover.zip Light] (2.22 MB)
+
*** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E.M)C-bookCover.zip Light] (2.22 MB)
* Complex
+
* Mixed Language Content
 
** Digital Camera
 
** Digital Camera
**** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C,S)A-shadow.zip Shadow] (2.18 MB)
+
**** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.S)A-shadow.zip Shadow] (2.18 MB)
**** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C,S)B-light.zip Light] (460.19 KB)
+
**** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.S)B-light.zip Light] (460.19 KB)
**** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C,S)C-outdoor1.zip Outdoor 1] (24.34 MB), [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C,S)C-outdoor2.zip Outdoor 2] (12.01 MB), [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C,S)C-outdoor3.zip Outdoor 3] (11.69 MB), [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C,S)C-outdoor4.zip Outdoor 4] (11.90 MB), [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C,S)C-outdoor5.zip Outdoor 5] (11.73 MB), [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C,S)C-outdoor6.zip Outdoor 6] (7.87 MB)
+
**** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.S)C-outdoor1.zip Outdoor 1] (24.34 MB), [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.S)C-outdoor2.zip Outdoor 2] (12.01 MB), [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.S)C-outdoor3.zip Outdoor 3] (11.69 MB), [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.S)C-outdoor4.zip Outdoor 4] (11.90 MB), [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.S)C-outdoor5.zip Outdoor 5] (11.73 MB), [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.S)C-outdoor6.zip Outdoor 6] (7.87 MB)
**** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C,S)D-indoor1.zip Indoor 1] (10.94 MB), [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C,S)D-indoor2.zip Indoor 2] (1.44 MB)
+
**** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.S)D-indoor1.zip Indoor 1] (10.94 MB), [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.S)D-indoor2.zip Indoor 2] (1.44 MB)
**** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C,S)E-night.zip Night] (4.45 MB)
+
**** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.S)E-night.zip Night] (4.45 MB)
*** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C,S)F-others.zip Others] (5.13 MB)
+
*** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.S)F-others.zip Others] (5.13 MB)
 
** Mobile Phone
 
** Mobile Phone
 
*** Signboard
 
*** Signboard
**** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C,M)A-outdoor.zip Outdoor] (2.42 MB)
+
**** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.M)A-outdoor.zip Outdoor] (2.42 MB)
**** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C,M)B-indoor.zip Indoor] (1.72 MB)
+
**** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.M)B-indoor.zip Indoor] (1.72 MB)
*** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C,M)A-bookCover.zip Book Cover] (3.21 MB)
+
*** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.M)C-bookCover.zip Book Cover] (3.21 MB)
  
 
----
 
----
 
This page is editable only by [[IAPR-TC11:Reading_Systems#TC11_Officers|TC11 Officers ]].
 
This page is editable only by [[IAPR-TC11:Reading_Systems#TC11_Officers|TC11 Officers ]].

Latest revision as of 12:02, 17 October 2012


Created: 2011-01-11
Last updated: 2012-10-17

Contact Author

Prof. Jin Hyung Kim
Artificial Intelligence and Pattern Recognition Lab,
Computer Science Department of KAIST, KOREA
Tel: 82-42-350-3517
Email: Jkim @ kaist.ac.kr
Seonghun Lee
Artificial Intelligence and Pattern Recognition Lab,
Computer Science Department of KAIST, KOREA
Email: leesh @ ai.kaist.ac.kr

License

This work is licensed under a Creative Commons Attribution-ShareAlike 3.0 License (http://creativecommons.org/licenses/by-sa/3.0/).

Current Version

Example of images and ground truth information in the KAIST dataset.

1.0

Keywords

Scene Text, Korean, English, Signboard, Mobile phone image, Indoor image, Outdoor image

Description

The KAIST scene text dataset comprises 3000 images captured in different environments, including outdoor and indoor scenes under different lighting conditions (clear day, night, strong artificial lights, etc.). Images were captured with either a high-resolution digital camera or a low-resolution mobile phone camera. All images have been resized to 640x480.

The KAIST scene text database is categorized according to the language of the scene text captured: Korean, English (Number), and Mixed (Korean + English + Number). The scene text in the images is representative of common text in Korean streets or shops.

Related Ground Truth Data

Related Tasks

* Scene Text Localisation in the KAIST Dataset
* Scene Text Segmentation in the KAIST Dataset

References

  1. Jehyun Jung, SeongHun Lee, Min Su Cho, and Jin Hyung Kim, "Touch TT: Scene Text Extractor Using Touch Screen Interface," ETRI Journal, 2011.
  2. SeongHun Lee, Min Su Cho, Kyomin Jung, and Jin Hyung Kim, "Scene Text Extraction with Edge Constraint and Text Collinearity Link," 20th International Conference on Pattern Recognition (ICPR), August 2010, Istanbul, Turkey.

Submitted Files

Version 1.0

Complete Download (the directory structure of the zip file reflects the structure below) (364 MB)
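
The per-category archives linked in the revision comparison follow a visible naming pattern, for example (E.S)C-outdoor2.zip. The Python sketch below splits such a name into its parts. The meanings attached to the codes (E = English, C = Mixed, S = digital camera, M = mobile phone) are assumptions inferred from the headings each link appears under, not something the dataset authors document here. The code used for Korean archives is not visible on this page and is left out. The names `ArchiveInfo` and `parse_archive_name` are illustrative only.

```python
import re
from typing import NamedTuple, Optional

# Assumed code meanings, inferred from the headings in the download list.
# The code used for Korean archives is not shown on this page, so it is omitted.
LANGUAGES = {"E": "English", "C": "Mixed"}
DEVICES = {"S": "Digital Camera", "M": "Mobile Phone"}

# Accepts both the "(E.S)" form and the older "(E,S)" form of the prefix.
ARCHIVE_NAME = re.compile(
    r"^\((?P<lang>[A-Z])[.,](?P<dev>[A-Z])\)"
    r"(?P<group>[A-Z])-(?P<cond>[A-Za-z]+)(?P<part>\d*)\.zip$"
)


class ArchiveInfo(NamedTuple):
    language: str         # decoded language, or the raw code if unknown
    device: str           # decoded capture device, or the raw code if unknown
    group: str            # single-letter group prefix, e.g. "C"
    condition: str        # e.g. "outdoor", "night", "bookCover"
    part: Optional[int]   # numeric suffix for split archives, else None


def parse_archive_name(name: str) -> Optional[ArchiveInfo]:
    """Return the parsed parts of an archive name, or None if it does not match."""
    match = ARCHIVE_NAME.match(name)
    if match is None:
        return None
    lang, dev, part = match["lang"], match["dev"], match["part"]
    return ArchiveInfo(
        language=LANGUAGES.get(lang, lang),
        device=DEVICES.get(dev, dev),
        group=match["group"],
        condition=match["cond"],
        part=int(part) if part else None,
    )


if __name__ == "__main__":
    print(parse_archive_name("(E.S)C-outdoor2.zip"))
```

Unknown codes are passed through unchanged rather than rejected, so an archive with an unlisted language or device code still parses.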


This page is editable only by TC11 Officers.