Difference between revisions of "KAIST Scene Text Database"
(→Version 1.0) |
(→Version 1.0) |
||
(4 intermediate revisions by the same user not shown) | |||
Line 21: | Line 21: | ||
Email: Jkim @ kaist.ac.kr | Email: Jkim @ kaist.ac.kr | ||
− | + | Seonghun Lee | |
Artificial Intelligence and Pattern Recognition Lab, | Artificial Intelligence and Pattern Recognition Lab, | ||
Computer Science Department of KAIST, KOREA | Computer Science Department of KAIST, KOREA | ||
Line 27: | Line 27: | ||
=License= | =License= | ||
− | [[Image:CC_BY-SA.png|right|link=http://creativecommons.org/licenses/by-sa/3.0/ | + | [[Image:CC_BY-SA.png|right|link=http://creativecommons.org/licenses/by-sa/3.0/ Creative Commons License]] |
− | This work is licensed under a [http://creativecommons.org/licenses/by-sa/3.0/ | + | This work is licensed under a [http://creativecommons.org/licenses/by-sa/3.0/ Creative Commons Attribution-ShareAlike License] |
=Current Version= | =Current Version= | ||
Line 57: | Line 57: | ||
==Version 1.0== | ==Version 1.0== | ||
+ | [http://www.iapr-tc11.org/dataset/KAIST_SceneText/KAIST_all.zip Complete Download] (the directory structure of the zip file reflects the structure below) (364 MB) | ||
* Korean Language | * Korean Language | ||
** Digital Camera | ** Digital Camera | ||
Line 69: | Line 70: | ||
** Mobile Phone | ** Mobile Phone | ||
*** Signboard | *** Signboard | ||
− | **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(M | + | **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(M,K)A-shadow.zip Shadow] (645.54 KB) |
− | **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(M | + | **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(M,K)B-light.zip Light] (336.4 KB) |
− | **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(M | + | **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(M,K)C-outdoor.zip Outdoor] (2.43 MB) |
− | **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(M | + | **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(M,K)D-indoor.zip Indoor] (1.40 MB) |
− | *** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(M | + | *** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(M,K)E-bookCover.zip Book Cover] (18.63 MB) |
− | *** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(M | + | *** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(M,K)F-others.zip Others] (430.55 KB) |
* English Language | * English Language | ||
** Digital Camera | ** Digital Camera | ||
Line 101: | Line 102: | ||
**** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.M)A-outdoor.zip Outdoor] (2.42 MB) | **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.M)A-outdoor.zip Outdoor] (2.42 MB) | ||
**** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.M)B-indoor.zip Indoor] (1.72 MB) | **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.M)B-indoor.zip Indoor] (1.72 MB) | ||
− | *** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.M) | + | *** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.M)C-bookCover.zip Book Cover] (3.21 MB) |
---- | ---- | ||
This page is editable only by [[IAPR-TC11:Reading_Systems#TC11_Officers|TC11 Officers ]]. | This page is editable only by [[IAPR-TC11:Reading_Systems#TC11_Officers|TC11 Officers ]]. |
Latest revision as of 12:02, 17 October 2012
Datasets -> Datasets List -> Current Page
|
Contents
Contact Author
Prof. Jin Hyung Kim Artificial Intelligence and Pattern Recognition Lab, Computer Science Department of KAIST, KOREA Tel: 82-42-350-3517 Email: Jkim @ kaist.ac.kr
Seonghun Lee Artificial Intelligence and Pattern Recognition Lab, Computer Science Department of KAIST, KOREA Email: leesh @ ai.kaist.ac.kr
License
This work is licensed under a Creative Commons Attribution-ShareAlike License
Current Version
1.0
Keywords
Scene Text, Korean, English, Signboard, Mobile phone image, Indoor image, Outdoor image
Description
The KAIST scene text dataset comprises 3000 images captured in different environments, including outdoors and indoors scenes under different lighting conditions (clear day, night, strong artificial lights, etc). Images were captured either by the use of a high-resolution digital camera or a low-resolution mobile phone camera. All images have been resized to 640x480.
The KAIST scene text database is categorized according to the language of the scene text captured: Korean, English (Number), and Mixed (Korean + English + Number). The scene text in the images is representative of common text in Korean streets or shops.
Related Ground Truth Data
Related Tasks
References
- Jehyun Jung, SeongHun Lee, Min Su Cho, and Jin Hyung Kim, “Touch TT: Scene Text Extractor Using Touch Screen Interface“, ETRI Journal 2011
- SeongHun Lee, Min Su Cho, Kyomin Jung, and Jin Hyung Kim, "Scene Text Extraction with Edge Constraint and Text Collinearity Link," 20th International Conference on Pattern Recognition (ICPR), August 2010, Istanbul, Turkey.
Submitted Files
Version 1.0
Complete Download (the directory structure of the zip file reflects the structure below) (364 MB)
- Korean Language
- Digital Camera
- Signboard
- Book Cover (6.18 MB)
- Others (7.93 MB)
- Mobile Phone
- Digital Camera
- English Language
- Mixed Language Content
- Digital Camera
- Mobile Phone
- Signboard
- Book Cover (3.21 MB)
This page is editable only by TC11 Officers .