Difference between revisions of "KAIST Scene Text Database"
(→Version 1.0) |
(→Version 1.0) |
||
(14 intermediate revisions by the same user not shown) | |||
Line 1: | Line 1: | ||
− | [[Datasets]] -> Current Page | + | [[Datasets]] -> [[Datasets List]] -> Current Page |
{| style="width: 100%" | {| style="width: 100%" | ||
Line 21: | Line 21: | ||
Email: Jkim @ kaist.ac.kr | Email: Jkim @ kaist.ac.kr | ||
− | + | Seonghun Lee | |
+ | Artificial Intelligence and Pattern Recognition Lab, | ||
+ | Computer Science Department of KAIST, KOREA | ||
+ | Email: leesh @ ai.kaist.ac.kr | ||
+ | |||
=License= | =License= | ||
− | [[Image: | + | [[Image:CC_BY-SA.png|right|link=http://creativecommons.org/licenses/by-sa/3.0/ Creative Commons License]] |
− | This work is licensed under a [http://creativecommons.org/licenses/by- | + | This work is licensed under a [http://creativecommons.org/licenses/by-sa/3.0/ Creative Commons Attribution-ShareAlike License] |
− | |||
=Current Version= | =Current Version= | ||
− | [[Image:KAIST_DB_Thumb.jpg|400px|thumb|right]] | + | [[Image:KAIST_DB_Thumb.jpg|400px|thumb|right| Example of images and ground truth information in the KAIST dataset.]] |
1.0 | 1.0 | ||
Line 38: | Line 41: | ||
The KAIST scene text dataset comprises 3000 images captured in different environments, including outdoors and indoors scenes under different lighting conditions (clear day, night, strong artificial lights, etc). Images were captured either by the use of a high-resolution digital camera or a low-resolution mobile phone camera. All images have been resized to 640x480. | The KAIST scene text dataset comprises 3000 images captured in different environments, including outdoors and indoors scenes under different lighting conditions (clear day, night, strong artificial lights, etc). Images were captured either by the use of a high-resolution digital camera or a low-resolution mobile phone camera. All images have been resized to 640x480. | ||
− | The KAIST scene text database is categorized according to the language of the scene text captured: Korean, English (Number), and | + | The KAIST scene text database is categorized according to the language of the scene text captured: Korean, English (Number), and Mixed (Korean + English + Number). The scene text in the images is representative of common text in Korean streets or shops. |
=Related Ground Truth Data= | =Related Ground Truth Data= | ||
Line 44: | Line 47: | ||
=Related Tasks= | =Related Tasks= | ||
− | * [[Scene Text | + | * [[Scene Text Localisation in the KAIST Dataset]] |
+ | * [[Scene Text Segmentation in the KAIST Dataset]] | ||
=References= | =References= | ||
Line 53: | Line 57: | ||
==Version 1.0== | ==Version 1.0== | ||
+ | [http://www.iapr-tc11.org/dataset/KAIST_SceneText/KAIST_all.zip Complete Download] (the directory structure of the zip file reflects the structure below) (364 MB) | ||
* Korean Language | * Korean Language | ||
** Digital Camera | ** Digital Camera | ||
Line 74: | Line 79: | ||
** Digital Camera | ** Digital Camera | ||
*** Signboard | *** Signboard | ||
− | **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E | + | **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E.S)A-shadow.zip Shadow] (2.16 MB) |
− | **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E | + | **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E.S)B-light.zip Light] (2.33 MB) |
− | **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E | + | **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E.S)C-outdoor1.zip Outdoor 1] (8.44 MB), [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E.S)C-outdoor2.zip Outdoor 2] (10.53 MB), [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E.S)C-outdoor3.zip Outdoor 3] (2.46 MB) |
− | **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E | + | **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E.S)D-indoor.zip Indoor] (8.44 MB) |
− | **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E | + | **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E.S)E-night.zip Night] (2.32 MB) |
− | *** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E | + | *** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E.S)F-others.zip Others] (6.17 MB) |
** Mobile Phone | ** Mobile Phone | ||
*** Signboard | *** Signboard | ||
− | **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E | + | **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E.M)A-outdoor.zip Outdoor] (828.2 KB) |
− | **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E | + | **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E.M)B-indoor.zip Indoor] (2.36 MB) |
− | *** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E | + | *** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(E.M)C-bookCover.zip Light] (2.22 MB) |
− | * | + | * Mixed Language Content |
** Digital Camera | ** Digital Camera | ||
− | **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C | + | **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.S)A-shadow.zip Shadow] (2.18 MB) |
− | **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C | + | **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.S)B-light.zip Light] (460.19 KB) |
− | **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C | + | **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.S)C-outdoor1.zip Outdoor 1] (24.34 MB), [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.S)C-outdoor2.zip Outdoor 2] (12.01 MB), [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.S)C-outdoor3.zip Outdoor 3] (11.69 MB), [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.S)C-outdoor4.zip Outdoor 4] (11.90 MB), [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.S)C-outdoor5.zip Outdoor 5] (11.73 MB), [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.S)C-outdoor6.zip Outdoor 6] (7.87 MB) |
− | **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C | + | **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.S)D-indoor1.zip Indoor 1] (10.94 MB), [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.S)D-indoor2.zip Indoor 2] (1.44 MB) |
− | **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C | + | **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.S)E-night.zip Night] (4.45 MB) |
− | *** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C | + | *** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.S)F-others.zip Others] (5.13 MB) |
** Mobile Phone | ** Mobile Phone | ||
*** Signboard | *** Signboard | ||
− | **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C | + | **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.M)A-outdoor.zip Outdoor] (2.42 MB) |
− | **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C | + | **** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.M)B-indoor.zip Indoor] (1.72 MB) |
− | *** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C | + | *** [http://www.iapr-tc11.org/dataset/KAIST_SceneText/(C.M)C-bookCover.zip Book Cover] (3.21 MB) |
---- | ---- | ||
This page is editable only by [[IAPR-TC11:Reading_Systems#TC11_Officers|TC11 Officers ]]. | This page is editable only by [[IAPR-TC11:Reading_Systems#TC11_Officers|TC11 Officers ]]. |
Latest revision as of 12:02, 17 October 2012
Datasets -> Datasets List -> Current Page
|
Contents
Contact Author
Prof. Jin Hyung Kim Artificial Intelligence and Pattern Recognition Lab, Computer Science Department of KAIST, KOREA Tel: 82-42-350-3517 Email: Jkim @ kaist.ac.kr
Seonghun Lee Artificial Intelligence and Pattern Recognition Lab, Computer Science Department of KAIST, KOREA Email: leesh @ ai.kaist.ac.kr
License
This work is licensed under a Creative Commons Attribution-ShareAlike License
Current Version
1.0
Keywords
Scene Text, Korean, English, Signboard, Mobile phone image, Indoor image, Outdoor image
Description
The KAIST scene text dataset comprises 3000 images captured in different environments, including outdoors and indoors scenes under different lighting conditions (clear day, night, strong artificial lights, etc). Images were captured either by the use of a high-resolution digital camera or a low-resolution mobile phone camera. All images have been resized to 640x480.
The KAIST scene text database is categorized according to the language of the scene text captured: Korean, English (Number), and Mixed (Korean + English + Number). The scene text in the images is representative of common text in Korean streets or shops.
Related Ground Truth Data
Related Tasks
References
- Jehyun Jung, SeongHun Lee, Min Su Cho, and Jin Hyung Kim, “Touch TT: Scene Text Extractor Using Touch Screen Interface“, ETRI Journal 2011
- SeongHun Lee, Min Su Cho, Kyomin Jung, and Jin Hyung Kim, "Scene Text Extraction with Edge Constraint and Text Collinearity Link," 20th International Conference on Pattern Recognition (ICPR), August 2010, Istanbul, Turkey.
Submitted Files
Version 1.0
Complete Download (the directory structure of the zip file reflects the structure below) (364 MB)
- Korean Language
- Digital Camera
- Signboard
- Book Cover (6.18 MB)
- Others (7.93 MB)
- Mobile Phone
- Digital Camera
- English Language
- Mixed Language Content
- Digital Camera
- Mobile Phone
- Signboard
- Book Cover (3.21 MB)
This page is editable only by TC11 Officers .