Ground Truth for LRDE DBD OCR
From TC11
Datasets -> Datasets List -> Current Page
|
Contents
Keywords
scanned, magazine, documents, binarization
Description
125 binarized images for "clean documents".
Image groundtruths have been produced using a semi-automatic process: a global thresholding followed by some manual adjustments.
Purpose of the three document qualities :
- Original : evaluate the binarization quality on perfect documents mixing text and images.
- Clean : evaluate the binarization quality on perfect document with text only.
- Scanned : evaluate the binarization quality on slightly degraded documents with text only.
Related Dataset
Related Tasks
Submitted Files
Version 1.0
- Binarization groundtruth (0 Mb)
This page is editable only by TC11 Officers .