Description

The CVL-database has 311 writers and was designed for writer retrieval and identification. The database consists of 7 (27 writers) respectively 5 (284 writers) different texts (101069 words at all). Additionally each page is labeled and provides the coordinates of the bounding boxes of each word (punctuations are not annotated) encoded using an XML-file. Thus, the CVL database can also be used for the evaluation of word-spotting methods. In contrast to the IAM database the number of pages of each writer is distributed more equally.

Evaluation Protocol

Evaluation for Writer Identification: It is suggested to use the evaluation metrics from the Writer Identification Contest (ICDAR)

Related Dataset

CVL-Database

Related Ground Truth Data

Bounding Boxes, IDs, and Transcription for the CVL Database

References

Markus Diem, Stefan Fiel, Florian Kleber and Robert Sablatnig, CVL-Database: An Off-line Database for Writer Retrieval, Writer Identification and Word Spotting, In Proc. of the 12th Int. Conference on Document Analysis and Recognition (ICDAR) 2013, forthcoming.

This page is editable only by TC11 Officers .

Navigation menu

Writer Identification and Word Spotting for the CVL Database

Contents

Description

Evaluation Protocol

Related Dataset

Related Ground Truth Data

References