Ground truth for layout analysis performance evaluation

A. Antonacopoulos*, D. Karatzas, D. Bridson

*Corresponding author for this work

Research output: Chapter in book; Chapter; Research; Peer-reviewed

26 Citations (Scopus)

Abstract

Over the past two decades a significant number of layout analysis (page segmentation and region classification) approaches have been proposed in the literature. Each approach has been devised for and/or evaluated using (usually small) application-specific datasets. While the need for objective performance evaluation of layout analysis algorithms is evident, there does not exist a suitable dataset with ground truth that reflects the realities of everyday documents (widely varying layouts, complex entities, colour, noise etc.). The most significant impediment is the creation of accurate and flexible (in representation) ground truth, a task that is costly and must be carefully designed. This paper discusses the issues related to the design, representation and creation of ground truth in the context of a realistic dataset developed by the authors. The effectiveness of the ground truth discussed in this paper has been successfully shown in its use for two international page segmentation competitions (ICDAR2003 and ICDAR2005).

Original language: English
Title of host publication: Document Analysis Systems VII - 7th International Workshop, DAS 2006, Proceedings
Pages: 302-311
Number of pages: 10
Volume: 3872
ISBN (electronic): 978-3-540-32157-6
Publication status: Published - 2006

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 3872 LNCS
ISSN (print): 0302-9743
ISSN (electronic): 1611-3349
