Embedding document structure to bag-of-words through pair-wise stable key-regions

Hongxing Gao, Marçal Rusiñol, Dimosthenis Karatzas, Josep Lladós

Producción científica: Capítulo del libroCapítuloInvestigaciónrevisión exhaustiva

1 Cita (Scopus)

Resumen

Since the document structure carries valuable discriminative information, plenty of efforts have been made for extracting and understanding document structure among which layout analysis approaches are the most commonly used. In this paper, Distance Transform based MSER (DTMSER) is employed to efficiently extract the document structure as a dendrogram of key-regions which roughly correspond to structural elements such as characters, words and paragraphs. Inspired by the Bag of Words (BoW) framework, we propose an efficient method for structural document matching by representing the document image as a histogram of key-region pairs encoding structural relationships. Applied to the scenario of document image retrieval, experimental results demonstrate a remarkable improvement when comparing the proposed method with typical BoW and pyramidal BoW methods.

Idioma originalInglés
Título de la publicación alojadaProceedings - International Conference on Pattern Recognition
EditorialInstitute of Electrical and Electronics Engineers Inc.
Páginas2903-2908
Número de páginas6
ISBN (versión digital)9781479952083
DOI
EstadoPublicada - 4 dic 2014

Serie de la publicación

NombreProceedings - International Conference on Pattern Recognition
ISSN (versión impresa)1051-4651

Huella

Profundice en los temas de investigación de 'Embedding document structure to bag-of-words through pair-wise stable key-regions'. En conjunto forman una huella única.

Citar esto