Fast structural matching for document image retrieval through spatial databases

Hongxing Gao, Marçal Rusiñol, Dimosthenis Karatzas, Josep Lladós

Research output: Book chapter › Chapter › Research › Peer-reviewed

1 Citation (Scopus)

Abstract

The structure of document images plays a significant role in document analysis, so considerable effort has been devoted to extracting and understanding document structure, usually in the form of layout analysis approaches. In this paper, we first employ Distance Transform based MSER (DTMSER) to efficiently extract stable document structural elements in terms of a dendrogram of key-regions. We then propose a fast structural matching method that queries the structure of a document (its dendrogram) through a spatial database, which facilitates the formulation of advanced spatial queries. The experiments demonstrate a significant improvement in a document retrieval scenario when compared to the use of typical Bag of Words (BoW) and pyramidal BoW descriptors.
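
The idea of querying a hierarchy of key-regions spatially can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes each key-region is reduced to an axis-aligned bounding box, replaces the spatial database with a brute-force scan, and uses hypothetical names throughout (Region, parent_of, query_within, and the sample coordinates).

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Region:
    """Axis-aligned bounding box standing in for a key-region (assumption)."""
    name: str
    x0: float
    y0: float
    x1: float
    y1: float

    def area(self) -> float:
        return (self.x1 - self.x0) * (self.y1 - self.y0)

    def contains(self, other: "Region") -> bool:
        # True when `other` lies entirely inside this box and is a different region.
        return (self != other
                and self.x0 <= other.x0 and self.y0 <= other.y0
                and self.x1 >= other.x1 and self.y1 >= other.y1)


def parent_of(region: Region, regions: List[Region]) -> Optional[Region]:
    """Smallest enclosing region, i.e. the parent in a containment hierarchy."""
    enclosing = [r for r in regions if r.contains(region)]
    return min(enclosing, key=Region.area, default=None)


def query_within(window: Region, regions: List[Region]) -> List[Region]:
    """Spatial containment query: regions lying entirely inside `window`."""
    return [r for r in regions if window.contains(r)]


# Hypothetical sample data: nested regions of a single page.
regions = [
    Region("page", 0, 0, 100, 140),
    Region("column", 5, 10, 50, 130),
    Region("paragraph", 5, 10, 50, 40),
    Region("word", 6, 12, 20, 16),
]

window = Region("window", 0, 0, 60, 50)
print([r.name for r in query_within(window, regions)])
print(parent_of(regions[3], regions).name)
```

A real spatial database would answer the same containment question through a spatial index rather than a linear scan, which is what makes such queries fast on large collections.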

Original language: English
Title of host publication: Proceedings of SPIE-IS and T Electronic Imaging - Document Recognition and Retrieval XXI
Publication status: Published - 2014

Publication series

Name: Proceedings of SPIE - The International Society for Optical Engineering
Volume: 9021
ISSN (print): 0277-786X
ISSN (electronic): 1996-756X
