Multipage document retrieval by textual and visual representations

Marcal Rusinol*, Dimosthenis Karatzas, Andrew D. Bagdanov, Josep Llados

*Corresponding author for this work

Research output: Chapter in BookChapterResearchpeer-review

20 Citations (Scopus)

Abstract

In this paper we present a multipage administrative document image retrieval system based on textual and visual representations of document pages. Individual pages are represented by textual or visual information using a bag-of-words framework. Different fusion strategies are evaluated which allow the system to perform multipage document retrieval on the basis of a single page retrieval system. Results are reported on a large dataset of document images sampled from a banking workflow.

Original languageEnglish
Title of host publicationICPR 2012 - 21st International Conference on Pattern Recognition
Pages521-524
Number of pages4
Publication statusPublished - 2012

Publication series

NameProceedings - International Conference on Pattern Recognition
ISSN (Print)1051-4651

Fingerprint

Dive into the research topics of 'Multipage document retrieval by textual and visual representations'. Together they form a unique fingerprint.

Cite this