Novel line verification for multiple instance focused retrieval in document collections

Hongxing Gao, Marcal Rusinol, Dimosthenis Karatzas, Josep Llados, Rajiv Jain, David Doermann

Research output: Chapter in BookChapterResearchpeer-review

Abstract

Spatial verification is typically employed to check the spatial consistency among matched local features and to remove outliers. However, when looking for multiple instances of the query within a target image, RANSAC algorithms which are widely applied in many one-to-one matching applications might fail due to the large proportion of 'outliers' - correct matches corresponding to other instances. On the other hand, geometrical verification methods are more robust to outliers but usually suffer from high computational costs. In this paper, we introduce a novel two-step line verification method which is more flexible than existing methods and leads to lower computational complexity especially when multiple instances of a query are sought. We study this approach within an information extraction scenario, where the objective is to locate document structures indicative of certain type of information (e.g. different records on invoices).

Original languageEnglish
Title of host publication13th IAPR International Conference on Document Analysis and Recognition, ICDAR 2015 - Conference Proceedings
Pages481-485
Number of pages5
ISBN (Electronic)9781479918058
DOIs
Publication statusPublished - 20 Nov 2015

Publication series

NameProceedings of the International Conference on Document Analysis and Recognition, ICDAR
Volume2015-November
ISSN (Print)1520-5363

Fingerprint

Dive into the research topics of 'Novel line verification for multiple instance focused retrieval in document collections'. Together they form a unique fingerprint.

Cite this