EM-based layout analysis method for structured documents

Francisco Cruz, Oriol Ramos Terrades

Research output: Book/ReportProceedingResearchpeer-review

8 Citations (Scopus)

Abstract

In this paper we present a method to perform layout analysis in structured documents. We proposed an EM-based algorithm to fit a set of Gaussian mixtures to the different regions according to the logical distribution along the page. After the convergence, we estimate the final shape of the regions according to the parameters computed for each component of the mixture. We evaluated our method in the task of record detection in a collection of historical structured documents and performed a comparison with other previous works in this task.

Original languageEnglish
PublisherInstitute of Electrical and Electronics Engineers Inc.
Number of pages6
ISBN (Electronic)9781479952083
DOIs
Publication statusPublished - 4 Dec 2014

Publication series

NameProceedings - International Conference on Pattern Recognition
ISSN (Print)1051-4651

Fingerprint

Dive into the research topics of 'EM-based layout analysis method for structured documents'. Together they form a unique fingerprint.

Cite this