Multi-oriented touching text character segmentation in graphical documents using dynamic programming

Partha Pratim Roy, Umapada Pal, Josep Lladós, Mathieu Delalandre

Producción científica: Contribución a una revistaArtículoInvestigaciónrevisión exhaustiva

38 Citas (SciVal)

Resumen

The touching character segmentation problem becomes complex when touching strings are multi-oriented. Moreover in graphical documents sometimes characters in a single-touching string have different orientations. Segmentation of such complex touching is more challenging. In this paper, we present a scheme towards the segmentation of English multi-oriented touching strings into individual characters. When two or more characters touch, they generate a big cavity region in the background portion. Based on the convex hull information, at first, we use this background information to find some initial points for segmentation of a touching string into possible primitives (a primitive consists of a single character or part of a character). Next, the primitives are merged to get optimum segmentation. A dynamic programming algorithm is applied for this purpose using the total likelihood of characters as the objective function. A SVM classifier is used to find the likelihood of a character. To consider multi-oriented touching strings the features used in the SVM are invariant to character orientation. Experiments were performed in different databases of real and synthetic touching characters and the results show that the method is efficient in segmenting touching characters of arbitrary orientations and sizes. © 2011 Elsevier Ltd. All Rights Reserved.
Idioma originalInglés
Páginas (desde-hasta)1972-1983
PublicaciónPattern Recognition
Volumen45
N.º5
DOI
EstadoPublicada - 1 may 2012

Huella

Profundice en los temas de investigación de 'Multi-oriented touching text character segmentation in graphical documents using dynamic programming'. En conjunto forman una huella única.

Citar esto