Multi-oriented touching text character segmentation in graphical documents using dynamic programming

Partha Pratim Roy, Umapada Pal, Josep Lladós, Mathieu Delalandre

Producció científica: Contribució a revistaArticleRecercaAvaluat per experts

37 Cites (Scopus)

Resum

The touching character segmentation problem becomes complex when touching strings are multi-oriented. Moreover in graphical documents sometimes characters in a single-touching string have different orientations. Segmentation of such complex touching is more challenging. In this paper, we present a scheme towards the segmentation of English multi-oriented touching strings into individual characters. When two or more characters touch, they generate a big cavity region in the background portion. Based on the convex hull information, at first, we use this background information to find some initial points for segmentation of a touching string into possible primitives (a primitive consists of a single character or part of a character). Next, the primitives are merged to get optimum segmentation. A dynamic programming algorithm is applied for this purpose using the total likelihood of characters as the objective function. A SVM classifier is used to find the likelihood of a character. To consider multi-oriented touching strings the features used in the SVM are invariant to character orientation. Experiments were performed in different databases of real and synthetic touching characters and the results show that the method is efficient in segmenting touching characters of arbitrary orientations and sizes. © 2011 Elsevier Ltd. All Rights Reserved.
Idioma originalAnglès
Pàgines (de-a)1972-1983
RevistaPattern Recognition
Volum45
Número5
DOIs
Estat de la publicacióPublicada - 1 de maig 2012

Fingerprint

Navegar pels temes de recerca de 'Multi-oriented touching text character segmentation in graphical documents using dynamic programming'. Junts formen un fingerprint únic.

Com citar-ho