Multi-oriented touching text character segmentation in graphical documents using dynamic programming

Partha Pratim Roy, Umapada Pal, Josep Lladós, Mathieu Delalandre

Research output: Contribution to journalArticleResearchpeer-review

33 Citations (Scopus)

Abstract

The touching character segmentation problem becomes complex when touching strings are multi-oriented. Moreover in graphical documents sometimes characters in a single-touching string have different orientations. Segmentation of such complex touching is more challenging. In this paper, we present a scheme towards the segmentation of English multi-oriented touching strings into individual characters. When two or more characters touch, they generate a big cavity region in the background portion. Based on the convex hull information, at first, we use this background information to find some initial points for segmentation of a touching string into possible primitives (a primitive consists of a single character or part of a character). Next, the primitives are merged to get optimum segmentation. A dynamic programming algorithm is applied for this purpose using the total likelihood of characters as the objective function. A SVM classifier is used to find the likelihood of a character. To consider multi-oriented touching strings the features used in the SVM are invariant to character orientation. Experiments were performed in different databases of real and synthetic touching characters and the results show that the method is efficient in segmenting touching characters of arbitrary orientations and sizes. © 2011 Elsevier Ltd. All Rights Reserved.
Original languageEnglish
Pages (from-to)1972-1983
JournalPattern Recognition
Volume45
Issue number5
DOIs
Publication statusPublished - 1 May 2012

Keywords

  • Dynamic programming
  • Multi-oriented character recognition
  • Touching character segmentation

Fingerprint Dive into the research topics of 'Multi-oriented touching text character segmentation in graphical documents using dynamic programming'. Together they form a unique fingerprint.

Cite this