TY - CHAP
T1 - Text segmentation in colour posters from the Spanish Civil War era
AU - Clavelli, Antonio
AU - Karatzas, Dimosthenis
PY - 2009
Y1 - 2009
N2 - The extraction of textual content from colour documents of a graphical nature is a complicated task. The text can be rendered in any colour, size and orientation while the existence of complex background graphics with repetitive patterns can make its localization and segmentation extremely difficult. Here, we propose a new method for extracting textual content from such colour images that makes no assumption as to the size of the characters, their orientation or colour, while it is tolerant to characters that do not follow a straight baseline. We evaluate this method on a collection of documents with historical connotations: the Posters from the Spanish Civil War.
AB - The extraction of textual content from colour documents of a graphical nature is a complicated task. The text can be rendered in any colour, size and orientation while the existence of complex background graphics with repetitive patterns can make its localization and segmentation extremely difficult. Here, we propose a new method for extracting textual content from such colour images that makes no assumption as to the size of the characters, their orientation or colour, while it is tolerant to characters that do not follow a straight baseline. We evaluate this method on a collection of documents with historical connotations: the Posters from the Spanish Civil War.
UR - http://www.scopus.com/inward/record.url?scp=71249152015&partnerID=8YFLogxK
U2 - 10.1109/ICDAR.2009.32
DO - 10.1109/ICDAR.2009.32
M3 - Chapter
AN - SCOPUS:71249152015
SN - 9780769537252
T3 - Proceedings of the International Conference on Document Analysis and Recognition, ICDAR
SP - 181
EP - 185
BT - ICDAR2009 - 10th International Conference on Document Analysis and Recognition
ER -