TY - JOUR
T1 - FAST
T2 - Facilitated and Accurate Scene Text Proposals through FCN Guided Pruning
AU - Bazazian, Dena
AU - Gómez, Raúl
AU - Nicolaou, Anguelos
AU - Gómez, Lluís
AU - Karatzas, Dimosthenis
AU - Bagdanov, Andrew D.
N1 - Funding Information:
This work was supported by the CERCA Programme of the Generalitat de Catalunya, the research project TIN2014-52072-P and the Eurecat Catalan Technology Center.
Publisher Copyright:
© 2017 Elsevier B.V.
PY - 2019/3/1
Y1 - 2019/3/1
N2 - Class-specific text proposal algorithms can efficiently reduce the search space for possible text object locations in an image. In this paper we combine the Text Proposals algorithm with Fully Convolutional Networks to efficiently reduce the number of proposals while maintaining the same recall level and thus gaining a significant speed up. Our experiments demonstrate that such text proposal approaches yield significantly higher recall rates than state-of-the-art text localization techniques, while also producing better-quality localizations. Our results on the ICDAR 2015 Robust Reading Competition (Challenge 4) and the COCO-text datasets show that, when combined with strong word classifiers, this recall margin leads to state-of-the-art results in end-to-end scene text recognition.
AB - Class-specific text proposal algorithms can efficiently reduce the search space for possible text object locations in an image. In this paper we combine the Text Proposals algorithm with Fully Convolutional Networks to efficiently reduce the number of proposals while maintaining the same recall level and thus gaining a significant speed up. Our experiments demonstrate that such text proposal approaches yield significantly higher recall rates than state-of-the-art text localization techniques, while also producing better-quality localizations. Our results on the ICDAR 2015 Robust Reading Competition (Challenge 4) and the COCO-text datasets show that, when combined with strong word classifiers, this recall margin leads to state-of-the-art results in end-to-end scene text recognition.
KW - Fully convolutional networks
KW - Scene text images
KW - Text proposals
UR - https://www.scopus.com/pages/publications/85028944701
U2 - 10.1016/j.patrec.2017.08.030
DO - 10.1016/j.patrec.2017.08.030
M3 - Article
AN - SCOPUS:85028944701
SN - 0167-8655
VL - 119
SP - 112
EP - 120
JO - Pattern Recognition Letters
JF - Pattern Recognition Letters
ER -