TY - CHAP
T1 - ICDAR 2019 competition on large-scale street view text with partial labeling-RRC-LSVT
AU - Sun, Yipeng
AU - Karatzas, Dimosthenis
AU - Chan, Chee Seng
AU - Jin, Lianwen
AU - Ni, Zihan
AU - Chng, Chee Kheng
AU - Liu, Yuliang
AU - Luo, Canjie
AU - Ng, Chun Chet
AU - Han, Junyu
AU - Ding, Errui
AU - Liu, Jingtuo
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019/9
Y1 - 2019/9
N2 - Robust text reading from street view images provides valuable information for various applications. Performance improvement of existing methods in such a challenging scenario heavily relies on the amount of fully annotated training data, which is costly and in-efficient to obtain. To scale up the amount of training data while keeping the labeling procedure cost-effective, this competition introduces a new challenge on Large-scale Street View Text with Partial Labeling (LSVT), providing 5,0000 and 400,000 images in full and weak annotations, respectively. This competition aims to explore the abilities of state-of-the-art methods to detect and recognize text instances from large-scale street view images, closing gaps between research benchmarks and real applications. During the competition period, a total number of 41 teams participate in the two tasks with 132 valid submissions, i.e., text detection and end-to-end text spotting. This paper includes dataset descriptions, task definitions, evaluation protocols and results summaries of ICDAR 2019-LSVT challenge.
AB - Robust text reading from street view images provides valuable information for various applications. Performance improvement of existing methods in such a challenging scenario heavily relies on the amount of fully annotated training data, which is costly and in-efficient to obtain. To scale up the amount of training data while keeping the labeling procedure cost-effective, this competition introduces a new challenge on Large-scale Street View Text with Partial Labeling (LSVT), providing 5,0000 and 400,000 images in full and weak annotations, respectively. This competition aims to explore the abilities of state-of-the-art methods to detect and recognize text instances from large-scale street view images, closing gaps between research benchmarks and real applications. During the competition period, a total number of 41 teams participate in the two tasks with 132 valid submissions, i.e., text detection and end-to-end text spotting. This paper includes dataset descriptions, task definitions, evaluation protocols and results summaries of ICDAR 2019-LSVT challenge.
KW - End-to-end text spotting
KW - Large-scale street view text
KW - Text detection
KW - Weak annotations
UR - https://www.scopus.com/pages/publications/85079895443
U2 - 10.1109/ICDAR.2019.00250
DO - 10.1109/ICDAR.2019.00250
M3 - Chapter
AN - SCOPUS:85079895443
T3 - Proceedings of the International Conference on Document Analysis and Recognition, ICDAR
SP - 1557
EP - 1562
BT - Proceedings - 15th IAPR International Conference on Document Analysis and Recognition, ICDAR 2019
ER -