TY - BOOK
T1 - Text/graphic separation using a sparse representation with multi-learned dictionaries
AU - Do, T.-H.
AU - Tabbone, S.
AU - Ramos-Terrades, O.
PY - 2012
Y1 - 2012
N2 - In this paper, we propose a new approach to extract text regions from graphical documents. In our method, we first empirically construct two sequences of learned dictionaries for the text and graphical parts respectively. Then, we compute the sparse representations of all different sizes and non-overlapped document patches in these learned dictionaries. Based on these representations, each patch can be classified into the text or graphic category by comparing its reconstruction errors. Same-sized patches in one category are then merged together to define the corresponding text or graphic layers which are combined to create a final text/graphic layer. Finally, in a post-processing step, text regions are further filtered out by using some learned thresholds.
AB - In this paper, we propose a new approach to extract text regions from graphical documents. In our method, we first empirically construct two sequences of learned dictionaries for the text and graphical parts respectively. Then, we compute the sparse representations of all different sizes and non-overlapped document patches in these learned dictionaries. Based on these representations, each patch can be classified into the text or graphic category by comparing its reconstruction errors. Same-sized patches in one category are then merged together to define the corresponding text or graphic layers which are combined to create a final text/graphic layer. Finally, in a post-processing step, text regions are further filtered out by using some learned thresholds.
UR - http://www.scopus.com/inward/record.url?eid=2-s2.0-84874565450&partnerID=MN8TOARS
M3 - Proceeding
SN - 978-499064410-9
BT - Text/graphic separation using a sparse representation with multi-learned dictionaries
ER -