TY - CHAP
T1 - LayeredDoc
T2 - Domain Adaptive Document Restoration with a Layer Separation Approach
AU - Pilligua, Maria
AU - Biescas, Nil
AU - Vazquez-Corral, Javier
AU - Lladós, Josep
AU - Valveny, Ernest
AU - Biswas, Sanket
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2024.
PY - 2024/11/11
Y1 - 2024/11/11
N2 - The rapid evolution of intelligent document processing systems demands robust solutions that adapt to diverse domains without extensive retraining. Traditional methods often falter with variable document types, leading to poor performance. To overcome these limitations, this paper introduces a text-graphic layer separation approach that enhances domain adaptability in document image restoration (DIR) systems. We propose LayeredDoc, which utilizes two layers of information: the first targets coarse-grained graphic components, while the second refines machine-printed textual content. This hierarchical DIR framework dynamically adjusts to the characteristics of the input document, facilitating effective domain adaptation. We evaluated our approach both qualitatively and quantitatively using a new real-world dataset, LayeredDocDB, developed for this study. Initially trained on a synthetically generated dataset, our model demonstrates strong generalization capabilities for the DIR task, offering a promising solution for handling variability in real-world data. Our code is accessible on this GitHub(https://github.com/mpilligua/LayeredDoc).
AB - The rapid evolution of intelligent document processing systems demands robust solutions that adapt to diverse domains without extensive retraining. Traditional methods often falter with variable document types, leading to poor performance. To overcome these limitations, this paper introduces a text-graphic layer separation approach that enhances domain adaptability in document image restoration (DIR) systems. We propose LayeredDoc, which utilizes two layers of information: the first targets coarse-grained graphic components, while the second refines machine-printed textual content. This hierarchical DIR framework dynamically adjusts to the characteristics of the input document, facilitating effective domain adaptation. We evaluated our approach both qualitatively and quantitatively using a new real-world dataset, LayeredDocDB, developed for this study. Initially trained on a synthetically generated dataset, our model demonstrates strong generalization capabilities for the DIR task, offering a promising solution for handling variability in real-world data. Our code is accessible on this GitHub(https://github.com/mpilligua/LayeredDoc).
KW - Document Image Restoration
KW - Domain Adaptation
KW - Layer Separation
KW - Text-Graphic Separation
UR - http://www.scopus.com/inward/record.url?scp=85204539008&partnerID=8YFLogxK
UR - https://portalrecerca.uab.cat/en/publications/59c3c7b3-9966-4118-8e97-0851844de7a7
U2 - 10.1007/978-3-031-70645-5_3
DO - 10.1007/978-3-031-70645-5_3
M3 - Chapter
AN - SCOPUS:85204539008
SN - 9783031706455
SN - 9783031706448
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 27
EP - 39
BT - Document Analysis and Recognition – ICDAR 2024 Workshops, Proceedings
A2 - Mouchère, Harold
A2 - Zhu, Anna
PB - Springer Science and Business Media Deutschland GmbH
ER -