Historical document image analysis using controlled data for pre-training

Rahal, Najoua; Vögtlin, Lars; Ingold, Rolf

doi:10.1007/s10032-023-00437-8

Back

Journal article

Historical document image analysis using controlled data for pre-training

BP2-STS

Rahal, Najoua ORCID Université de Fribourg
Vögtlin, Lars ORCID Université de Fribourg
Ingold, Rolf ORCID Université de Fribourg

2023

Published in:

International Journal on Document Analysis and Recognition (IJDAR). - Springer Science and Business Media LLC. - 2023, vol. 26, p. 241-254

English Using neural networks for semantic labeling has become a dominant technique for layout analysis of historical document images. However, to train or fine-tune appropriate models, large labeled datasets are needed. This paper addresses the case when only limited labeled data are available and promotes a novel approach using so-called controlled data to pre-train the networks. Two different strategies are proposed: The first addresses the real labeling task by using artificial data; the second uses real data to pre-train the networks with a pretext task. To assess these strategies, a large set of experiments has been carried out on a text line detection and classification task using different variants of U-Net. The observations, obtained from two different datasets, show that globally the approach reduces the training time while offering similar or better performance. Furthermore, the effect is bigger on lightweight network architectures.

Faculty

Faculté des sciences et de médecine

Department

Département d'Informatique

Language

English

Classification

Computer science and technology

License

CC BY

Open access status

hybrid

Identifiers

DOI 10.1007/s10032-023-00437-8
ISSN 1433-2833

Persistent URL

https://folia.unifr.ch/unifr/documents/334265

Statistics

Document views: 33 File downloads:

historicaldocumentimageanalysisusingcontrolleddataforpre-training_2023: 67

Journal article

Historical document image analysis using controlled data for pre-training

BP2-STS

Historical document images

Layout analysis

Neural networks

Training strategies

Statistics