Synthetic Data for the Analysis of Archival Documents: Handwriting Determination

Published on 26 October 2020
Updated on 14 February 2026
1 min read
research

Synthetic Data for the Analysis of Archival Documents: Handwriting Determination

This page contains all data necessary to generate training data, train a model, or use a trained model for our paper.

Trained Model

model.zip contains the pre-trained model that we used for our experiments.

Training Data

training_data.zip contains all training data we used to create our model in model.zip. Be aware, the file is quite large (> 3GB).

Generating Your Own Data

generation_data.zip contains the directory structure necessary to work with our data generator. Because of Copyright issues, we are not able to provide you with all data we used. However, we supply information on how to get the data in each sub-directory in a README.