Creator(s)
Entangled Histories project (National Library of the Netherlands)
Language(s)
Centuries
16th, 17th, 18th
CER on Validation Set
1.4%
Size (Nr. of Words)
88,105
Model ID
26138
This model is based on printed texts in the Roman-type fonts that were used in the
Low Countries, during the late 16th, 17th, 18th and 19th century. Some pages may have contained
(properly) transcribed Gothic font; as well as French or Latin texts have been included to ensure
the (more or less) proper transcription of words in those languages when occuring.
The type of sources used for this model, are books of ordinances,
which contained the norms (‘laws’) at the time.
This model has been the result of one of the KB National Library of the Netherlands
Researcher-in-Residence position 2019. The project was called ‘Entangled Histories’.
For more information regarding the background of the model and how to cite it,
please visit: www.https://lab.kb.nl/dataset/entangled-histories-ordinances-low-countries
The model has been trained on 88100 words and the CER on the validation set is 1.4%.
You can use this model to automatically transcribe Handwritten documents with Handwritten Text Recgnition in Transkribus.