Creator(s)
Gaelic Algorithmic Research Group
Language(s)
Centuries
20th
CER on Validation Set
1.9%
Size (Nr. of Words)
376,792
Model ID
48445
The models were trained upon roughly 2500 pages (ca. 400k words) of handwritten transcriptions from the School of Scottish Studies Archives (https://www.ed.ac.uk/information-services/library-museum-gallery/cultural-heritage-collections/school-scottish-studies-archives). The transcriptions are from recordings of hero tales and international tales in Scottish Gaelic, made between 1949 and 1979. They were gathered from tradition bearers living in the Scottish Highlands and Islands. The majority of the transcriptions are from one hand (~80%) with the remainder split across approximately 10 hands. Thus, the generalisability of the models is limited at present. We hope to diversify the training data in the future by adding further hands.
You can use this model to automatically transcribe Handwritten documents with Handwritten Text Recgnition in Transkribus.