Slovenian 18th and 19th century manuscripts

Model details

Creator(s)

matija.ogrin@zrc-sazu.si

Language(s)

Slovenian

Centuries

18th, 19th

CER on Validation Set

3.29%

Size (Nr. of Words)

170,159

Model ID

216113

About this Model

The model for the identification of Slovenian manuscripts of the late 18th and mid-19th century is based on manuscript texts of four Slovenian writers. The total training set consists of approx. 170,000 words: ~ 55,000: Konrad Branka, Franciscan friar, theologian and professor; late 18th century, ~ 20,000: Mihael Zagajšek, parish priest at Kalobje, preacher, spiritual writer, linguist, ~ 12,000: Tobias Vernik, Franciscan brother layman, mid-19th century, ~ 93,000: Ignazij Holzapfel, parish priest in Ribnica, preacher, spiritual writer. The size of the learning set for each writer varies according to the difficulty of the manuscript and the complexity of the hand. The most difficult handwriting is undoubtedly Holzapfel's, and therefore the most extensive training set is made for him. The model was prepared by Marko Kunavar and Matija Ogrin. This work was funded by the CLARIN.SI consortium (Jožef Stefan Institute) and the Research Centre of the Slovenian Academy of Sciences and Arts (ARIS programme P6-0024).

Try it out

Slovenian 18th and 19th century manuscripts is freely available to everyone

You can use this model to automatically transcribe Handwritten documents with Handwritten Text Recgnition in Transkribus.