Creator(s)
achim.rabus@slavistik.uni-freiburg.de
Language(s)
Russian
Centuries
CER on Validation Set
5.8%
Size (Nr. of Words)
484,429
Model ID
45595
This second version of a generic model for handwritten Russian (predominantly late 19th/early 20th century) was trained as part of the MultiHTR project (Freiburg/Germany, www.multihtr.uni-freiburg.de). It incorporates models trained by the Estonian State Archive and the Hamburg-based INEL project. Portions of the GT data have kindly been provided by the Prozhito project (Saint Petersburg) and the Ukraine RD of JewishGen, USA.
In some of the GT transcriptions, pre-1918 letters have been represented faithfully, while in other GT transcriptions, they have been replaced with their modern equivalents.
We expanded the first version of the Russian generic model by adding the Russian Civil Records model by IAJGS, USA as well as several additional sources from the Prozhito database. Moreover, we incorporated data from the HKR dataset (https://github.com/abdoelsayed2016/HKR_Dataset).
You can use this model to automatically transcribe Handwritten documents with Handwritten Text Recgnition in Transkribus.