Contemporary Basque Student Handwritten

Model details

Creator(s)

mikel.iruskieta@ehu.eus

Language(s)

Basque

Centuries

CER on Validation Set

6.07%

Size (Nr. of Words)

51,195

Model ID

185185

About this Model

The Contemporary Basque Student Handwritten model is a Transkribus model for transcribing learners' handwriting in Basque. It achieves a Character Error Rate (CER) of 4.77% on the training set and 6.07% on the validation set. The training corpus contains 51,195 words of school texts written in Basque by adolescent students aged 12–16, collected from various schools in the Basque Autonomous Community in 2023. Errors in the students' original writing were preserved and transcribed verbatim. Further details and an accompanying research paper will be made available soon. The model was trained by Mikel Iruskieta (HiTZ - Ixa, UPV/EHU) and Roberto Arias-Hermoso (Mondragon Unibertsitatea).
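For readers unfamiliar with the metric, CER is commonly defined as the character-level edit distance between the model output and the reference transcription, divided by the length of the reference. The sketch below illustrates that common definition only. It is not Transkribus's own implementation, which may normalise text or aggregate over a whole set differently, and the function names and sample strings are made up for illustration.

```python
def edit_distance(reference: str, hypothesis: str) -> int:
    """Levenshtein distance: minimum number of character insertions,
    deletions, and substitutions needed to turn hypothesis into reference."""
    previous = list(range(len(hypothesis) + 1))
    for i, ref_char in enumerate(reference, start=1):
        current = [i]
        for j, hyp_char in enumerate(hypothesis, start=1):
            substitution_cost = 0 if ref_char == hyp_char else 1
            current.append(min(
                previous[j] + 1,                      # deletion
                current[j - 1] + 1,                   # insertion
                previous[j - 1] + substitution_cost,  # match or substitution
            ))
        previous = current
    return previous[-1]


def cer(reference: str, hypothesis: str) -> float:
    """Character Error Rate as a fraction of the reference length."""
    if not reference:
        raise ValueError("reference transcription must not be empty")
    return edit_distance(reference, hypothesis) / len(reference)


# Hypothetical example: one wrong character in a five-character word.
print(f"{cer('etxea', 'etxia'):.2%}")  # prints 20.00%
```

Under this definition, a validation CER of 6.07% corresponds to roughly six character edits per hundred reference characters.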

Try it out

Contemporary Basque Student Handwritten is freely available to everyone

You can use this model to automatically transcribe handwritten documents with Handwritten Text Recognition in Transkribus.