Contemporary Student Handwritten: Basque, English and Spanish

Model details

Creator(s)

mikel.iruskieta@ehu.eus

Language(s)

English, Basque, Castilian

Centuries

20th, 21st

CER on Validation Set

7.68%

Size (Nr. of Words)

96,931

Model ID

187725

About this Model

The Contemporary Student Handwritten model is a multilingual AI model in Transkribus designed to transcribe learners' handwriting in Basque, Spanish, and English. It achieves a Character Error Rate (CER) of 4.46% on the training set and 7.68% on the validation set. The dataset consists of school-based texts written by adolescent students aged 12–16. Original errors in the handwriting were preserved and transcribed verbatim. The model was trained on a corpus of 96,931 words across the three languages, collected from various schools in the Basque Autonomous Community in 2023. Further details and an accompanying research paper will be made available soon. The model training was conducted by Mikel Iruskieta (HiTZ - Ixa, UPV/EHU) and Roberto Arias-Hermoso (Mondragon Unibertsitatea).

Try it out

Contemporary Student Handwritten: Basque, English and Spanish is freely available to everyone

You can use this model to automatically transcribe Handwritten documents with Handwritten Text Recgnition in Transkribus.