General Portuguese

Model details

Creator(s)

luciafwx@icloud.com

Language(s)

Portuguese

Centuries

0th, 1st, 2nd, 3rd, 4th, 5th, 6th, 7th, 8th, 9th, 10th, 11th, 12th, 13th, 14th, 15th, 16th, 17th, 18th, 19th, 20th, 21st

CER on Validation Set

3.8%

Size (Nr. of Words)

64,842

Model ID

44949

About this Model

This is a combined model of two Portuguese sources housed at the Portuguese National Archive Torre do Tombo and State Archive of Bahia, Brazil.

There are handwritten and printed scripts of the Inquisition from Torre do Tombo together with the Notarial Books of Salvador da Bahia. To make the model even more general, a printed script from the middle of the 17th Century was added. Some documents are damaged.

This is the first attempt to create a general Portuguese Model with 64,842 words. Two different projects are working on the collections independently. As they progress in their work, new general models will be created, to attend to other collections.

Try it out

General Portuguese is freely available to everyone

You can use this model to automatically transcribe Handwritten documents with Handwritten Text Recgnition in Transkribus.