Powered by OpenAIRE graph
Found an issue? Give us feedback
ZENODOarrow_drop_down
ZENODO
Model . 2024
License: CC BY
Data sources: Datacite
ZENODO
Model . 2023
License: CC BY
Data sources: Datacite
versions View all 2 versions
addClaim

This Research product is the result of merged Research products in OpenAIRE.

You have already added 0 works in your ORCID record related to the merged Research product.

CATMuS Medieval

Authors: Pinche, Ariane; Clérice, Thibault; Chagué, Alix; Camps, Jean-Baptiste; Vlachou-Efstathiou, Malamatenia; Gille Levenson, Matthias; Brisville-Fertin, Olivier; +6 Authors
Abstract

CATMuS (Consistent Approach to Transcribing ManuScript) Medieval is a Kraken HTR model trained on four different languages (in descending order of importance in the dataset: Old and Middle French, Latin, Spanish (and other languages of Spain), Italian) on strictly graphematic transcriptions. No abbreviations are resolved. This model is the result of the collaboration from researchers from CREMMA, GalliCorpora, HTRomance and DEEDS projects. It follows the CREMMA Guidelines (Supplemented by the CREMMA Medii Aevi) and will be consolidated under the CATMuS Medieval Guidelines in an upcoming paper. The model is trained with NFD Unicode normalization: each diacritic (including superscripts) are transcribed as their own characters, separately from the "main" character. Data See https://huggingface.co/datasets/CATMuS/medieval All source datasets and papers are referenced in the related works section, all transcribers are mentioned in the collaborators section, all partner-project members are mentioned as authors. Fundings CREMMA, DIM MAP, Région Île-de-France CremmaLab, DIM MAP, Région Île-de-France GalliCorpora, Datalab, Bibliothèque nationale de France HTRomance, Datalab, Bibliothèque nationale de France Text as Image, Image as Text: Charter integrity and topic modelling, SSHRCC 1350911 Les Décades de Bersuire, première traduction française de l'Histoire romaine de Tite-Live – LiBer, ANR 21-CE27-0008 Projet Fabliaux, Biblissima+, ANR 21-ESRE-0005

Keywords

handwritten text recognition, kraken_pytorch, middle ages, htr

  • BIP!
    Impact byBIP!
    citations
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    2
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
citations
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
2
Average
Average
Average