ZENODO
Dataset . 2025
License: CC BY
Data sources: ZENODO, Datacite

CLEF and FLLex: resources for Latin to French computerized forward reconstruction

Authors: Marr, Clayton;

Abstract

This repository is a resource for computerized forward reconstruction (CFR) from Latin to French. It pairs the lexica FLLex and FLLAPS with three cascades: the baseline cascades BaseCLEF and BaseCLEFstar, and the "debugged" cascade DiaCLEF. A cascade here is an ordered set of sound changes that a CFR system can apply to a lexicon to produce predicted regular outcomes. BaseCLEF is based on Mildred K. Pope's "From Latin to Modern French with Especial Consideration of Anglo-Norman: Phonology and Morphology", and BaseCLEFstar is a version thereof with "non-interesting errors" fixed (more information on these can be found in the README in this repository and in relevant publications). DiaCLEF is a "debugged" version of BaseCLEFstar, fixed using the CFR and diagnostic/debugging system DiaSim (GitHub repository linked). Further details on DiaSim, the debugging process, and these resources can be found in Marr and Mortensen 2020 and Marr and Mortensen 2023. The fixes made while debugging French relative chronology with CFR both independently reproduced fixes made in the literature (thus mutually corroborating past work and the CFR-debugging method) and led to new discoveries, such as the initial velar voicing development detailed in Marr 2024. Formatting for lexica files is described on the respective DiaSim wiki page (linked). FLLex includes pairs of Latin forms and their French reflexes, delimited by a comma, with the phones in each form delimited by spaces, and a comment flagged by '$' at the end containing lexical info. FLLAPS includes Latin words and their French reflexes, as well as their forms at four intermediate stages. Most of the words in each lexicon were drawn from Pope 1934; the remainder, added to exemplify certain unrepresented or underrepresented phonetic sequences, were drawn from Alain Rey's 2013 etymological dictionary of French.
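As an illustration of the FLLex line layout described above (Latin form, comma, French reflex, then a '$'-flagged comment, with phones separated by spaces), the following is a minimal parsing sketch. The names `LexEntry` and `parse_fllex_line` are hypothetical, and the sample line is invented for illustration rather than copied from FLLex; the phone symbols actually used in the files may differ, so the DiaSim wiki remains the authority on the format.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class LexEntry:
    """One etymon/reflex pair (hypothetical container, not part of DiaSim)."""
    latin: List[str]   # phones of the Latin form
    french: List[str]  # phones of the French reflex
    comment: str       # lexical info that followed the '$' flag


def parse_fllex_line(line: str) -> LexEntry:
    """Parse a line shaped like 'p1 p2 ... , q1 q2 ... $ comment'."""
    # Everything after the first '$' is treated as the comment.
    body, _, comment = line.partition("$")
    # The comma separates the Latin form from the French reflex.
    latin_part, sep, french_part = body.partition(",")
    if not sep:
        raise ValueError(f"no comma delimiter in line: {line!r}")
    # Phones within each form are separated by whitespace.
    return LexEntry(latin_part.split(), french_part.split(), comment.strip())


# Invented sample line, assumed to follow the layout described in the abstract.
entry = parse_fllex_line("m uː r u m , m y ʁ $ mur")
print(entry.latin, entry.french, entry.comment)
```

Keeping each form as a list of phones, rather than a single string, means multi-character phone symbols such as a long vowel stay intact as one unit.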
Statistics for performance of each cascade on each lexicon (for the versions of them included in this repository) can be found toward the bottom of the README file included in this repository. This dataset is released so others can make use of it, but the relevant work in French diachrony with CFR is still an ongoing and developing project. 
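To make the notion of a cascade and of per-lexicon performance concrete, here is a toy sketch of an ordered list of sound changes applied to a tiny lexicon and scored against the attested reflexes. Everything in it is an assumption made for illustration: the four rules are drastically simplified and are not taken from BaseCLEF, BaseCLEFstar, or DiaCLEF, the two word pairs are invented examples, and exact-match accuracy is only one plausible metric, not necessarily the one reported in the README. It does not use DiaSim.

```python
from typing import Callable, List, Tuple

Phones = List[str]
Rule = Callable[[Phones], Phones]

SHORT_VOWELS = {"a", "e", "i", "o", "u"}


def drop_final_m(phones: Phones) -> Phones:
    """Toy rule: delete a word-final 'm'."""
    return phones[:-1] if phones and phones[-1] == "m" else phones


def drop_final_short_vowel(phones: Phones) -> Phones:
    """Toy rule: delete a word-final short vowel."""
    return phones[:-1] if phones and phones[-1] in SHORT_VOWELS else phones


def substitute(old: str, new: str) -> Rule:
    """Build a context-free toy rule that rewrites one phone as another."""
    def rule(phones: Phones) -> Phones:
        return [new if p == old else p for p in phones]
    return rule


def run_cascade(cascade: List[Rule], etymon: Phones) -> Phones:
    """Apply every rule in order, feeding each output to the next rule."""
    form = list(etymon)
    for rule in cascade:
        form = rule(form)
    return form


def accuracy(cascade: List[Rule], lexicon: List[Tuple[Phones, Phones]]) -> float:
    """Fraction of etyma whose predicted outcome equals the attested reflex."""
    hits = sum(run_cascade(cascade, lat) == fr for lat, fr in lexicon)
    return hits / len(lexicon)


# Invented two-word lexicon: (Latin phones, attested French phones).
TOY_LEXICON = [
    (["m", "uː", "r", "u", "m"], ["m", "y", "ʁ"]),
    (["l", "uː", "n", "a"], ["l", "y", "n"]),
]

TOY_CASCADE = [drop_final_m, drop_final_short_vowel,
               substitute("uː", "y"), substitute("r", "ʁ")]

# Same rules with the first two swapped, to show that ordering matters.
SWAPPED = [drop_final_short_vowel, drop_final_m,
           substitute("uː", "y"), substitute("r", "ʁ")]

print(accuracy(TOY_CASCADE, TOY_LEXICON))
print(accuracy(SWAPPED, TOY_LEXICON))
```

Swapping the first two rules leaves a stray final vowel on the first word, because the vowel is not yet word-final when the vowel-deletion rule runs. Ordering errors of this kind are what debugging a relative chronology aims to expose.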

Keywords

Computational Linguistics, Latin, Philology, Romance, Computerized Forward Reconstruction, French, FOS: Languages and literature, Linguistics, Historical linguistics, Phonology
