
doi: 10.5334/johd.42
The Corpus for Idiolectal Research (CIDRE) is a collection of fiction works from 11 prolific 19th-century French authors (4 women, 7 men; 22–62 works/author; total of 37 million words). Every work is dated with the year it was written. Using programming scripts, the works have been gathered from open source platforms, for example La Bibliothèque électronique du Québec, and stripped of paratext (text not being part of the novel, e.g. prefaces). We distribute the text files, the dating, other metadata and the programming scripts under an open source license. CIDRE is the first resource of French for the study of style and idiolect in a diachronic manner (i.e. stylochronometry) on a larger scale.
Littérature, French, idiolect, corpus linguistics, fiction, Idiolectal variation, [SHS.LITT] Humanities and Social Sciences/Literature, AZ20-999, stylochronometry, French langage, 19th - 20th century, diachrony, Language and Literature, literature, linguistics, P, [SHS.LANGUE] Humanities and Social Sciences/Linguistics, [INFO.INFO-CL] Computer Science [cs]/Computation and Language [cs.CL], stylometry, french, History of scholarship and learning. The humanities, [SHS] Humanities and Social Sciences
Littérature, French, idiolect, corpus linguistics, fiction, Idiolectal variation, [SHS.LITT] Humanities and Social Sciences/Literature, AZ20-999, stylochronometry, French langage, 19th - 20th century, diachrony, Language and Literature, literature, linguistics, P, [SHS.LANGUE] Humanities and Social Sciences/Linguistics, [INFO.INFO-CL] Computer Science [cs]/Computation and Language [cs.CL], stylometry, french, History of scholarship and learning. The humanities, [SHS] Humanities and Social Sciences
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 1 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
