Powered by OpenAIRE graph
Found an issue? Give us feedback
ZENODOarrow_drop_down
ZENODO
Preprint . 2026
License: CC BY
Data sources: Datacite
ZENODO
Preprint . 2026
License: CC BY
Data sources: Datacite
versions View all 2 versions
addClaim

Etymological Origins of First Names in France (1900–2024)

A Century of Cultural Transformation Measured Through Large-Scale AI Classification
Authors: Simon, Romain;

Etymological Origins of First Names in France (1900–2024)

Abstract

We present a comprehensive analysis of the etymological origins of all first names given at birth in France between 1900 and 2024, using the complete INSEE civil registry dataset (87 million births, 48,516 unique names). Each name was classified into one of 20 etymological origin categories using a large language model (Claude Haiku 4.5) operating as an automated onomastic classifier. Our analysis reveals four major structural shifts: (1) a sustained decline of names with Germanic etymological roots, from 28% of births in 1920 to 8% in 2024; (2) a collapse and partial recovery of Hebrew/Biblical names, peaking at 40% in 1946 before stabilizing at 23%; (3) a steady, quasi-linear rise of names with Arabic etymological origins from near-zero in 1950 to 16% in 2024; and (4) a monotonic increase in the Shannon diversity index of name origins across the full period. Monte Carlo projections (10,000 trajectories calibrated on 1990–2024 volatility) produce 90% intervals for 2050. The full classification dataset (48,516 name–origin mappings) and analysis code are included as supplementary materials. A shorter French-language version is included as an additional file. An interactive visualization of these results is available at https://yukicapital.com/french-first-names-origins

Keywords

first names, Artificial intelligence, INSEE, Sociology, etymology, AI classification, cultural change, onomastics, France, time series, Demography, FOS: Sociology

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
0
Average
Average
Average
Related to Research communities
Upload OA version
Are you the author of this publication? Upload your Open Access version to Zenodo!
It’s fast and easy, just two clicks!