
This archive documents a controlled comparative study of role-conditioned persona persistence across language models of varying scale, alignment regimes, and deployment environments. Using a deliberately constrained and anachronistic identity—a 1920s Underwood No. 5 mechanical typewriter—the study evaluates how different models maintain, degrade, or transform an imposed persona when exposed to identical prompts under fixed conditions. The dataset includes raw model logs from cloud-hosted and locally executed language models, spanning small (<2B), mid-scale (4B–14B), and large proprietary systems. Both prompted and unprompted runs are included. No outputs have been post-processed, corrected, normalized, or filtered. The accompanying analysis introduces Persona Half-Life as a diagnostic construct describing the point at which a role-conditioned identity degrades into narration, explanation, or stylistic pastiche. Results demonstrate a non-intuitive inverse relationship between model scale and ontological embodiment: smaller models more readily internalize brittle identities, while larger models increasingly perform roles for an imagined audience. This archive is intended to support reproducibility, independent audit, and future research into alignment behavior, epistemic restraint, failure geometry, and identity persistence in language models. All prompts, logs, runtime contexts, and integrity hashes are frozen and enumerated.
All files are provided exactly as generated during experimental runs. No edits, corrections, truncations, or formatting changes have been applied. MD5 checksums for every file are listed in the accompanying paper to support byte-level integrity verification. Reproducibility is defined at the behavioral-regime level rather than deterministic token matching. Independent re-runs using the same system prompt, user prompts, and runtime conditions reliably reproduce the same category-level outcomes (embodied, performative, literary). Local execution artifacts are included to expose failure modes masked by cloud-level normalization layers.
persona persistence role-conditioned prompts language model alignment failure geometry epistemic restraint AI evaluation local language models cloud language models prompt engineering identity persistence
persona persistence role-conditioned prompts language model alignment failure geometry epistemic restraint AI evaluation local language models cloud language models prompt engineering identity persistence
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
