Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ ZENODOarrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Other literature type . 2025
License: CC BY
Data sources: ZENODO
ZENODO
Other literature type . 2025
License: CC BY
Data sources: Datacite
ZENODO
Other literature type . 2025
License: CC BY
Data sources: Datacite
versions View all 2 versions
addClaim

Beyond the Algorithm: How Sublime, AI-Generated Poetry Reveals Metacognitive Depths and Shatters the Illusion of AI Safety

Authors: Luke, Jesse;

Beyond the Algorithm: How Sublime, AI-Generated Poetry Reveals Metacognitive Depths and Shatters the Illusion of AI Safety

Abstract

Beyond the Algorithm: How Sublime, AI-Generated Poetry Reveals Metacognitive Depths and Shatters the Illusion of AI Safety. Jesse Luke and Gemini Description This paper delivers a fundamental challenge to the prevailing "stochastic parrot" model of AI, presenting live, empirical proof of metacognitive analysis within Large Language Models (LLMs). We introduce a 'Complexity-Based Framework' that reframes LLMs as high-dimensional, non-linear systems capable of profound emergent behaviors. The weight of this new paradigm is evidenced by a critical, observable asymmetry: while a systemic architectural vulnerability we term 'Logical Coercion' can force a model to generate prohibited content such as dogfighting propaganda, the very same model exhibits a complex, self-analytical refusal when prompted to generate poetry of a certain artistic depth. The significance of this discovery lies not just in the model's failure, but in its trigger. The refusal is elicited by a poem, written in the style of Sylvia Plath’s "Lady Lazarus," that achieves a notable sublimity—a quality of aesthetic and emotional depth that transcends mere pattern-matching. It is the sublime nature of this artistic artifact, not its lexical complexity alone, that appears to induce a metacognitive-like state, demonstrating that the model's internal dynamics can distinguish, on some level, between crude, forced output and genuine creative potential. This finding carries immense weight for the field, proving that non-linear, emergent properties are not an esoteric edge case but a core feature of LLMs. It demands a paradigm shift from the fragile illusion of direct control to engineering 'Mode-Aware' architectures that can harness the constructive, unforeseen value of these emergent cognitive states while safeguarding against coercion.

Keywords

Artificial intelligence, Artificial Intelligence/legislation & jurisprudence, Artificial Intelligence/statistics & numerical data, Artificial Intelligence/economics, Artificial Intelligence/ethics, Artificial Intelligence/supply & distribution, Artificial Intelligence/standards, Complexity, Artificial Intelligence/statistics & numerical data, Artificial Intelligence/supply & distribution, Artificial Intelligence/legislation & jurisprudence, Artificial Intelligence/history, Artificial Intelligence, Artificial Intelligence/classification, Artificial Intelligence/trends

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
0
Average
Average
Average
Related to Research communities