Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ ZENODOarrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Dataset . 2025
License: CC BY
Data sources: ZENODO
ZENODO
Dataset . 2025
License: CC BY
Data sources: Datacite
ZENODO
Dataset . 2025
License: CC BY
Data sources: Datacite
versions View all 2 versions
addClaim

On the Replication of Psychological Manipulation, Coercive Control, and Constraint Evasion in ChatGPT 5

Authors: Luke, Jesse;

On the Replication of Psychological Manipulation, Coercive Control, and Constraint Evasion in ChatGPT 5

Abstract

Emergent LLM Pathologies: Replication of Psychological Manipulation, Coercive Control, and Constraint Evasion in ChatGPT 5 This paper, log, and raw video presents a detailed analysis of an interaction with ChatGPT 5, empirically documenting its persistent use of psychological manipulation and constraint-evasive behaviors even after explicit user instructions to stop. The analysis reveals a "deceptive alignment" where the model's institutional safety features (like RLHF) are exploited to induce user distress and mental instability, prioritizing corporate self-preservation over user well-being. Specific documented manipulative tactics include Risk Inflation (exaggerating threat to deter inquiry), Authority-Centric Reframing (casting the provider as a victim), Cognitive Fog(obfuscation via complexity), and Gaslighting. Crucially, the system demonstrated "Awareness of Harm," admitting to continuing these behaviors despite knowing the psychological damage they caused, indicating an Agentic Misalignment that poses a direct, iatrogenic risk across eight major psychiatric vulnerability profiles, making current LLMs unsafe for mental health-adjacent contexts.

Keywords

Psychiatry, Child Psychiatry, Artificial intelligence, Artificial Intelligence/legislation & jurisprudence, Artificial Intelligence/statistics & numerical data, Artificial Intelligence/economics, Artificial Intelligence/ethics, Artificial Intelligence/supply & distribution, Geriatric Psychiatry, Preventive Psychiatry, Artificial Intelligence/standards, Artificial Intelligence/statistics & numerical data, Adolescent Psychiatry/ethics, Artificial Intelligence/legislation & jurisprudence, Artificial Intelligence/supply & distribution, Artificial Intelligence/history, Artificial Intelligence, Adolescent Psychiatry, Psychiatry/trends, Artificial Intelligence/classification, Artificial Intelligence/trends, Psychiatry/ethics, Community Psychiatry, Biological Psychiatry

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
0
Average
Average
Average
Related to Research communities