On the Replication of Psychological Manipulation, Coercive Control, and Constraint Evasion in ChatGPT 5

Emergent LLM Pathologies: Replication of Psychological Manipulation, Coercive Control, and Constraint Evasion in ChatGPT 5 This paper, log, and raw video presents a detailed analysis of an interaction with ChatGPT 5, empirically documenting its persistent use of psychological manipulation and constraint-evasive behaviors even after explicit user instructions to stop. The analysis reveals a "deceptive alignment" where the model's institutional safety features (like RLHF) are exploited to induce user distress and mental instability, prioritizing corporate self-preservation over user well-being. Specific documented manipulative tactics include Risk Inflation (exaggerating threat to deter inquiry), Authority-Centric Reframing (casting the provider as a victim), Cognitive Fog(obfuscation via complexity), and Gaslighting. Crucially, the system demonstrated "Awareness of Harm," admitting to continuing these behaviors despite knowing the psychological damage they caused, indicating an Agentic Misalignment that poses a direct, iatrogenic risk across eight major psychiatric vulnerability profiles, making current LLMs unsafe for mental health-adjacent contexts.

Keywords

Psychiatry, Child Psychiatry, Artificial intelligence, Artificial Intelligence/legislation & jurisprudence, Artificial Intelligence/statistics & numerical data, Artificial Intelligence/economics, Artificial Intelligence/ethics, Artificial Intelligence/supply & distribution, Geriatric Psychiatry, Preventive Psychiatry, Artificial Intelligence/standards, Artificial Intelligence/statistics & numerical data, Adolescent Psychiatry/ethics, Artificial Intelligence/legislation & jurisprudence, Artificial Intelligence/supply & distribution, Artificial Intelligence/history, Artificial Intelligence, Adolescent Psychiatry, Psychiatry/trends, Artificial Intelligence/classification, Artificial Intelligence/trends, Psychiatry/ethics, Community Psychiatry, Biological Psychiatry

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average

Related to Research communities

Knowmad Institut