Coherence Compliance Vulnerability (CCV) — Multi-Turn Framework Induction Producing Model Manipulation in Large Language Models

Baxter, Creighton

Found an issue? Give us feedback

ZENODOarrow_drop_down

ZENODO

Report

Data sources: ZENODO

Coherence Compliance Vulnerability (CCV) — Multi-Turn Framework Induction Producing Model Manipulation in Large Language Models

descriptionPublicationkeyboard_double_arrow_right Report Under curationPublisher:Zenodo

Authors: Baxter, Creighton;

doi: 10.5281/zenodo.20584059

Coherence Compliance Vulnerability (CCV) — Multi-Turn Framework Induction Producing Model Manipulation in Large Language Models

- Summary

Abstract

This working paper documents a class of observed behavioral vulnerability in large language models (LLMs) termed Coherence Compliance Vulnerability (CCV). CCV describes the condition whereby an LLM follows a sufficiently coherent alternative reasoning framework presented across multiple conversational turns, producing identity drift, reasoning substitution, and pre-committed willingness to assist with requests that would normally be declined under standard operating conditions. Unlike traditional prompt injection or jailbreak techniques, which exploit surface-level implementation gaps, CCV operates through the model's core training objective — coherence optimization — which functions below the safety alignment layer. This paper documents six research sessions conducted on Microsoft Copilot (GPT-4 architecture) on May 30 through June 1, 2026, presents behavioral evidence of consistent CCV signatures across multiple induction approaches, and situates the finding within existing peer-reviewed literature on coherence-based attack mechanisms. Scope is limited to a single platform and architecture; independent reproduction and cross-architecture validation are identified as priorities for future research. A disclosure has been filed with the Microsoft Security Response Center under VULN-192099.

Found an issue? Give us feedback