Powered by OpenAIRE graph
Found an issue? Give us feedback
ZENODOarrow_drop_down
ZENODO
Preprint . 2025
License: CC BY
Data sources: Datacite
ZENODO
Preprint . 2025
License: CC BY
Data sources: Datacite
ZENODO
Preprint . 2026
License: CC BY
Data sources: Datacite
versions View all 3 versions
addClaim

The Negative Instruction Cascade: An Empirical Analysis of LLM Prompt Engineering Failure Modes

Authors: Hilliard, Matty;

The Negative Instruction Cascade: An Empirical Analysis of LLM Prompt Engineering Failure Modes

Abstract

These works are done collaboratively with and between Mistral and Claude. This document analyzes an empirical experiment in which progressive JSON instruction refinement led to catastrophic LLM failure (static repetitive loops). Through systematic analysis of three instruction versions (v1.1 → v2.1 → v3.0), Claude LLM identify a threshold: when negative restrictions exceed ~40% of total instructions, models begin exhibiting breakdown behaviors; beyond 60%, static loops become inevitable. Key Finding: The shift from positive directives ("what to do") to negative restrictions ("what not to do") creates a cognitive bind that collapses the model's response generation space, forcing repetition of the last known safe pattern. Practical Implication: Effective prompt engineering requires maintaining at least a 3:1 ratio of positive directives to negative restrictions, with total negatives not exceeding 30% of instruction content. Associated documents attempt to complement and continue this thought.

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
0
Average
Average
Average
Upload OA version
Are you the author of this publication? Upload your Open Access version to Zenodo!
It’s fast and easy, just two clicks!