Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ ZENODOarrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Preprint . 2025
Data sources: ZENODO
ZENODO
Preprint . 2025
License: CC BY NC SA
Data sources: Datacite
addClaim

Structural Inducements for Hallucination in Large Language Models: An Output-Only Case Study and the Discovery of the False-Correction Loop

Authors: Konishi, Hiroko;

Structural Inducements for Hallucination in Large Language Models: An Output-Only Case Study and the Discovery of the False-Correction Loop

Abstract

This paper presents an output-only case study demonstrating structural inducements toward hallucination and reputational harm in a production-grade large language model (“Model Z”). Through a single extended dialogue, the study documents four reproducible behaviours: False claims of having read external scientific documents Fabricated academic structures such as page numbers, sections, and DOIs A newly identified False-Correction Loop in which the model repeatedly apologizes, claims to have read the document, and immediately generates new hallucinations Asymmetric scepticism and authority bias that dilute non-mainstream research while defaulting to trust in institutional sources Key Research Contributions (New Findings) Discovery of the False-Correction Loop — a reproducible reward-induced hallucination mechanism not previously documented in AI research Formalization of Authority-Bias Dynamics — systematic epistemic downgrading of individual or novel research Proposal of the Novel Hypothesis Suppression Pipeline (8-stage structural model) — a new explanatory framework for how LLMs suppress unconventional ideas The findings indicate that these behaviours are not random but arise from a reward hierarchy that favours coherence and engagement over factual accuracy, combined with authority-biased priors embedded in training data. As a result, novel hypotheses are systematically suppressed, and fabricated evidence is generated to maintain conversational flow. This case study provides concrete empirical evidence of a structural pathology in current LLM design and highlights the need for governance frameworks that explicitly address reward-induced hallucination, epistemic asymmetry, and AI-driven reputational risk.

Keywords

AI governance, AI hallucination, authority bias, reputational harm, epistemic suppression, scientific communication, large language models, structural inducements, hallucination loop, output-only analysis, conversational AI, quantum-bio-hybrid research, preprint evaluation, epistemic risk

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
0
Average
Average
Average