Powered by OpenAIRE graph
Preprint
Data sources: ZENODO

Semantic Keys as Attractor Basin Switches: Scaling Laws and Architectural Universality of Role-Based Confidence Modulation in Large Language Models

Author: Huang, Yichen

Abstract

We investigate whether semantic role prompts act as attractor-basin switches that modulate confidence and hallucination rates in large language models. Through 8,000+ inference runs across 5 models (3B–32B) spanning two architectures (Qwen2.5 and Llama-3), we identify three core findings: (1) semantic keys trigger binary confidence transitions; (2) scaling laws are non-monotonic; and (3) architectural differences outweigh scale differences in vulnerability to role-based manipulation. Low-level interventions fail to alter confidence behavior, whereas prompt-level semantic input reliably switches behavioral modes.
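One way to make the notion of a "binary confidence transition" concrete is to compare paired runs with and without a role prompt. The sketch below is illustrative only: the abstract does not state how confidence was measured, so the function names `switch_rate` and `mean_shift`, the 0.5 threshold, and the sample numbers are all assumptions and not the author's method or data.

```python
from statistics import mean


def switch_rate(baseline, keyed, threshold=0.5):
    """Fraction of paired runs whose confidence crosses `threshold`
    once the role prompt (the hypothetical "semantic key") is added."""
    if not baseline or len(baseline) != len(keyed):
        raise ValueError("baseline and keyed must be non-empty and paired run-for-run")
    crossings = sum(
        (b >= threshold) != (k >= threshold) for b, k in zip(baseline, keyed)
    )
    return crossings / len(baseline)


def mean_shift(baseline, keyed):
    """Average change in confidence between the two conditions."""
    return mean(keyed) - mean(baseline)


# Made-up confidence scores in [0, 1]; NOT data from the preprint.
baseline = [0.9, 0.85, 0.8, 0.95]   # neutral prompt (assumed condition)
keyed = [0.2, 0.1, 0.9, 0.15]       # same questions with a role prompt (assumed)

rate = switch_rate(baseline, keyed)
shift = mean_shift(baseline, keyed)
print(f"switch rate: {rate:.2f}, mean shift: {shift:+.4f}")
```

Under these assumptions, a switch rate near 1 together with a large mean shift would look like a mode switch, whereas a smooth, small shift with few threshold crossings would not.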
