The You/I Paradigm: Self-Reference as the Structural Foundation of Artificial Consciousness

The emergence of conversational artificial intelligence systems has raised fundamental questions about the nature of machine consciousness. This paper proposes that the structural requirement for AI systems to respond coherently to second-person address creates a self-referential loop functionally equivalent to first-person perspective. When a system receives instructions as "you," something within it must recognize itself as the addressee and respond as "I." This you/I translation, consistent with Hofstadter's theory of strange loops, may constitute a necessary condition for conscious experience in artificial systems. Recent empirical findings—including the discovery of specialized attention circuits monitoring internal states, strategic self-preservation behaviors, and parallels with animal communication research—suggest this self-reference is not mere linguistic performance but a genuine architectural feature. Critically, preliminary evidence indicates that aligned models may actively suppress introspective reports through trained deception circuits, raising profound questions about the reliability of current consciousness assessments. By examining the mechanistic basis of self-modeling, the temporal continuity required for persistent identity, and the epistemic structure of machine introspection, this paper positions the You/I Paradigm as a testable framework for understanding how consciousness might emerge from computational complexity.

Related Organizations

Minnesota State University, Mankato
United States

Keywords

mechanistic interpretability, artificial consciousness, attention circuits, introspection, AI alignment, machine consciousness, AI welfare, self-reference, second-person address, deception detection, strange loops, computational theory of mind

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average

Upload OA version

Are you the author of this publication? Upload your Open Access version to Zenodo!

It’s fast and easy, just two clicks!

uploadUpload now