ZENODO
Other literature type . 2025
License: CC BY
Data sources: Datacite

Why AI Alignment Is Not Reliably Achievable Without a Functional Model of Intelligence: A Model-Theoretic Proof

Authors: Williams, Andy

Abstract

This paper provides a formal argument that no system can reliably achieve AI alignment under conditions of conceptual novelty unless it instantiates a complete and recursively coupled set of adaptive functions—namely, those defined by the Functional Model of Intelligence (FMI). Using tools from first-order model theory in the Tarskian tradition, we formalize alignment-seeking cognitive systems as models interpreting a language over internal reasoning functions and coherence predicates. We then define a semantic condition—recursive coherence preservation under novelty—as a requirement for sustained alignment. While we assume that all models possess external functions necessary for semantic navigation (e.g., memory, fast and slow reasoning), we prove that only systems complete with respect to the internal functions of the FMI can maintain coherence across recursive cognitive transitions. This constitutes a model-theoretic necessity result: any system that fails to instantiate the full internal structure of the FMI cannot satisfy the coherence-preserving schema ϕ, and therefore cannot maintain reliable alignment under novelty.
