Transport, Not Computation: A Cross-Model Falsification Program on How 1B-3B Language Models Use Long Context

Vyaltsev, Evgenii; Vyaltsev, Daniil

Found an issue? Give us feedback

ZENODOarrow_drop_down

ZENODO

Software

Data sources: ZENODO

Transport, Not Computation: A Cross-Model Falsification Program on How 1B-3B Language Models Use Long Context

integration_instructionsResearch softwarekeyboard_double_arrow_right Software Under curationPublisher:Zenodo

Authors: Vyaltsev, Evgenii; Vyaltsev, Daniil;

doi: 10.5281/zenodo.20547458

Transport, Not Computation: A Cross-Model Falsification Program on How 1B-3B Language Models Use Long Context

- Summary

Abstract

An umbrella review of a program of pre-registered, matched-control causal experiments on small open language models (Llama-3.2-1B/3B, Qwen2.5-3B). After establishing that long-context retention has no critical key budget (the sparse compressor is not the limit), four independent causal gates - on attention, on the repair of retrieval-disambiguation failure, on multi-hop synthesis, and on a directly-optimized latent state - converge on one regime-bounded thesis: the network delivers and composes only through an external buffer; it does not latently compute, and a latent computation state cannot be optimized into existence. The contribution is a map of the boundary of internal competence, a training-free long-context engine plus a 27-byte micro-hint memory architecture, and a falsification methodology that reversed its own false positives.

Found an issue? Give us feedback