Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ ZENODOarrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Report . 2026
License: CC BY
Data sources: ZENODO
ZENODO
Report . 2026
License: CC BY
Data sources: Datacite
ZENODO
Report . 2026
License: CC BY
Data sources: Datacite
versions View all 2 versions
addClaim

Loss of Storage Truth in Virtual Distributed Filesystems: A Formal Critique of CAP and Coordination-Centric Storage

Authors: Nekludoff, Alexey A.;

Loss of Storage Truth in Virtual Distributed Filesystems: A Formal Critique of CAP and Coordination-Centric Storage

Abstract

Modern storage systems increasingly rely on virtualization and distributed coordination to achieve scalability and fault tolerance. While these approaches offer operational flexibility, they also introduce failure modes not captured by traditional performance or availability metrics. This paper introduces the concept of storage truth as a foundational property of storage systems, defined as the authoritative existence and addressability of data independent of transient access conditions. We show that virtual, network-centric storage architectures transform storage truth from a localized physical fact into a model-derived property dependent on continuous coordination. This shift enables a distinct failure mode, termed loss of storage truth, in which data remains physically intact yet becomes operationally inaccessible due to degradation in auxiliary subsystems, most notably network coordination. Using a real-world production incident involving Ceph Software-Defined Storage, the paper demonstrates how minor network degradation can escalate into prolonged loss of service without data loss. Ceph is analyzed as a structurally faithful realization of CAP-style reasoning applied to storage, illustrating how embedding consensus into storage correctness amplifies partial failures into system-wide outages.The paper critically examines CAP-based design principles in storage systems, showing that defining correctness through coordination rather than physical persistence subordinates storage truth to network state. This design choice is shown to be catastrophic: transient failures outside the storage medium itself can render data operationally nonexistent. Such systems do not fail gracefully; they fail ontologically, as demonstrated by the analyzed production incident. Finally, the paper formulates an alternative principle of storage design based on locality of truth, separation of truth and access, and eviction over suspension. Practical observations indicate that truthpreserving architectures, such as ZFS combined with iSCSI for distributed access, currently provide superior operational resilience for storage serving as a substrate for virtualized infrastructure. The central implication of this analysis is that architectural reasoning which treats CAP-style tradeoffs as fundamental reflects an insufficient level of professional qualification for the design of distributed storage systems, where correctness is defined by preservation of truth rather than coordination symmetry.

Keywords

network-centric storage, failure semantics, distributed storage systems, Ceph,, storage architecture, CAP theorem, storage truth, fault tolerance, virtualized infrastructure, virtual storage, CAP

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
0
Average
Average
Average