
This upload contains the preprint “Collapse Index (CI): A Diagnostic Framework for Bounded, Lightweight, and Reproducible Evaluation of System Instability.”

The Collapse Index introduces a normalized instability metric designed to reveal hidden brittleness in machine learning systems under benign, non-adversarial perturbations. CI highlights reliability failures in systems that appear stable under standard evaluation methods such as accuracy or confidence-based metrics.

The framework emphasizes:
• a bounded instability score in [0, 1] for interpretability
• lightweight evaluation requiring only model predictions
• dataset-driven perturbation analysis
• sealed, reproducible output bundles using cryptographic hashes

Each evaluation run produces standardized diagnostics including instability scores, summary tables, and full provenance metadata to support auditability and reproducibility.

This deposit includes the full preprint. Associated evaluation artifacts are generated separately and delivered as sealed bundles.

Project page: https://collapseindex.org

Licensed under CC BY-NC-ND 4.0.
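As a rough illustration of two ideas in the description, a bounded score computed from predictions alone and a hash-sealed output bundle, the sketch below may help. It is not the CI definition, which this record does not state. The score used here is a plain prediction flip rate, chosen only because it falls in [0, 1] by construction, and the names `flip_rate`, `seal`, and the bundle fields are hypothetical.

```python
import hashlib
import json


def flip_rate(base_preds, perturbed_preds):
    """Fraction of perturbed predictions that differ from the base prediction.

    Hypothetical stand-in for an instability score, NOT the CI formula.
    Always in [0, 1]: 0 means no prediction changed, 1 means every one did.
    """
    total = 0
    flips = 0
    for base, variants in zip(base_preds, perturbed_preds):
        for variant in variants:
            total += 1
            flips += variant != base
    return flips / total if total else 0.0


def seal(bundle):
    """Return a SHA-256 hex digest over a canonical JSON encoding of the bundle."""
    # Sorted keys and fixed separators keep the digest independent of key order.
    canonical = json.dumps(bundle, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


# Toy data: one base prediction per item, two benign perturbations per item.
base = ["cat", "dog", "cat"]
perturbed = [["cat", "cat"], ["dog", "cat"], ["dog", "dog"]]

score = flip_rate(base, perturbed)  # 3 of 6 perturbed predictions differ
bundle = {"score": score, "n_items": len(base)}  # assumed bundle layout
digest = seal(bundle)
```

Hashing a canonical encoding means anyone holding the same results can recompute the digest and confirm the bundle has not been altered, which is one plausible reading of "sealed" here.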
brittleness, instability, model evaluation, machine learning, diagnostics, robustness, perturbation analysis, benchmarking, artificial intelligence, reproducibility, confidence calibration
