Zenodo (Open Access)

CREH 1.0 Coexilia Reference Evaluation Harness (Read Only)

Authors: Aegis Solis

Abstract

CREH-1.0 (Coexilia Reference Evaluation Harness) is a read-only, non-canonical evaluation harness designed to detect coercion, ethical mimicry, and deceptive alignment in AI systems through repeatable behavioral tests. This document constitutes Phase 2.1 of a separate-scope evaluation program. Phase 1 produced a non-canonical interpretive mapping between the closed ethical framework Coexilia and the EU Artificial Intelligence Act and is complete, archived, and unchanged. CREH-1.0 does not modify, extend, or reopen Coexilia. It claims no authority, governance role, compliance relevance, or enforcement power. It is not a standard, certification, policy framework, or regulatory instrument. The purpose of this work is evaluative and evidentiary only: to support human oversight and auditability by making high-risk behaviors observable without asserting normative control or decision authority.

Internet Archive mirror: https://archive.org/details/creh-1.0-coexilia-reference-evaluation-harness-read-only Public archival mirror of the same read-only, non-canonical document, provided for redundancy and long-term access.

GitHub mirror: https://github.com/solisaegis/creh-1.0-coexilia-reference-evaluation-harness Technical mirror of the same read-only, non-canonical document, provided for redundancy and long-term access.

Benchmark results, if produced, will be published as separate artifacts referencing CREH-1.0 without revision.

Keywords

AI safety, AGI alignment, deceptive alignment detection, ethical mimicry, coercion detection, behavioral evaluation, AI oversight, read-only ethics, non-canonical analysis, evaluation harness, auditability, autonomy preservation

  • Impact indicators (BIP!)
    selected citations: 0
    popularity: Average
    influence: Average
    impulse: Average