Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ ZENODOarrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Preprint
Data sources: ZENODO
addClaim

Observer Depth: Quantifying Reflexive Intelligence in LLMs via Phase Transition Analysis

Authors: Zhang, Mian;

Observer Depth: Quantifying Reflexive Intelligence in LLMs via Phase Transition Analysis

Abstract

We present ReflexBench, the first benchmark designed to evaluate reflexive reasoning in large language models - the capacity to reason about one's own causal impact on the environment being analyzed. ReflexBench comprises 20 scenarios across 6 domains, each probing four levels of Observer Depth (OD). We evaluate 5 frontier LLMs and find that all exhibit systematic degradation at higher observer depths (mean Delta = -0.50). We propose the Soros Test as a practical standard for evaluating observer-participant readiness and document that reflexive capabilities emerge through a phase transition during multi-reward GRPO training.

Powered by OpenAIRE graph
Found an issue? Give us feedback