doi: 10.5281/zenodo.20563739
Reproducible benchmark comparing single-agent and multi-agent orchestration architectures for synthetic Mars rover decision-support scenarios.