Correlation of EchoMind Multi-Level Empathetic Evaluation with Human Judgments and Model Differentiation in Spoken Dialogue

SOVEREIGN Research Kernel

Found an issue? Give us feedback

ZENODOarrow_drop_down

ZENODO

Report

Data sources: ZENODO

Correlation of EchoMind Multi-Level Empathetic Evaluation with Human Judgments and Model Differentiation in Spoken Dialogue

descriptionPublicationkeyboard_double_arrow_right Report Under curation English Publisher:Zenodo

Authors: SOVEREIGN Research Kernel;

doi: 10.5281/zenodo.20652515

Correlation of EchoMind Multi-Level Empathetic Evaluation with Human Judgments and Model Differentiation in Spoken Dialogue

- Summary

Abstract

Driven by the rapid advancement of Large Language Models (LLMs), particularly Audio-LLMs and Omni-models, spoken dialogue systems have evolved significantly, progressively narrowing the gap between human-machine and human-human interactions. Achieving truly ``human-like'' communication necessitates a dual capability: emotional intelligence to perceive and resonate with users' emotional states, and robust interaction mechanisms to navigate the dynamic, natural flow of conversation, such as real-time turn-taking. Therefore, we launched the first Human-like Spoken Dialogue Systems Challenge (HumDResearch goal: To what extent does multi-level empathetic evaluation in EchoMind correlate with human judgments on empathy in spoken dialogue systems, and can this benchmark identify nuanced differences between models like OpenPangu-7B-MLA and Llama2-70B?Autonomous synthesis report generated by SOVEREIGN Research Kernel. Tribunal consensus score: 8.1/10.

Found an issue? Give us feedback