Calibration and Uncertainty for multiRater Volume Assessment in multiorgan Segmentation (CURVAS) challenge results

Name: Calibration and Uncertainty for multiRater Volume Assessment in multiorgan Segmentation (CURVAS) challenge results
Keywords: Multi-class image segmentation, [INFO.INFO-AI] Computer Science [cs]/Artificial Intelligence [cs.AI], FOS: Computer and information sciences, [SDV.IB.IMA] Life Sciences [q-bio]/Bioengineering/Imaging, Multiple expert annotations, Computer Vision and Pattern Recognition (cs.CV), Calibration, Uncertainty, [INFO.INFO-IM] Computer Science [cs]/Medical Imaging, Computer Vision and Pattern Recognition

Riera-Marín, Meritxell; O.K., Sikha; Rodríguez-Comas, Júlia; May, Matthias Stefan; Pan, Zhaohong; Zhou, Xiang; Liang, Xiaokun; Erick, Franciskus Xaverius; Prenner, Andrea; Hémon, Cédric; Boussot, Valentin; Dillenseger, Jean-Louis; Nunes, Jean-Claude; Qayyum, Abdul; Mazher, Moona; Niederer, Steven; Kushibar, Kaisar; Martín-Isla, Carlos; Radeva, Petia; Lekadir, Karim; Barfoot, Theodore; Garcia Peraza Herrera, Luis; Glocker, Ben; Vercauteren, Tom; Gago, Lucas; Englemann, Justin; Kleiss, Joy-Marie; Aubanell, Anton; Antolin, Andreu; García-López, Javier; González Ballester, Miguel; Galdrán, Adrián

arXiv.org e-Print Archive

Preprint . 2025

Data sources: arXiv.org e-Print Archive

HAL-Rennes 1

Article . 2025

License: CC BY

Data sources: HAL-Rennes 1

King's Research Portal

Article . 2025

License: CC BY

Data sources: King's Research Portal

Computers in Biology and Medicine

Article . 2025 . Peer-reviewed

License: Elsevier TDM

Data sources: Crossref

https://dx.doi.org/10.48550/ar...

Article . 2025

License: CC BY NC SA

Data sources: Datacite

DBLP

Article

Data sources: DBLP

Publication type: Article, Preprint
Date: 01 Oct 2025
Embargo end date: 01 Jan 2025
Countries: France, United Kingdom
Language: English
Publisher: Elsevier BV
Journal: Computers in Biology and Medicine, volume 197, page 111024 (ISSN: 0010-4825)
Funded by: DFG | unidentified, ANR | VATSop

doi: 10.1016/j.compbiomed.2025.111024 , 10.48550/arxiv.2505.08685

arXiv: 2505.08685

Abstract

Deep learning (DL) has become the dominant approach for medical image segmentation, yet ensuring the reliability and clinical applicability of these models requires addressing key challenges such as annotation variability, calibration, and uncertainty estimation. This is why we created the Calibration and Uncertainty for multiRater Volume Assessment in multiorgan Segmentation (CURVAS) challenge, which highlights the critical role of multiple annotators in establishing a more comprehensive ground truth, emphasizing that segmentation is inherently subjective and that leveraging inter-annotator variability is essential for robust model evaluation. Seven teams participated in the challenge, submitting a variety of DL models evaluated using metrics such as the Dice Similarity Coefficient (DSC), the Expected Calibration Error (ECE), and the Continuous Ranked Probability Score (CRPS). By incorporating consensus and dissensus ground truth, we assess how DL models handle uncertainty and whether their confidence estimates align with true segmentation performance. Our findings reinforce the importance of well-calibrated models, as better calibration is strongly correlated with the quality of the results. Furthermore, we demonstrate that segmentation models trained on diverse datasets and enriched with pre-trained knowledge exhibit greater robustness, particularly in cases deviating from standard anatomical structures. Notably, the best-performing models achieved high DSC and well-calibrated uncertainty estimates. This work underscores the need for multi-annotator ground truth, thorough calibration assessments, and uncertainty-aware evaluations to develop trustworthy and clinically reliable DL-based medical image segmentation models.
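
As an illustration of the three metrics named in the abstract, the following is a minimal Python sketch using their common textbook definitions. This record does not state how the challenge computed them, so the function names (`dice`, `expected_calibration_error`, `crps_from_samples`), the binary-mask setting, the 10-bin ECE, and the sample-based CRPS estimator are all assumptions made for illustration, not the organizers' evaluation code.

```python
import numpy as np


def dice(pred, target):
    """Dice Similarity Coefficient between two binary masks."""
    pred = np.asarray(pred, dtype=bool)
    target = np.asarray(target, dtype=bool)
    denom = pred.sum() + target.sum()
    if denom == 0:
        return 1.0  # both masks empty: treated as perfect agreement (assumption)
    return 2.0 * np.logical_and(pred, target).sum() / denom


def expected_calibration_error(probs, labels, n_bins=10):
    """Binned ECE for binary foreground probabilities (hypothetical setup)."""
    probs = np.asarray(probs, dtype=float).ravel()
    labels = np.asarray(labels).ravel() >= 0.5
    pred = probs >= 0.5
    # Confidence is the probability assigned to the predicted class.
    conf = np.where(pred, probs, 1.0 - probs)
    correct = (pred == labels).astype(float)
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    bin_ids = np.clip(np.digitize(conf, edges[1:-1]), 0, n_bins - 1)
    ece = 0.0
    for b in range(n_bins):
        mask = bin_ids == b
        if mask.any():
            # Weight each bin's |accuracy - confidence| gap by its share of voxels.
            ece += mask.mean() * abs(correct[mask].mean() - conf[mask].mean())
    return ece


def crps_from_samples(samples, observation):
    """Sample-based CRPS estimate: E|X - y| - 0.5 * E|X - X'|."""
    samples = np.asarray(samples, dtype=float).ravel()
    term1 = np.abs(samples - observation).mean()
    term2 = 0.5 * np.abs(samples[:, None] - samples[None, :]).mean()
    return term1 - term2
```

For example, `dice([1, 1, 0, 0], [1, 0, 0, 0])` compares a two-voxel prediction with a one-voxel reference, and `crps_from_samples` could score a set of predicted organ volumes against a single reference volume. Lower ECE and CRPS values indicate better calibration, while higher DSC indicates better overlap.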

This challenge was hosted at MICCAI 2024.

Countries

France, United Kingdom

Related Organizations

King's College London (United Kingdom)
University College London (United Kingdom)
Inserm (France)
Imperial College London (United Kingdom)
University of Chinese Academy of Social Sciences (China (People's Republic of))

Keywords

Multi-class image segmentation, [INFO.INFO-AI] Computer Science [cs]/Artificial Intelligence [cs.AI], FOS: Computer and information sciences, [SDV.IB.IMA] Life Sciences [q-bio]/Bioengineering/Imaging, Multiple expert annotations, Computer Vision and Pattern Recognition (cs.CV), Calibration, Uncertainty, [INFO.INFO-IM] Computer Science [cs]/Medical Imaging, Abdominal CT, [SPI.SIGNAL] Engineering Sciences [physics]/Signal and Image processing

Impact by BIP!

Selected citations: 1. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
Popularity: Average. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
Influence: Average. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
Impulse: Average. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.

Green

Funded by

DFG | unidentified, ANR | VATSop

Related to Research communities

EUTOPIA Open Research Portal

UArctic