Deep learning with noisy labels in medical prediction problems: a scoping review

Publication: Article, Preprint

Date: 30 May 2024

Embargo end date: 01 Jan 2024

Language: English

Publisher: Oxford University Press (OUP)

Journal: Journal of the American Medical Informatics Association, volume 31, pages 1596-1607 (issn: 1067-5027, eissn: 1527-974X)

Funded by: NIH | Closing the loop with an ..., NSF | CAREER: Knowledge-enhance...

Authors: Yishu Wei; Yu Deng; Cong Sun; Mingquan Lin; Hongmei Jiang; Yifan Peng;

doi: 10.1093/jamia/ocae108 , 10.48550/arxiv.2403.13111

pmid: 38814164

pmc: PMC11187424

arXiv: 2403.13111

Abstract

Objectives: Medical research faces substantial challenges from noisy labels attributed to factors like inter-expert variability and machine-extracted labels. Despite this, the adoption of label noise management remains limited, and label noise is largely ignored. To this end, there is a critical need to conduct a scoping review focusing on the problem space. This scoping review aims to comprehensively review label noise management in deep learning-based medical prediction problems, which includes label noise detection, label noise handling, and evaluation. Research involving label uncertainty is also included.

Methods: Our scoping review follows the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines. We searched 4 databases: PubMed, IEEE Xplore, Google Scholar, and Semantic Scholar. Our search terms include “noisy label AND medical/healthcare/clinical,” “uncertainty AND medical/healthcare/clinical,” and “noise AND medical/healthcare/clinical.”

Results: A total of 60 papers published between 2016 and 2023 met the inclusion criteria. A series of practical questions in medical research are investigated. These include the sources of label noise, the impact of label noise, the detection of label noise, label noise handling techniques, and their evaluation. A categorization of both label noise detection methods and handling techniques is provided.

Discussion: From a methodological perspective, we observe that the medical community has been up to date with the broader deep-learning community, given that most techniques have been evaluated on medical data. We recommend considering label noise as a standard element in medical research, even if it is not dedicated to handling noisy labels. Initial experiments can start with easy-to-implement methods, such as noise-robust loss functions, weighting, and curriculum learning.
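To make the recommendation about noise-robust loss functions more concrete, the following is a minimal sketch of one such loss. The abstract does not name a specific loss, so generalized cross-entropy is used here purely as an illustrative choice. The function names, the value q = 0.7, and the example logits are all assumptions made for illustration and do not come from the review.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(probs, label):
    """Standard cross-entropy for one example; grows without bound as probs[label] -> 0."""
    return -math.log(probs[label])


def generalized_cross_entropy(probs, label, q=0.7):
    """Generalized cross-entropy (1 - p^q) / q for one example, with 0 < q <= 1.

    q = 1 gives 1 - p (a mean-absolute-error-style loss); as q -> 0 the value
    approaches standard cross-entropy. The loss is bounded above by 1 / q, which
    limits how much a single mislabeled example can contribute.
    q = 0.7 is an illustrative default (assumption), not a value from the review.
    """
    if not 0.0 < q <= 1.0:
        raise ValueError("q must be in (0, 1]")
    return (1.0 - probs[label] ** q) / q


# Hypothetical model output that is confident in class 0.
logits = [4.0, 0.0, -1.0]
probs = softmax(logits)

# Suppose the recorded label is class 2, i.e. it disagrees with the model.
noisy_label = 2
ce_noisy = cross_entropy(probs, noisy_label)
gce_noisy = generalized_cross_entropy(probs, noisy_label)

print(f"cross-entropy on the disagreeing label:             {ce_noisy:.3f}")
print(f"generalized cross-entropy on the disagreeing label: {gce_noisy:.3f}")
```

Because the generalized loss is capped at 1 / q, an example whose label contradicts a confident prediction is penalized less heavily than under standard cross-entropy. Whether that trade-off helps on a given medical dataset is an empirical question; the sketch only shows the mechanism.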

Related Organizations

Joan and Sanford I. Weill Medical College of Cornell University
United States
University of Minnesota Morris
United States
Weill Cornell Medicine
United States
WEILL MEDICAL COLL OF CORNELL UNIV
Cornell University
United States

Keywords

FOS: Computer and information sciences, Computer Science - Machine Learning, Deep Learning, Biomedical Research, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Humans, Machine Learning (cs.LG)

Impact by BIP!

- selected citations: 8. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- popularity: Top 10%. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
- influence: Top 10%. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- impulse: Top 10%. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.