Mitigating Memorization in Sample Selection for Learning with Noisy Labels

Publication: Article, Preprint
Date: 01 Jan 2021
Embargo end date: 01 Jan 2021
Publisher: arXiv
Journal: CoRR, volume abs/2107.07041

Authors: Kyeongbo Kong; Junggi Lee; Youngchul Kwak; Young-Rae Cho; Seong-Eun Kim; Woo-Jin Song

doi: 10.48550/arxiv.2107.07041

arXiv: 2107.07041

Abstract

Because deep learning is vulnerable to noisy labels, sample selection techniques, which train networks using only cleanly labeled data, have attracted great attention. However, if the labels are dominantly corrupted by a few classes (such samples are called dominant-noisy-labeled samples), the network also learns these dominant-noisy-labeled samples rapidly via content-aware optimization. In this study, we propose a compelling criterion to penalize dominant-noisy-labeled samples intensively through class-wise penalty labels. By averaging prediction confidences for each observed label, we obtain suitable penalty labels that have high values if the labels are largely corrupted by some classes. Experiments were performed using benchmarks (CIFAR-10, CIFAR-100, Tiny-ImageNet) and real-world datasets (ANIMAL-10N, Clothing1M) to evaluate the proposed criterion in various scenarios with different noise rates. Using the proposed sample selection, the learning process of the network becomes significantly more robust to noisy labels than existing methods under several noise types.

14 pages, 9 figures, spotlight presented at the ICML 2021 Workshop on Subset Selection in ML

Related Organizations

Pohang University of Science and Technology
Korea (Republic of)
Seoul National University of Science and Technology
Korea (Republic of)

Keywords

FOS: Computer and information sciences, Computer Science - Machine Learning, Machine Learning (cs.LG)

Related research

Boosting Co-teaching with Compression Regularization for Label Noise (2021)
Jigsaw-ViT: Learning jigsaw puzzles in vision transformer (2023)
SSR: An Efficient and Robust Framework for Learning with Unknown Label Noise (2021)

Impact by BIP!

selected citations (citations derived from selected sources): 0
popularity (the "current" impact or attention of an article in the research community at large, based on the underlying citation network): Average
influence (the overall impact of an article in the research community at large, based on the underlying citation network, diachronically): Average
impulse (the initial momentum of an article directly after its publication, based on the underlying citation network): Average