Searching to Exploit Memorization Effect in Deep Learning With Noisy Labels

Name: Searching to Exploit Memorization Effect in Deep Learning With Noisy Labels
Keywords: Optimization, Nonconvex optimization, Schedules, 000, Noise measurement, Label-noise learning, Training, Deep learning, Semisupervised learning, Automated machine learning (AutoML)

Hansi Yang; Quanming Yao; Bo Han; James T. Kwok

Found an issue? Give us feedback

IEEE Transactions on...arrow_drop_down

IEEE Transactions on Pattern Analysis and Machine Intelligence

Article . 2024 . Peer-reviewed

License: IEEE Copyright

Data sources: Crossref

https://pubmed.ncbi.nlm.nih.go...

Article

Data sources: Europe PubMed Central

The Hong Kong University of Science and Technology: HKUST Institutional Repository

Article . 2024

Data sources: Bielefeld Academic Search Engine (BASE)

Searching to Exploit Memorization Effect in Deep Learning With Noisy Labels

descriptionPublicationkeyboard_double_arrow_right Article 01 Dec 2024Publisher:Institute of Electrical and Electronics Engineers (IEEE)Journal:IEEE Transactions on Pattern Analysis and Machine Intelligence, volume 46, pages 7,833-7,849 (issn: 0162-8828, eissn: 1939-3539,

Copyright policy )

Authors: Hansi Yang; Quanming Yao; Bo Han; James T. Kwok;

doi: 10.1109/tpami.2024.3394552

pmid: 38683712

Searching to Exploit Memorization Effect in Deep Learning With Noisy Labels

- Summary
- Subjects
- Metrics

Abstract

Sample selection approaches are popular in robust learning from noisy labels. However, how to control the selection process properly so that deep networks can benefit from the memorization effect is a hard problem. In this paper, motivated by the success of automated machine learning (AutoML), we propose to control the selection process by bi-level optimization. Specifically, we parameterize the selection process by exploiting the general patterns of the memorization effect in the upper-level, and then update these parameters using predicting accuracy obtained from model training in the lower-level. We further introduce semi-supervised learning algorithms to utiilize noisy-labeled data as unlabeled data. To solve the bi-level optimization problem efficiently, we consider more information from the validation curvature by the Newton method and cubic regularization method. We provide convergence analysis for both optimization methods. Results show that while both methods can converge to an (approximately) stationary point, the cubic regularization method can find better local optimal than the Newton method with less time. Experiments on both benchmark and real-world data sets demonstrate that the proposed searching method can lead to significant improvements upon existing methods. Compared with existing AutoML approaches, our method is much more efficient on finding a good selection schedule.

Related Organizations

Tsinghua University
China (People's Republic of)
Hong Kong Baptist University
China (People's Republic of)
Hong Kong Polytechnic University
China (People's Republic of)
Hong Kong University of Science and Technology (香港科技大學)
China (People's Republic of)

Keywords

Optimization, Nonconvex optimization, Schedules, 000, Noise measurement, Label-noise learning, Training, Deep learning, Semisupervised learning, Automated machine learning (AutoML), Noise, Prediction algorithms, 004

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	2
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

2

Top 10%

Average

Upload OA version

Are you the author of this publication? Upload your Open Access version to Zenodo!

It’s fast and easy, just two clicks!

uploadUpload now