Student Loss: Towards the Probability Assumption in Inaccurate Supervision

descriptionPublicationkeyboard_double_arrow_right Article , Other literature type 17 Mar 2023 United Kingdom, Malaysia Publisher:Institute of Electrical and Electronics Engineers (IEEE)Journal:IEEE Transactions on Pattern Analysis and Machine Intelligence, volume 46, pages 4,460-4,475 (issn: 0162-8828, eissn: 1939-3539,

Copyright policy )

Authors: Shuo Zhang 0030; Jianqing Li 0002; Hamido Fujita; Yu-Wen Li 0002; Deng-Bao Wang; Tingting Zhu 0001; Min-Ling Zhang; +1 Authors

doi: 10.36227/techrxiv.22258612 , 10.1109/tpami.2024.3357518 , 10.36227/techrxiv.22258612.v1 , 10.60692/nf19d-j3916 , 10.60692/v3xzk-jq284 , 10.60692/8gwtd-kc285 , 10.60692/1n9ay-a3b30

pmid: 38261485

Student Loss: Towards the Probability Assumption in Inaccurate Supervision

- Summary
- Subjects
- Metrics

Abstract

<p>Noisy labels are often encountered in datasets, but learning with them is challenging. Although natural discrepancies between clean and mislabeled samples in a noisy category exist, most techniques in this field still gather them indiscriminately, which leads to their performances being partially robust. In this paper, we reveal both empirically and theoretically that the learning robustness can be improved by assuming deep features with the same labels follow a student distribution, resulting in a more intuitive method called student loss. By embedding the student distribution and exploiting the sharpness of its curve, our method is naturally data-selective. This ability makes clean samples aggregate tightly in the center, while mislabeled samples scatter, even if they share the same label. Additionally, we employ the metric learning strategy and develop a large-margin student (LT) loss for better capability. It should be noted that our approach is the first work that adopts the prior probability assumption in feature representation to decrease the contributions of mislabeled samples. This strategy can enhance various losses to join the student loss family, even if they have been robust losses. Experiments demonstrate that our approach is more effective in inaccurate supervision. Enhanced LT losses significantly outperform various state-of-the-art methods in most cases. Even huge improvements of over 50\% can be obtained under certain conditions. An implementation of the main codes is available at https://github.com/Zhangshuojackpot/Student-Loss.</p>

Countries

United Kingdom, Malaysia

Related Organizations

HUTECH University
Viet Nam
Malaysia University of Science and Technology
Malaysia
Ho Chi Minh City University of Technology
Viet Nam
University of Technology Malaysia
Malaysia
State Key Laboratory of Digital Medical Engineering
China (People's Republic of)

View all View all

Keywords

LB Theory and practice of education, Composite material, Artificial intelligence, Outlier Detection, Metric (unit), 330, Robustness (evolution), Pattern recognition (psychology), Biochemistry, Gene, Learning with Noisy Labels in Machine Learning, Anomaly Detection in High-Dimensional Data, Engineering, Artificial Intelligence, Automated Analysis of Blood Cell Images, Meta-Learning, Margin (machine learning), Machine learning, Noisy Labels, Positive and Unlabeled Data, Computer science, Materials science, Chemistry, Operations management, Aggregate (composite), Computer Science, Physical Sciences, Computer Vision and Pattern Recognition, Robust Learning, Embedding

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	1
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

1

Average

Green

hybrid

Related to Research communities

UArctic