Less Learn Shortcut: Analyzing and Mitigating Learning of Spurious Feature-Label Correlation

descriptionPublicationkeyboard_double_arrow_right Article , Preprint , Conference object 01 Aug 2023Embargo end date: 01 Jan 2022Publisher:International Joint Conferences on Artificial Intelligence OrganizationJournal:Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence

Authors: Yanrui Du; Jing Yan 0004; Yan Chen; Jing Liu 0022; Sendong Zhao; Qiaoqiao She; Hua Wu 0003; +2 Authors

doi: 10.24963/ijcai.2023/560 , 10.48550/arxiv.2205.12593

arXiv: 2205.12593

Less Learn Shortcut: Analyzing and Mitigating Learning of Spurious Feature-Label Correlation

- Summary
- Subjects
- Related research
  (4)
- Metrics

Abstract

Recent research has revealed that deep neural networks often take dataset biases as a shortcut to make decisions rather than understand tasks, leading to failures in real-world applications. In this study, we focus on the spurious correlation between word features and labels that models learn from the biased data distribution of training data. In particular, we define the word highly co-occurring with a specific label as biased word, and the example containing biased word as biased example. Our analysis shows that biased examples are easier for models to learn, while at the time of prediction, biased words make a significantly higher contribution to the models' predictions, and models tend to assign predicted labels over-relying on the spurious correlation between words and labels. To mitigate models' over-reliance on the shortcut (i.e. spurious correlation), we propose a training strategy Less-Learn-Shortcut (LLS): our strategy quantifies the biased degree of the biased examples and down-weights them accordingly. Experimental results on Question Matching, Natural Language Inference and Sentiment Analysis tasks show that LLS is a task-agnostic strategy and can improve the model performance on adversarial data while maintaining good performance on in-domain data.

Related Organizations

Harbin Institute of Technology
China (People's Republic of)

Keywords

FOS: Computer and information sciences, Computer Science - Computation and Language, Computation and Language (cs.CL)

4 Research products, page 1 of 1

bert software on GitHub
IsRelatedTo
Chinese-BERT-wwm software on GitHub
IsRelatedTo
ERNIE software on GitHub
IsRelatedTo
lac software on GitHub
IsRelatedTo

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	4
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average