Augmented Negative Sampling for Collaborative Filtering

descriptionPublicationkeyboard_double_arrow_right Article , Preprint , Conference object 14 Sep 2023Embargo end date: 01 Jan 2023Publisher:ACMJournal:Proceedings of the 17th ACM Conference on Recommender Systems

Authors: Yuhan Zhao 0001; Rui Chen 0012; Riwei Lai; Qilong Han; Hongtao Song; Li Chen 0009;

doi: 10.1145/3604915.3608811 , 10.48550/arxiv.2308.05972

arXiv: 2308.05972

Augmented Negative Sampling for Collaborative Filtering

- Summary
- Subjects
- Metrics

Abstract

Negative sampling is essential for implicit-feedback-based collaborative filtering, which is used to constitute negative signals from massive unlabeled data to guide supervised learning. The state-of-the-art idea is to utilize hard negative samples that carry more useful information to form a better decision boundary. To balance efficiency and effectiveness, the vast majority of existing methods follow the two-pass approach, in which the first pass samples a fixed number of unobserved items by a simple static distribution and then the second pass selects the final negative items using a more sophisticated negative sampling strategy. However, selecting negative samples from the original items is inherently restricted, and thus may not be able to contrast positive samples well. In this paper, we confirm this observation via experiments and introduce two limitations of existing solutions: ambiguous trap and information discrimination. Our response to such limitations is to introduce augmented negative samples. This direction renders a substantial technical challenge because constructing unconstrained negative samples may introduce excessive noise that distorts the decision boundary. To this end, we introduce a novel generic augmented negative sampling paradigm and provide a concrete instantiation. First, we disentangle hard and easy factors of negative items. Next, we generate new candidate negative samples by augmenting only the easy factors in a regulated manner: the direction and magnitude of the augmentation are carefully calibrated. Finally, we design an advanced negative sampling strategy to identify the final augmented negative samples, which considers not only the score function used in existing methods but also a new metric called augmentation gain. Extensive experiments on real-world datasets demonstrate that our method significantly outperforms state-of-the-art baselines.

11 pages, 16 figures,

Related Organizations

Hong Kong Baptist University
China (People's Republic of)
Harbin Engineering University
China (People's Republic of)

Keywords

FOS: Computer and information sciences, 68T07, H.3.3, Information Retrieval (cs.IR), Computer Science - Information Retrieval

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	18
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%

Found an issue? Give us feedback

18

Top 10%

Green

Related to Research communities

UArctic