Publication: Article, Preprint
Date: 24 Aug 2024
Embargo end date: 01 Jan 2024
Publisher: ACM
Journal: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

Authors: Xixi Wu; Kaiyu Xiong; Yun Xiong; Xiaoxin He; Yao Zhang; Yizhu Jiao; Jiawei Zhang

doi: 10.1145/3637528.3671749, 10.48550/arxiv.2408.07369

arXiv: 2408.07369

ProCom: A Few-shot Targeted Community Detection Algorithm


Abstract

Targeted community detection aims to distinguish a particular type of community in a network. It is an important task with many real-world applications, e.g., identifying fraud groups in transaction networks. Traditional community detection methods fail to capture the specific features of the targeted community and instead detect all types of communities indiscriminately. Semi-supervised community detection algorithms, which have emerged as a feasible alternative, are inherently constrained by their limited adaptability and their heavy reliance on large amounts of labeled data, which demands extensive domain knowledge and manual effort. In this paper, we address these weaknesses in targeted community detection by focusing on few-shot scenarios. We propose ProCom, a novel framework that extends the "pre-train, prompt" paradigm, offering a low-resource, high-efficiency, and transferable solution. Within the framework, we devise a dual-level context-aware pre-training method that fosters a deep understanding of latent communities in the network, establishing a rich knowledge foundation for downstream tasks. In the prompt learning stage, we reformulate the targeted community detection task in terms of the pre-training objectives, allowing the extraction of specific knowledge relevant to the targeted community to facilitate effective and efficient inference. By leveraging both the general community knowledge acquired during pre-training and the specific insights gained from the prompt communities, ProCom exhibits remarkable adaptability across different datasets. We conduct extensive experiments on five benchmarks to evaluate the ProCom framework, demonstrating its state-of-the-art performance under few-shot scenarios, strong efficiency, and transferability across diverse datasets.

Accepted by SIGKDD'2024

Related Organizations

University of Illinois at Urbana Champaign, United States
Fudan University, China (People's Republic of)
National University of Singapore, Singapore
University of California System, United States
University of California, Davis, United States


Keywords

Social and Information Networks (cs.SI), FOS: Computer and information sciences, Computer Science - Social and Information Networks


KDD2024ProCom software on GitHub
IsRelatedTo

Impact by BIP!

selected citations: 5. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
popularity: Top 10%. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
influence: Average. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
impulse: Top 10%. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.