Optimized customer churn prediction using tabular generative adversarial network (GAN)-based hybrid sampling method and cost-sensitive learning

Keywords: Cost-sensitive learning, Algorithms and Analysis of Algorithms, Electronic computers. Computer science, GAN-based hybrid sampling method, QA75.5-76.95, Customer churn prediction

I Nyoman Mahayasa Adiputra; Paweena Wanchai; Pei-Chun Lin


PeerJ Computer Science

Article . 2025 . Peer-reviewed

License: CC BY

Data sources: Crossref

PubMed Central

Other literature type . 2025

License: http://creativecommons.org/licenses/by/4.0/. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.

Data sources: PubMed Central

PeerJ Computer Science

Article . 2025

Data sources: DOAJ

DBLP

Article

Data sources: DBLP


Publication: Article, Other literature type
Date: 19 Jun 2025
Language: English
Publisher: PeerJ
Journal: PeerJ Computer Science, volume 11, page e2949 (eISSN: 2376-5992)
Funded by: UKRI | A connected digital proce...


doi: 10.7717/peerj-cs.2949


Abstract

Background: Imbalanced and overlapped data in customer churn prediction significantly impact classification results. Various sampling and hybrid sampling methods have demonstrated effectiveness in addressing these issues. However, these methods have not performed well with classical machine learning algorithms.

Methods: To optimize the performance of classical machine learning on customer churn prediction tasks, this study introduces an extension framework called CostLearnGAN, which combines a tabular generative adversarial network (GAN)-based hybrid sampling method with cost-sensitive learning. By taking a cost-sensitive learning perspective, the research aims to enhance the performance of several classical machine learning algorithms in customer churn prediction tasks. Based on the experimental results, classical machine learning algorithms exhibit shorter execution times, making them suitable for predicting churn in large customer bases.

Results: The experiment covered six comparative sampling methods, six datasets, and three machine learning algorithms. CostLearnGAN achieved satisfying results across all evaluation metrics, with an average mean rank score of 1.44. The study also provides a robustness measurement for the algorithms, showing that CostLearnGAN outperforms the other sampling methods in improving the performance of classical machine learning models, with an average robustness value of 5.68.
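The abstract summarizes results as an average mean rank score without giving the procedure. The sketch below shows one common way such a score can be computed, under stated assumptions: each method is ranked per dataset (rank 1 is the best score, ties share the average rank), and the ranks are then averaged across datasets. The function names, dataset names, method names, and scores are all hypothetical placeholders and are not taken from the paper.

```python
from statistics import mean


def rank_descending(scores):
    """Rank methods on one dataset: rank 1 = highest score; ties share the average rank."""
    ordered = sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
    ranks = {}
    i = 0
    while i < len(ordered):
        j = i
        # Extend j over the run of methods whose score ties with position i.
        while j + 1 < len(ordered) and ordered[j + 1][1] == ordered[i][1]:
            j += 1
        shared = ((i + 1) + (j + 1)) / 2  # average of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[ordered[k][0]] = shared
        i = j + 1
    return ranks


def mean_ranks(results):
    """Average each method's per-dataset rank across all datasets."""
    per_dataset = [rank_descending(scores) for scores in results.values()]
    methods = per_dataset[0].keys()
    return {m: mean(r[m] for r in per_dataset) for m in methods}


# Placeholder scores invented purely for illustration (NOT results from the paper).
results = {
    "dataset_a": {"method_x": 0.80, "method_y": 0.75, "method_z": 0.70},
    "dataset_b": {"method_x": 0.65, "method_y": 0.72, "method_z": 0.65},
    "dataset_c": {"method_x": 0.90, "method_y": 0.85, "method_z": 0.88},
}

summary = mean_ranks(results)
print(summary)  # {'method_x': 1.5, 'method_y': 2.0, 'method_z': 2.5}
```

Under this convention a lower mean rank is better, so a value close to 1 means a method was at or near the top on most datasets. Whether the study ranks per metric, per classifier, or per dataset is not stated in the abstract, so the aggregation level here is an assumption.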

Related Organizations

Khon Kaen University
Thailand
Feng Chia University
Taiwan


Related research

ctgan-enn-cs software on GitHub (IsRelatedTo)

Impact by BIP!

selected citations: 1. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
popularity: Average. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
influence: Average. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
impulse: Average. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.



Funded by

UKRI| A connected digital process for reproducible 3D printing mass manufacture
