Gene selection and cancer classification using interaction-based feature clustering and improved-binary Bat algorithm

Name: Gene selection and cancer classification using interaction-based feature clustering and improved-binary Bat algorithm
Keywords: Neoplasms, Gene Expression Profiling, Databases, Genetic, Humans, Cluster Analysis, Computational Biology, Female, Algorithms

Ahmad Esfandiari; Niki Nasiri

Found an issue? Give us feedback

Computers in Biology...arrow_drop_down

Computers in Biology and Medicine

Article . 2024 . Peer-reviewed

License: Elsevier TDM

Data sources: Crossref

https://pubmed.ncbi.nlm.nih.go...

Article . 2024

Data sources: Europe PubMed Central

DBLP

Article

Data sources: DBLP

Gene selection and cancer classification using interaction-based feature clustering and improved-binary Bat algorithm

descriptionPublicationkeyboard_double_arrow_right Article 01 Oct 2024 English Publisher:Elsevier BVJournal:Computers in Biology and Medicine, volume 181, page 109,071 (issn: 0010-4825,

Copyright policy )

Authors: Ahmad Esfandiari; Niki Nasiri;

doi: 10.1016/j.compbiomed.2024.109071

pmid: 39205342

Gene selection and cancer classification using interaction-based feature clustering and improved-binary Bat algorithm

- Summary
- Subjects
- Metrics

Abstract

In high-dimensional gene expression data, selecting an optimal subset of genes is crucial for achieving high classification accuracy and reliable diagnosis of diseases. This paper proposes a two-stage hybrid model for gene selection based on clustering and a swarm intelligence algorithm to identify the most informative genes with high accuracy. First, a clustering-based multivariate filter approach is performed to explore the interactions between the features and eliminate any redundant or irrelevant ones. Then, by controlling for the problem of premature convergence in the binary Bat algorithm, the optimal gene subset is determined using different classifiers with the Monte Carlo cross-validation data partitioning model. The effectiveness of our proposed framework is evaluated using eight gene expression datasets, by comparison with other recently published algorithms in the literature. Experiments confirm that in seven out of eight datasets, the proposed method can achieve superior results in terms of classification accuracy and gene subset size. In particular, it achieves a classification accuracy of 100% in Lymphoma and Ovarian datasets and above 97.4% in the rest with a minimum number of genes. The results demonstrate that our proposed algorithm has the potential to solve the feature selection problem in different applications with high-dimensional datasets.

Related Organizations

Islamic Azad University Sari Branch
Iran (Islamic Republic of)
Mazandaran University of Medical Sciences
Iran (Islamic Republic of)

Keywords

Neoplasms, Gene Expression Profiling, Databases, Genetic, Humans, Cluster Analysis, Computational Biology, Female, Algorithms

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	6
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%

Found an issue? Give us feedback

6

Top 10%

Average

Top 10%

Related to Research communities

Cancer Research

Upload OA version

Are you the author of this publication? Upload your Open Access version to Zenodo!

It’s fast and easy, just two clicks!

uploadUpload now