A Hybrid Feature Selection Method for Improve the Accuracy of Medical Classification Process

descriptionPublicationkeyboard_double_arrow_right Article , Other literature type 30 Nov 2021Publisher:Blue Eyes Intelligence Engineering and Sciences Engineering and Sciences Publication - BEIESPJournal:International Journal of Innovative Technology and Exploring Engineering, volume 11, pages 50-55 (eissn: 2278-3075,

Copyright policy )

Authors: Maria Mohammad Yousef;

doi: 10.35940/ijitee.a9624.1111121

A Hybrid Feature Selection Method for Improve the Accuracy of Medical Classification Process

- Summary
- Subjects
- Metrics

Abstract

Generally, medical dataset classification has become one of the biggest problems in data mining research. Every database has a given number of features but it is observed that some of these features can be redundant and can be harmful as well as disrupt the process of classification and this problem is known as a high dimensionality problem. Dimensionality reduction in data preprocessing is critical for increasing the performance of machine learning algorithms. Besides the contribution of feature subset selection in dimensionality reduction gives a significant improvement in classification accuracy. In this paper, we proposed a new hybrid feature selection approach based on (GA assisted by KNN) to deal with issues of high dimensionality in biomedical data classification. The proposed method first applies the combination between GA and KNN for feature selection to find the optimal subset of features where the classification accuracy of the k-Nearest Neighbor (kNN) method is used as the fitness function for GA. After selecting the best-suggested subset of features, Support Vector Machine (SVM) are used as the classifiers. The proposed method experiments on five medical datasets of the UCI Machine Learning Repository. It is noted that the suggested technique performs admirably on these databases, achieving higher classification accuracy while using fewer features.

Related Organizations

Al al-Bayt University
Jordan

Keywords

Dimensionality Problem, Feature Selection, classification, Genetic Algorithm.

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average