Powered by OpenAIRE graph
Found an issue? Give us feedback
addClaim

Data reduction for classification with ant colony algorithms

Authors: Khalid M. Salama; Ashraf M. Abdelbar; Ismail M. Anwar;

Data reduction for classification with ant colony algorithms

Abstract

In the field of data mining, classification is a supervised learning task whose purpose is to induce models (classifiers), using a set of labeled training data instances, to predict the class of new unlabeled instances. Data preparation is crucial to the data mining process, and its aim is to improve the fitness of the training data to allow learning algorithms to produce more effective classifiers. Two widely-applied data preparation methods are feature selection and instance selection, both of which fall under the umbrella of data reduction. In this paper, we present new ant colony optimization (ACO) algorithms for data reduction - via both feature and instance selection - to improve the predictive quality of the constructed classification models. Empirical evaluations on 43 benchmark datasets with five well-known classification algorithms show that our ACO algorithms improve the predictive quality of the produced classifiers. We also compare the performance of our proposed ACO algorithms to CIW-NN, a state-of-the-art co-evolutionary instance selection, instance weighting and feature weighting nearest-neighbour classifier, using a Friedman test of statistical significance.

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    19
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Top 10%
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Top 10%
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Top 10%
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
19
Top 10%
Top 10%
Top 10%
Upload OA version
Are you the author of this publication? Upload your Open Access version to Zenodo!
It’s fast and easy, just two clicks!