Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ IEEE Accessarrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
IEEE Access
Article . 2025 . Peer-reviewed
License: CC BY
Data sources: Crossref
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
IEEE Access
Article . 2025
Data sources: DOAJ
versions View all 2 versions
addClaim

This Research product is the result of merged Research products in OpenAIRE.

You have already added 0 works in your ORCID record related to the merged Research product.

Double Filter and Double Wrapper Feature Selection Algorithm for High-Dimensional Data Analysis

Authors: Hong Chen; Yuefeng Zheng;

Double Filter and Double Wrapper Feature Selection Algorithm for High-Dimensional Data Analysis

Abstract

With the advent of the big data era, we often deal with datasets containing a large number of redundant features, and in this context, dimensionality reduction of data becomes crucial. To address this issue, this study proposes a double filter and double wrapper (DFDW) feature selection algorithm for high-dimensional data. In the double filter stage, the algorithm first evaluates all features from two perspectives using two filter algorithms: ReliefF and the Pearson correlation coefficient. It then selects the top k features and obtains a candidate feature subset F by taking the intersection. Next, the standard Cauchy distribution was used for population initialization. Subsequently, the algorithm enters the double wrapper stage, where it uses the Random Walk Whale Optimization Algorithm (RWWOA) and the improved Adaptive Differential Evolution (ADE) to jointly optimize and obtain the optimal feature subset. Among them, in order to overcome the problem of single algorithm falling into the local optimum, the Algorithm Iteration Mechanism is proposed, which selectively runs two wrapper algorithms to make the algorithm jump out of local optimum and explore a broader optimization space. Finally, we verified the effectiveness of the algorithm through three sets of comparative experiments. The experimental results show that the DFDW algorithm performed well in obtaining the optimal feature subsets on 10 high-dimensional datasets, with an average classification accuracy of more than 95.1% on 8 datasets, a dimensionality reduction rate of less than 0.64% on all datasets, and the lowest dimensionality reduction rate of 0.19%.

Related Organizations
Keywords

ReliefF, differential evolution, Feature selection, Pearson correlation coefficient, Electrical engineering. Electronics. Nuclear engineering, whale optimization algorithm, TK1-9971

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
0
Average
Average
Average
gold