descriptionPublicationkeyboard_double_arrow_right Article 01 Jul 2012Publisher:Institute of Electrical and Electronics Engineers (IEEE)Journal:IEEE/ACM Transactions on Computational Biology and Bioinformatics, volume 9, pages 1,106-1,119 (issn: 1545-5963,

Authors: Lazar, Cosmin; Taminau, Jonatan; Meganck, Stijn; Steenhoff, David; Nowe, Ann; Coletta, Alain; Molter, Colin; +3 Authors

doi: 10.1109/tcbb.2012.33

pmid: 22350210

handle: 2013/ULB-DIPOT:oai:dipot.ulb.ac.be:2013/184434

A Survey on Filter Techniques for Feature Selection in Gene Expression Microarray Analysis

- Summary
- Subjects
- Metrics

Abstract

A plenitude of feature selection (FS) methods is available in the literature, most of them rising as a need to analyze data of very high dimension, usually hundreds or thousands of variables. Such data sets are now available in various application areas like combinatorial chemistry, text mining, multivariate imaging, or bioinformatics. As a general accepted rule, these methods are grouped in filters, wrappers, and embedded methods. More recently, a new group of methods has been added in the general framework of FS: ensemble techniques. The focus in this survey is on filter feature selection methods for informative feature discovery in gene expression microarray (GEM) analysis, which is also known as differentially expressed genes (DEGs) discovery, gene prioritization, or biomarker discovery. We present them in a unified framework, using standardized notations in order to reveal their technical details and to highlight their common characteristics as well as their particularities.

Related Organizations

Vrije Universiteit Amsterdam
Netherlands
Vrije Universiteit Brussel
Belgium
Université Libre de Bruxelles
Belgium

Keywords

Genetic Markers, gene prioritization, Gene ranking, Biotechnologie, Information Theory, Statistics, Nonparametric, scoring functions, feature selection, gene expression data, biomarker discovery, Oligonucleotide Array Sequence Analysis, Information filters, Analysis of Variance, Models, Statistical, Gene Expression Profiling, Computational Biology, Bayes Theorem, Statistical significance, Mathématiques, gene ranking, ROC Curve, information filters, Scoring functions, Feature selection, Gene prioritization, statistical methods, Gene expression data, Biomarker discovery, Biologie

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	455
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 0.1%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 1%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 1%