
A plenitude of feature selection (FS) methods is available in the literature, most of them rising as a need to analyze data of very high dimension, usually hundreds or thousands of variables. Such data sets are now available in various application areas like combinatorial chemistry, text mining, multivariate imaging, or bioinformatics. As a general accepted rule, these methods are grouped in filters, wrappers, and embedded methods. More recently, a new group of methods has been added in the general framework of FS: ensemble techniques. The focus in this survey is on filter feature selection methods for informative feature discovery in gene expression microarray (GEM) analysis, which is also known as differentially expressed genes (DEGs) discovery, gene prioritization, or biomarker discovery. We present them in a unified framework, using standardized notations in order to reveal their technical details and to highlight their common characteristics as well as their particularities.
Genetic Markers, gene prioritization, Gene ranking, Biotechnologie, Information Theory, Statistics, Nonparametric, scoring functions, feature selection, gene expression data, biomarker discovery, Oligonucleotide Array Sequence Analysis, Information filters, Analysis of Variance, Models, Statistical, Gene Expression Profiling, Computational Biology, Bayes Theorem, Statistical significance, Mathématiques, gene ranking, ROC Curve, information filters, Scoring functions, Feature selection, Gene prioritization, statistical methods, Gene expression data, Biomarker discovery, Biologie
Genetic Markers, gene prioritization, Gene ranking, Biotechnologie, Information Theory, Statistics, Nonparametric, scoring functions, feature selection, gene expression data, biomarker discovery, Oligonucleotide Array Sequence Analysis, Information filters, Analysis of Variance, Models, Statistical, Gene Expression Profiling, Computational Biology, Bayes Theorem, Statistical significance, Mathématiques, gene ranking, ROC Curve, information filters, Scoring functions, Feature selection, Gene prioritization, statistical methods, Gene expression data, Biomarker discovery, Biologie
| citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 487 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 0.1% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 1% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 1% |
