
With the introduction of microarray, cancer classification, diagnosis and prediction are made more accurate and effective. However, the final outcome of the data analyses very much depend on the huge number of genes with relatively small number of samples present in each experiment. It is thus crucial to select relevant genes to be used for future specific cancer markers. Many feature selection methods have been proposed but none is able to classify all kinds of microarray data accurately, especially on those multi-class datasets. We propose a one-versus-one comparison method for selecting discriminatory features instead of performing the statistical test in a one-versus-all manner. Brain cancer is chosen as an example. Here, 3 types of statistics are used: signal-to-noise ratio (SNR), t-statistics and Pearson correlation coefficient. Results are verified by performing hierarchical and k-means clustering. Using our one-versus-one comparisons, best performance accuracies of 90.48% and 97.62% can be obtained by hierarchical and k-means clustering respectively. However best performance accuracies of 88.10% and 80.95% can be obtained respectively when using one-versus-all comparison. This shows that one-versus-one comparison is superior.
Automated, Biological - Metabolism, Pattern Recognition, Pattern Recognition, Automated, Computer-Assisted, Theoretical, Computational Biology - Methods, Models, Diagnosis, Brain Neoplasms - Diagnosis - Genetics, Biomarkers, Tumor, Cluster Analysis, Humans, Diagnosis, Computer-Assisted, Tumor Markers, Oligonucleotide Array Sequence Analysis, Neoplastic, Tumor Markers, Biological - Metabolism, Models, Statistical, Brain Neoplasms, Gene Expression Profiling, Neoplasm Proteins - Metabolism, Computational Biology, Statistical, Models, Theoretical, Neoplasm Proteins, Gene Expression Regulation, Neoplastic, Gene Expression Regulation, Algorithms
Automated, Biological - Metabolism, Pattern Recognition, Pattern Recognition, Automated, Computer-Assisted, Theoretical, Computational Biology - Methods, Models, Diagnosis, Brain Neoplasms - Diagnosis - Genetics, Biomarkers, Tumor, Cluster Analysis, Humans, Diagnosis, Computer-Assisted, Tumor Markers, Oligonucleotide Array Sequence Analysis, Neoplastic, Tumor Markers, Biological - Metabolism, Models, Statistical, Brain Neoplasms, Gene Expression Profiling, Neoplasm Proteins - Metabolism, Computational Biology, Statistical, Models, Theoretical, Neoplasm Proteins, Gene Expression Regulation, Neoplastic, Gene Expression Regulation, Algorithms
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 8 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
