
pmid: 12967956
Abstract Motivation: Clustering genes based upon their expression patterns allows us to predict gene function. Most existing clustering algorithms cluster genes together when their expression patterns show high positive correlation. However, it has been observed that genes whose expression patterns are strongly anti-correlated can also be functionally similar. Biologically, this is not unintuitive—genes responding to the same stimuli, regardless of the nature of the response, are more likely to operate in the same pathways. Results: We present a new diametrical clustering algorithm that explicitly identifies anti-correlated clusters of genes. Our algorithm proceeds by iteratively (i) re-partitioning the genes and (ii) computing the dominant singular vector of each gene cluster; each singular vector serving as the prototype of a 'diametric' cluster. We empirically show the effectiveness of the algorithm in identifying diametrical or anti-correlated clusters. Testing the algorithm on yeast cell cycle data, fibroblast gene expression data, and DNA microarray data from yeast mutants reveals that opposed cellular pathways can be discovered with this method. We present systems whose mRNA expression patterns, and likely their functions, oppose the yeast ribosome and proteosome, along with evidence for the inverse transcriptional regulation of a number of cellular systems. Availability: See http://bioinformatics.icmb.utexas.edu for the experimental results. Software is available on request.
Models, Genetic, Sequence Analysis, RNA, Gene Expression Profiling, Statistics as Topic, Reproducibility of Results, Fibroblasts, Sensitivity and Specificity, Gene Expression Regulation, Yeasts, Cluster Analysis, Humans, Sequence Alignment, Algorithms, Oligonucleotide Array Sequence Analysis
Models, Genetic, Sequence Analysis, RNA, Gene Expression Profiling, Statistics as Topic, Reproducibility of Results, Fibroblasts, Sensitivity and Specificity, Gene Expression Regulation, Yeasts, Cluster Analysis, Humans, Sequence Alignment, Algorithms, Oligonucleotide Array Sequence Analysis
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 75 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 1% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
