
pmid: 14594704
Abstract We propose a simple algorithm to detect dominating synonymous codon usage bias in genomes. The algorithm is based on a precise mathematical formulation of the problem that lead us to use the Codon Adaptation Index (CAI) as a ‘universal’ measure of codon bias. This measure has been previously employed in the specific context of translational bias. With the set of coding sequences as a sole source of biological information, the algorithm provides a reference set of genes which is highly representative of the bias. This set can be used to compute the CAI of genes of prokaryotic and eukaryotic organisms, including those whose functional annotation is not yet available. An important application concerns the detection of a reference set characterizing translational bias which is known to correlate to expression levels; in this case, the algorithm becomes a key tool to predict gene expression levels, to guide regulatory circuit reconstruction, and to compare species. The algorithm detects also leading–lagging strands bias, GC-content bias, GC3 bias, and horizontal gene transfer. The approach is validated on 12 slow-growing and fast-growing bacteria, Saccharomyces cerevisiae, Caenorhabditis elegans and Drosophila melanogaster. Availability: http://www.ihes.fr/~materials.
Genome, Models, Statistical, Bacteria, Base Sequence, Models, Genetic, Gene Expression Profiling, Genetic Variation, Reproducibility of Results, Saccharomyces cerevisiae, Sequence Analysis, DNA, [INFO] Computer Science [cs], Adaptation, Physiological, Sensitivity and Specificity, [SDV] Life Sciences [q-bio], Drosophila melanogaster, Gene Frequency, Animals, Caenorhabditis elegans, Codon, Sequence Alignment, Algorithms
Genome, Models, Statistical, Bacteria, Base Sequence, Models, Genetic, Gene Expression Profiling, Genetic Variation, Reproducibility of Results, Saccharomyces cerevisiae, Sequence Analysis, DNA, [INFO] Computer Science [cs], Adaptation, Physiological, Sensitivity and Specificity, [SDV] Life Sciences [q-bio], Drosophila melanogaster, Gene Frequency, Animals, Caenorhabditis elegans, Codon, Sequence Alignment, Algorithms
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 280 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 1% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 1% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
