
pmid: 12117795
arXiv: cond-mat/0105479
Abstract Motivation: Comparative genomics provides a powerful way to investigate regularities and differences observed at DNA level across species. Here we study the number and location of inverted repeats occurring in complete genomes of bacteria. Inverted repeats are compatible with the formation of hairpin structures in the messenger RNA. Some of these structures are known to be rho-independent intrinsic terminators. Results: We investigate the number of inverted repeats observed in 37 complete genomes of bacteria. The number of inverted repeats observed is much higher than expected using Markovian models of DNA sequences in most of the eubacteria. By using the information annotated in the genomes we discover that in most of the eubacteria the inverted repeats of stem length longer than 8 nucleotides preferentially locate near the 3′ end of the nearest coding regions. We also show that IRs characterized by large values of the stem length locate preferentially in short non-coding regions bounded by two 3′ ends of convergent genes. By using the program TransTerm recently introduced to predict transcription terminators in bacterial genomes, we conclude that only a part of the observed inverted repeats fullfils the model requirements characterizing rho-independent termination in several genomes. Availability: http://lagash.dft.unipa.it/IR.html Contact: mantegna@unipa.it
DNA, Bacterial, Models, Statistical, Base Sequence, Models, Genetic, Statistical Mechanics (cond-mat.stat-mech), Molecular Sequence Data, FOS: Physical sciences, Sequence Analysis, DNA, Sensitivity and Specificity, Quantitative Biology, FOS: Biological sciences, Databases, Nucleic Acid, Sequence Alignment, Condensed Matter - Statistical Mechanics, Genome, Bacterial, Quantitative Biology (q-bio), Repetitive Sequences, Nucleic Acid
DNA, Bacterial, Models, Statistical, Base Sequence, Models, Genetic, Statistical Mechanics (cond-mat.stat-mech), Molecular Sequence Data, FOS: Physical sciences, Sequence Analysis, DNA, Sensitivity and Specificity, Quantitative Biology, FOS: Biological sciences, Databases, Nucleic Acid, Sequence Alignment, Condensed Matter - Statistical Mechanics, Genome, Bacterial, Quantitative Biology (q-bio), Repetitive Sequences, Nucleic Acid
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 27 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
