
pmid: 20595017
Non-protein-coding DNA comprises the majority of animal genomes, but its functions are largely unknown. We identified over 17,000 different tetranucleotide pairs in the Drosophila melanogaster genome that are over-represented at distances up to 100 nt in conserved non-exonic sequences. Those exhibiting the highest information content in surrounding nucleotides were classified into five groups: tRNAs, motifs associated with histone genes, Suppressor-of-Hairy-wing binding sites, and two sets of previously unrecognized motifs (DLM3 and DLM4). DLM3 and DLM4 are present in hundreds and thousands of copies, respectively, in the genome, located almost exclusively in non-coding regions. They have similar copy numbers among drosophilids, but are largely absent in other insects. DLM3 is likely a cis-regulatory element, whereas DLM4 sequences are capable of forming a short hairpin structure and are expressed as approximately 80 nt RNAs. This work reports the existence of Drosophila genus-specific sequence motifs, and suggests that many more novel functional elements may be discovered in genomes using the general approach outlined herein.
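The pair-counting step behind this approach can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function name `count_tetramer_pairs`, the definition of distance as the offset between the start positions of two non-overlapping tetranucleotides, and the skipping of windows containing ambiguous bases are all assumptions made for illustration. The abstract does not specify the statistic used to call a pair over-represented, so the sketch stops at raw counts.

```python
from collections import Counter


def count_tetramer_pairs(seq, max_dist=100, k=4):
    """Count (first k-mer, second k-mer, distance) triples in one sequence.

    Assumption: 'distance' is the offset between the start positions of the
    two k-mers, ranging from k (adjacent, non-overlapping) to max_dist.
    """
    seq = seq.upper()
    valid = set("ACGT")
    counts = Counter()
    n = len(seq)
    for i in range(n - k + 1):
        first = seq[i:i + k]
        if not set(first) <= valid:
            continue  # skip windows containing N or other ambiguity codes
        for d in range(k, max_dist + 1):
            j = i + d
            if j + k > n:
                break  # second k-mer would run past the end of the sequence
            second = seq[j:j + k]
            if not set(second) <= valid:
                continue
            counts[(first, second, d)] += 1
    return counts


# Toy usage on a short made-up sequence (not real genomic data).
pairs = count_tetramer_pairs("ACGTTTTTGGCC", max_dist=8)
print(pairs[("ACGT", "GGCC", 8)])
```

In practice such counts would be accumulated over all conserved non-exonic sequences and compared against a background expectation; the choice of background model is left open here because the abstract does not describe it.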
572, Euchromatin, Histones, Factor-binding sites, 1311 Genetics, RNA, Transfer, DNA-sequences, Regulatory elements, Genetics, Animals, Morphological evolution, Regulatory Elements, Transcriptional, Non-coding RNA, tRNA, Conserved Sequence, Phylogeny, Ultraconserved elements, DNA Primers, Human genome, Statistical properties, Computational Biology, Exaptation, Blotting, Northern, Drosophila melanogaster, Systematic discovery, Predicting regulatory regions, RNA, DNA, Intergenic, Transposable elements
| citations | 7 |
| popularity | Average |
| influence | Average |
| impulse | Average |
