
Single amino acid repeats are prevalent in eukaryote organisms, although the role of many such sequences is still poorly understood. We have performed a comprehensive analysis of the proteins containing homopolymeric histidine tracts in the human genome and identified 86 human proteins that contain stretches of five or more histidines. Most of them are endowed with DNA- and RNA-related functions, and, in addition, there is an overrepresentation of proteins expressed in the brain and/or nervous system development. An analysis of their subcellular localization shows that 15 of the 22 nuclear proteins identified accumulate in the nuclear subcompartment known as nuclear speckles. This localization is lost when the histidine repeat is deleted, and significantly, closely related paralogous proteins without histidine repeats also fail to localize to nuclear speckles. Hence, the histidine tract appears to be directly involved in targeting proteins to this compartment. The removal of DNA-binding domains or treatment with RNA polymerase II inhibitors induces the re-localization of several polyhistidine-containing proteins from the nucleoplasm to nuclear speckles. These findings highlight the dynamic relationship between sites of transcription and nuclear speckles. Therefore, we define the histidine repeats as a novel targeting signal for nuclear speckles, and we suggest that these repeats are a way of generating evolutionary diversification in gene duplicates. These data contribute to our better understanding of the physiological role of single amino acid repeats in proteins.
570, GENES, TANDEM REPEATS, AMINO-ACID REPEATS, Molecular Sequence Data, Nuclear Localization Signals, UBIQUITIN-PROTEASOME SYSTEM, QH426-470, TRANSCRIPTIONAL ACTIVITY, Cell Line, INTERCHROMATIN GRANULE CLUSTERS, SPLICING FACTOR, SIMPLE SEQUENCE, 616, Genetics, Humans, Histidine, Amino Acids, Genetics & Heredity, Cell Nucleus, 0604 Genetics, Genome, Science & Technology, Genome, Human, Nuclear Proteins, Proteins, P-TEFB, Protein Transport, Tandem Repeat Sequences, Life Sciences & Biomedicine, Sequence Alignment, Human, CELL-NUCLEUS, Developmental Biology, Research Article
570, GENES, TANDEM REPEATS, AMINO-ACID REPEATS, Molecular Sequence Data, Nuclear Localization Signals, UBIQUITIN-PROTEASOME SYSTEM, QH426-470, TRANSCRIPTIONAL ACTIVITY, Cell Line, INTERCHROMATIN GRANULE CLUSTERS, SPLICING FACTOR, SIMPLE SEQUENCE, 616, Genetics, Humans, Histidine, Amino Acids, Genetics & Heredity, Cell Nucleus, 0604 Genetics, Genome, Science & Technology, Genome, Human, Nuclear Proteins, Proteins, P-TEFB, Protein Transport, Tandem Repeat Sequences, Life Sciences & Biomedicine, Sequence Alignment, Human, CELL-NUCLEUS, Developmental Biology, Research Article
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 134 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 1% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
