
AbstractThe β and β′ subunits of the RNA polymerase (RNAP) are large proteins with complex multi‐domain architectures that include several insertional domains. Here, we analyze the domain organizations of RNAP‐β and RNAP‐β′ using sequence, experimentally determined structures and AlphaFold structure predictions. We observe that lineage‐specific insertional domains in bacterial RNAP‐β belong to a group that we call BEAN (broadly embedded annex). We observe that lineage‐specific insertional domains in bacterial RNAP‐β′ belong to a group that we call HABAS (hammerhead/barrel‐sandwich hybrid). The BEAN domain has a characteristic three‐dimensional structure composed of two square bracket‐like elements that are antiparallel relative to each other. The HABAS domain contains a four‐stranded open β‐sheet with a GD‐box‐like motif in one of the β‐strands and the adjoining loop. The BEAN domain is inserted not only in the bacterial RNAP‐β′, but also in the archaeal version of universal ribosomal protein L10. The HABAS domain is inserted in several metabolic proteins. The phylogenetic distributions of bacterial lineage‐specific insertional domains of β and β′ subunits of RNAP follow the Tree of Life. The presence of insertional domains can help establish a relative timeline of events in the evolution of a protein because insertion is inferred to post‐date the base domain. We discuss mechanisms that might account for the discovery of homologous insertional domains in non‐equivalent locations in bacteria and archaea.
Models, Molecular, Protein Domains, Bacterial Proteins, DNA-Directed RNA Polymerases, Amino Acid Sequence, Phylogeny, Research Article
Models, Molecular, Protein Domains, Bacterial Proteins, DNA-Directed RNA Polymerases, Amino Acid Sequence, Phylogeny, Research Article
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 2 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
