
Abstract

Candidate phyla radiation (CPR) bacteria separate phylogenetically from other bacteria, but the organismal distribution of their protein families remains unclear. Here, we leveraged sequences from thousands of uncultivated organisms and identified protein families that co-occur in genomes, thus are likely foundational for lineage capacities. Protein family presence/absence patterns cluster CPR bacteria together, and away from all other bacteria and archaea, partly due to proteins without recognizable homology to proteins in other bacteria. Some are likely involved in cell-cell interactions and potentially important for episymbiotic lifestyles. The diversity of protein family combinations in CPR may exceed that of all other bacteria. Over the bacterial tree, protein family presence/absence patterns broadly recapitulate phylogenetic structure, suggesting persistence of core sets of proteins since lineage divergence. The CPR could have arisen in an episode of dramatic but heterogeneous genome reduction or from a protogenote community and co-evolved with other bacteria.
570, Genome, Bacteria, Science, Human Genome, Q, Bioinformatics and Computational Biology, Bacterial, 610, Biological Sciences, Microbiology, Article, Bacterial Proteins, Genetics, 2.2 Factors relating to the physical environment, Biochemistry and Cell Biology, Metagenomics, Aetiology, Genome, Bacterial, Phylogeny
