Downloads provided by UsageCounts
This dataset contains 9 sets of species-level gene clusters and high-resolution species trees for 125 representative bacterial and archaeal species, encompassing a total of 6,851 nearly complete genomes. Each set represents a different approach to homology-, orthology-, and synteny-based gene clustering as implemented by 6 popular tools for comparative genomics and pangenome analysis (Roary, panX, OrthoFinder, MMseqs2/PanACoTa, CD-HIT, and eggNOG-mapper). For Escherichia coli, Cutibacterium acnes, Bacteroides uniformis, and Staphylococcus epidermidis, we provide additional sets that combine high-quality genomes with different proportions of medium- and low-quality metagenome-assembled genomes (MAGs). This dataset is a helpful resource for benchmarking gene clustering tools and pangenome analysis workflows, as well as for testing their robustness with respect to the presence of incomplete or contaminated genomic assemblies. Reference: Manzano-Morales S, Liu Y, González-Bodí S, Huerta-Cepas J, Iranzo J. 2022. Comparison of gene clustering criteria reveals intrinsic uncertainty in pangenome analyses. bioRxiv doi: 10.1101/2022.09.25.509376
Changes with respect to previous version: addition of gene clusters for medium- and low-quality MAGs.
{"references": ["https://www.biorxiv.org/content/10.1101/2022.09.25.509376"]}
pangenome, COG, comparative genomics, orthology
pangenome, COG, comparative genomics, orthology
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 1 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
| views | 7 | |
| downloads | 2 |

Views provided by UsageCounts
Downloads provided by UsageCounts