
The UHGV is a comprehensive genomic resource of viruses from the human gut microbiome. Genomes were derived from 12 independent data sources and annotated using a uniform bioinformatics pipeline. The files in this repository, which are described below, contain the most relevant data for most users. For the remaining data, please see: https://uhgv.jgi.doe.gov/downloads File Description uhgv_full.fna.zst Genomic sequences of all viruses (n=873,995) in UHGV, including the ones with uncertain virus prediction uhgv_full.faa.zst Protein sequences (n=37,443,649) of all viruses in UHGV, including the ones with uncertain virus prediction votus_full.fna.zst Genomic sequences of vOTU representatives (n=168,536) votus_full.faa.zst Protein sequences (n=7,426,124) of vOTU representatives uhgv_metadata.tsv.zst Metadata for each of the 873,995 UHGV genomes votus_metadata_extended.tsv.zst Metadata for the 168,536 species-level viral clusters (vOTUs) source_biosample_metadata.tsv.zst Information for the samples from which virus genomes were obtained host_range_breadth.tsv.zst Estimated host range breadth for 72,503 vOTUs read_mapping_relative_abundances.tsv.zst Per-sample relative abundances of viruses and hosts derived from read mapping data read_mapping_sample_metadata.tsv.zst Metadata describing the samples used for viral profiling through read mapping (e.g., country, lifestyle, age, gender, BMI, study) read_mapping_study_metadata.tsv.zst Study-level metadata for the sources from which samples were obtained for viral profiling through read mapping
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
