Appendix A. Supplementary data for Exploring the universal healthy human gut microbiota around the world

Piquer-Esteban, Samuel; Ruiz-Ruiz, Susana; Arnau, Vicente; Díaz-Villanueva, Wladimiro; Moya, Andrés

Found an issue? Give us feedback

Recolector de Cienci...arrow_drop_down

Recolector de Ciencia Abierta, RECOLECTA

Dataset . 2023 . Peer-reviewed

Data sources: Recolector de Ciencia Abierta, RECOLECTA

DIGITAL.CSIC

Dataset . 2023 . Peer-reviewed

Data sources: DIGITAL.CSIC

Appendix A. Supplementary data for Exploring the universal healthy human gut microbiota around the world

Research datakeyboard_double_arrow_right Dataset 01 Jan 2023 Spain English Publisher:Elsevier BV

Authors: Piquer-Esteban, Samuel; Ruiz-Ruiz, Susana; Arnau, Vicente; Díaz-Villanueva, Wladimiro; Moya, Andrés;

handle: 10261/330885

Appendix A. Supplementary data for Exploring the universal healthy human gut microbiota around the world

- Summary
- Metrics

Abstract

Supplementary data to this article: Supplementary Document 1. Detailed description of the databases construction. Supplementary Figure 1. Processing and construction workflow of reference databases. The final databases used in the comparison are highlighted in red. Supplementary Figure 2. Determination of the optimal number of clusters for average abundance clustering analysis. The Elbow method was used to set the optimal number of clusters for the k-means algorithm at genus (A) and species (B) level. Supplementary Figure 3. Determination of the optimal number of clusters for z-scored average abundance patterns analysis. The Elbow method was used to set the optimal number of clusters for the k-means algorithm at genus (A) and species (B) level. Supplementary Figure 4. Bracken results comparison between the enriched filtered-MAG and original NT databases. From top to bottom, the performance of the databases was examined in terms of classification capacity for all samples as a whole, by host lifestyles and geographic origin at genus (A) and species (B) level. In the box plots, the black line within the box marks the median and the red triangle the mean, outliers are presented as red dots. Supplementary Figure 5. Relative abundances of top gut microbiota at the genus level. Only the 25 most abundant taxa in all samples were represented. Taxa are sorted by average relative abundances and their NCBI’s taxID is indicated in parentheses. In the box plots, the black line within the box marks the median and the red triangle the mean. Supplementary Figure 6. Universal core species. (A) Intersections between cores of interest. (B) Taxonomic relationships between universal core species. The number of core taxa assigned to a particular level is indicated inside the square brackets. (C) Prevalence-Abundance Heatmap. Taxa are sorted by average relative abundances and their NCBI’s taxID is indicated in parentheses. Supplementary Figure 7. Taxonomic relationships between different intersect core species in a prevalence gradient. Additional intersect cores were computed defining soft and medium prevalence species cores, which were compared to the corresponding universal species core, working in a prevalence range between 0.5 and 1. Supplementary Figure 8. Relative abundances of top gut microbiota at the species level. Only the 25 most abundant taxa in all samples were represented. Taxa are sorted by average relative abundances and their NCBI’s taxID is indicated in parentheses. In the box plots, the black line within the box marks the median and the red triangle the mean. Supplementary Figure 9. Relative abundances of individuals for the universal core taxa. Abundances are shown for the genus (A) and species (B) core taxa. Groups and taxa were clustered using the Complete method and Euclidean distances. In the case of the groups, this was done for every single group combination, and each taxon NCBI’s taxID is indicated in parentheses. Supplementary Figure 10. Abundance patterns of individuals for the universal core taxa. Z-scored abundances are shown for the genus (A) and species (B) core taxa. Groups and taxa were clustered using the Complete method and Euclidean distances. In the case of the groups, this was done for every single group combination, and each taxon NCBI’s taxID is indicated in parentheses. Supplementary Table 1. Summary information for the microbiome project genomes used in the construction of the enriched databases. Supplementary Table 2. Summary information for the metagenome studies used in this work. Supplementary Table 3. Summary of statistics for samples quality control. The different results generated by FastQC were aggregated with MultiQC for each study. Supplementary Table 4. Metadata for each human gut metagenome sample. Supplementary Table 5. Core genera membership for the different cores of interest. Presence-absence data is presented for the different cores of interest at 0.9, 0.7, and 0.5 prevalence thresholds. Prevalence values for the intersect core taxa at 1e−4 relative abundance threshold are also shown. NCBI’s taxIDs are indicated in parentheses. Supplementary Table 6. Core species membership for the different cores of interest. Presence-absence data is presented for the different cores of interest at 0.9, 0.7, and 0.5 prevalence thresholds. Prevalence values for the intersect core taxa at 1e−4 relative abundance threshold are also shown. NCBI’s taxIDs are indicated in parentheses. Supplementary Table 7. Universal core genera information and known characteristics.

Peer reviewed

Country

Spain

Related Organizations

Spanish National Research Council
Spain

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Usage byUsageCounts

visibility	views	63
download	downloads	34

63
views
34
downloads
Powered by

Found an issue? Give us feedback

visibility

download

0

Average

63

34