Supplement of the paper “The emergence of high-fitness variants accelerates the slowdown of genome heterogeneity in the coronavirus”

Since the outbreak of the COVID-19 pandemic, the SARS-CoV-2 coronavirus has accumulated a substantial amount of genome variability through mutation and recombination. To test for evolutionary trends that could inform us about the adaptive process of the virus to its human host, we compute a genome-wide measure of Sequence Compositional Complexity (SCC) in high-quality coronavirus genomes from across the globe, covering the full span of the pandemic. In early samples, we find no statistical support for any trend in SCC values over time, although the virus genome appears to evolve faster than expected under Brownian motion. However, in samples taken after the emergence of Variants of Concern with higher transmissibility, and controlling for phylogenetic and sampling effects, we detect a declining trend for SCC and an increasing one for its absolute evolutionary rate. This means that the decline in SCC itself accelerated over time, and that the increasing fitness of variant genomes led to a reduction of their genome sequence heterogeneity.

Supplementary files

* SupplementaryTables S1-S18.xlsx. The strain name, the collection date, and the SCC values for each analyzed genome.
* SupplementaryTableS19.pdf. A complete list acknowledging all originating and submitting laboratories for the sequence data in GISAID EpiCoV on which these analyses are based.
* SupplementaryTable S20.pdf. A complete list acknowledging the authors and the originating and submitting laboratories of the genetic sequences we used for the analysis of the Nextstrain sample.
* PhylogeneticTimetrees_NexusFormat.zip. Phylogenetic timetrees (Nexus format).
* PhylogeneticTimetrees_NewickFormat.zip. Phylogenetic timetrees (Newick format).
* SCCdata.zip. SCC data.
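As a rough illustration of what "a trend in SCC values over time" means, the sketch below fits an ordinary least-squares slope of SCC against collection date, using pairs like those listed in the supplementary tables. This is only a naive check under stated assumptions: it is not the paper's method, which controls for phylogenetic and sampling effects, and the function names (`ols_slope`, `scc_trend_per_day`) and the toy values are hypothetical placeholders, not data from the tables.

```python
from datetime import date


def ols_slope(xs, ys):
    """Ordinary least-squares slope of ys regressed on xs."""
    n = len(xs)
    if n < 2 or n != len(ys):
        raise ValueError("need at least two paired observations")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("all x values are identical; slope is undefined")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


def scc_trend_per_day(records):
    """Naive SCC change per day from (collection_date, scc) pairs.

    Assumption: genomes are treated as independent observations, which
    ignores their shared phylogenetic history.
    """
    records = list(records)
    origin = min(d for d, _ in records)
    days = [(d - origin).days for d, _ in records]
    values = [v for _, v in records]
    return ols_slope(days, values)


# Made-up placeholder values for illustration only (not real SCC data).
toy_records = [
    (date(2021, 1, 1), 10.0),
    (date(2021, 1, 11), 9.0),
    (date(2021, 1, 21), 8.0),
]
slope = scc_trend_per_day(toy_records)
print(f"naive SCC trend: {slope:.3f} per day")
```

A negative slope would correspond to a declining SCC over the sampled period. Because related genomes are not independent observations, a slope obtained this way should be read only as a first look, not as evidence of an evolutionary trend.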
This project was funded by grants from the Spanish Ministry of Science, Innovation and Universities (former Spanish Ministry of Economy and Competitiveness) to J.L.O. (Project AGL2017-88702-C2-2-R) and A.M. (Project PID2019-105969GB-I00), and by a grant from Generalitat Valenciana to A.M. (Project Prometeo/2018/A/133), and was co-financed by the European Regional Development Fund (ERDF). The most time-demanding computations were done on the servers of the Laboratory of Bioinformatics, Dept. of Genetics & Institute of Biotechnology, Center of Biomedical Research, 18100, Granada, Spain.
Phylogenetic evolutionary trends, coronavirus evolution, genome heterogeneity, sequence compositional complexity.