Downloads provided by UsageCounts
Centrifuge database for Sar, Nematode, Bacteria, Archaea, Fungi, Protozoa, and Human reference and representative genomes (SANBAFPH) Database type: Centrifuge database Centrifuge version: 1.0.4 Label: 17032023 Sequences: 32,318 Sum length: 25,415,962,902 Assembly summary entries: 4,668 Summary In module 2 of the SOVAP pipeline, which is dedicated to the decontamination step, we used Centrifuge software to filter out reads originating from non-viral sources. To achieve this, we created a custom database for Centrifuge called SANBAFPH, which includes reference and representative genomes of SAR (Stramenopila, Alveolata, and Rhizaria) Nematodes, Bacteria, Archaea, Fungi, Protozoa, and Human. This custom database enables Centrifuge to accurately identify and remove non-viral reads from the virome datasets, resulting in more accurate and reliable viral detection and analysis. -------------------------------------------------------- SOVAP v.1.3: GitHub Soil Virome Analysis Pipeline Description The study of viral communities in complex environmental samples, such as soil, can provide valuable insights into the diversity and functions of viral communities in the ecosystem. However, processing and analyzing of virome data can be a challenging task that requires the integration of various computational tools and techniques. To address these challenges, we have developed SOVAP pipeline that utilizes a suite of state-of-the-art tools for processing, analysis, and annotation viromics and metagenomics data. It utilizes various tools such as Fastp and Centrifuge for preprocessing and contamination removal, geNomad, Diamond and Megan for identification and annotation of viral contigs which are assembled and clustered using Megahit and CD-HIT. Additionally, this pipeline provides an estimate of the abundance of viral contigs, allowing for a more comprehensive understanding of the virome within the sample. The integration of these tools offers a reliable and effective means of taxonomy classification and annotation of viral contigs, aiding researchers in gaining insight into the composition and function of the virome within the analyzed sample. By integrating the SOVAP pipeline with IMG/VR and geNomad, it is possible to identify a wider range of viruses, including those that were previously unknown. The batch-mode script allows for the processing of multiple datasets using the SOVAP pipeline. This feature is particularly useful for large-scale analyses, such as those involving multiple environmental samples or large sequencing datasets.
SOVAP, Viromics, Centrifuge
SOVAP, Viromics, Centrifuge
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
| views | 15 | |
| downloads | 1 |

Views provided by UsageCounts
Downloads provided by UsageCounts