Downloads provided by UsageCounts
Identifying clusters of co-located, homologous genes is a commonplace procedure in comparative genomics, for example when looking for gene clusters encoding production of secondary metabolites. cblaster is a Python package that facilitates the identification of such gene clusters across publically available Basic Local Alignment Search Tool (BLAST) databases hosted by the National Center for Biotechnology Information (NCBI). Given either a FASTA file of query sequences, or a collection of valid NCBI sequence identifiers, cblaster is capable of both local (using the DIAMOND protein aligner) and remote (using the NCBI's public BLAST API) BLAST searches as well as retrieval and parsing of results. It leverages the NCBI's Identical Protein Groups (IPG) resource to retrieve the genomic context of each BLAST hit, grouping those that are co-located on genomic scaffolds within user-defined thresholds for intergenic distance and number of conserved sequences. cblaster then provides a human-readable report of its results. cblaster provides a simple command line interface with sensible default options, as well as offering several public methods and classes directly usable in Python code. It is installable from PyPI via pip (https://pypi.org/project/cblaster), and source code is freely available on GitHub (https://www.github.com/gamcil/cblaster) under the MIT license.
sequence similarity, BGC, secondary metabolism, biosynthetic gene cluster, homology, secondary metabolite, BLAST, gene cluster
sequence similarity, BGC, secondary metabolism, biosynthetic gene cluster, homology, secondary metabolite, BLAST, gene cluster
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
| views | 58 | |
| downloads | 3 |

Views provided by UsageCounts
Downloads provided by UsageCounts