
Contains all supplementary files for the `baktfold-analysis` repository (https://github.com/gbouras13/baktfold-analysis) that are too large for GitHub.

File list:

- `baktfold-benchmark.tar.gz`: Bakta manuscript benchmark genomes; notably, contains the GenBank and MAG dataset genomes
- `combined_plasmid_annotations.tsv.gz`: IMG/PR annotations for Bakta + Baktfold
- `all_chunks_with_go.tsv.gz`: all GlobDB per-protein Baktfold annotations, with mapped GO terms for all Swiss-Prot hits
- `protist_baktfold_jsons.tar`: all Ensembl protists Baktfold JSON annotation files
- `smag_combined_baktfold_with_eggnog.tsv.gz`: SMAG dataset protein eggNOG-mapper annotations (from the original Delmont et al. publication) + Baktfold annotations
- `updated_arc_protein.trimmed.faa.gz`: custom archaeal protein database (1,993,306 proteins), raw FASTA
- `updated_arc_protein.headers.tsv.gz`: custom archaeal protein database (1,993,306 proteins), two-column TSV for use with Baktfold's `--custom-annotations` parameter
- `updated_arc_protein.trimmed.fs.db.tar.gz`: custom archaeal protein database (1,993,306 proteins), Foldseek database for use with `--custom-db`
- `genbank_predictions_esm.tar`, `genbank_hypotheticals_structures.tar`, `mag_predictions_esm.tar`, `mag_hypotheticals_structures.tar`: ESMFold and ColabFold predictions for hypothetical proteins in the MAG and GenBank benchmarking datasets
