Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ ZENODOarrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Dataset
Data sources: ZENODO
addClaim

Genome-wide AnnotateMissense predictions for 90.6 million hg38 missense variants

Authors: Muneeb, Muhammad; Ascher, David B.;

Genome-wide AnnotateMissense predictions for 90.6 million hg38 missense variants

Abstract

This record contains the genome-wide AnnotateMissense prediction database and model-associated output associated with the manuscript AnnotateMissense: A Multi-source Annotation and Benchmarking Framework for Genome-wide Missense Pathogenicity Prediction. AnnotateMissense is a scalable framework for genome-wide missense variant annotation, feature integration, benchmarking, and pathogenicity prediction. Starting from 90,643,830 hg38 missense single-nucleotide variants derived from dbNSFP v5.1/CAGI7 Annotate-All-Missense resources, the workflow integrates ANNOVAR-derived annotations, dbNSFP features, AlphaMissense scores, ESM-derived protein language model features, and engineered biological features. The uploaded files include: variants.duckdb.gz: compressed DuckDB database containing genome-wide missense variant annotations and AnnotateMissense prediction outputs. UQ_BioSig_model_Final.tsv.gz: compressed final model-associated prediction/output table. The source code and workflow scripts are available at: https://github.com/MuhammadMuneeb007/CAGI7_Annotate_All_Missense These predictions are intended for research prioritisation, variant triage, and reproducible benchmarking. They are not standalone clinical classifications and should not be used as the sole basis for clinical decision-making.

Powered by OpenAIRE graph
Found an issue? Give us feedback