Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ ZENODOarrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Software
Data sources: ZENODO
addClaim

ProteinTensor: AI-Native Biomolecular Tensor Storage for Structural Biology ML

Authors: Moore, Clayton W.;

ProteinTensor: AI-Native Biomolecular Tensor Storage for Structural Biology ML

Abstract

ProteinTensor is a Python library and file format (.ptt) that eliminates redundant preprocessing in structural biology machine learning pipelines. By converting mmCIF/PDB structures once into a Zarr-backed, LZ4-compressed, memory-mappable store, ProteinTensor provides zero-parse access to atomic coordinates, backbone geometry, covalent bond graphs, MSA tokens, pairwise distance features, and protein language model embeddings. Benchmarked on proteins from 76 to 3,525 residues, full feature assembly is 34x faster on average than traditional mmCIF-based pipelines.

Powered by OpenAIRE graph
Found an issue? Give us feedback