7 Research products, page 1 of 1
Loading
- Research software . 2023EnglishAuthors:Kumar, Neeraj; Bontha, Mridula; McNaughton, Andrew; Knutson, Carter; Pope, Jenna;Kumar, Neeraj; Bontha, Mridula; McNaughton, Andrew; Knutson, Carter; Pope, Jenna;Publisher: Not Available
3D-MolGNNRL, couples reinforcement learning (RL) to a deep generative model based on 3D-Scaffold to generate target candidates specific to a protein pocket building up atom by atom from the core scaffold. 3D-MolGNNRL provides an efficient way to optimize key features within a protein pocket using a parallel graph neural network model. The agent learns to build molecules in 3D space while optimizing the binding affinity, potency, and synthetic accessibility of the candidates generated for the SARS-CoV-2 Main protease
add Add to ORCIDPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product. - Research software . 2022Authors:Gans, Jason;Gans, Jason;Publisher: DOE CODE
Protective vaccines and reliable diagnostics are essential tools for controlling viral diseases. However, the efficacy of these tools can be diminished by mutations in viral genomes. The delay between the emergence of new viral strains and the redesign of vaccines and diagnostics allows for continued viral transmission. Is it possible to address this challenge by computationally predicting viral genome sequence evolution? Can we “future-proof” vaccines and diagnostics by targeting both current and anticipated future sequence variants? While predicting viral evolution is still an unsolved, “grand challenge” problem in biology, the large, and rapidly growing, number of SARS-CoV-2 genome sequences provide an opportunity to quantify the ability of machine learning to predict viral genome sequence evolution. Towards this end, we have developed a simple computational model for predicting viral evolution at the level of individual nucleotides. The key metric for quantifying the per-base, prediction accuracy for viral evolution is the Mann-Whitney U statistic (or, equivalently, the area under the receiver operator curve). Since the Mann-Whitney U statistic is not a differentiable function, existing deep leaning packages (like Pytorch and Keras/TensorFlow) are not useful, as they require that the accuracy metric/objective function be analytically differentiable with respect to the model parameters. To overcome this challenge, we have implemented custom software, “FutureTense”, that can train a machine learning model by maximizing the non-differentiable Mann-Whitney U statistic. This software trains a machine learning model by exploring along the direction of the discrete gradient of the Mann-Whitney U statistic in the model parameter space. Parallel computing and genome sequence-specific optimizations are used to accelerate model training. The resulting machine learning model learns the observed high C->U mutation rates in the SARS-CoV-2 genome (which are potentially induced by host defenses) and provides prediction accuracies that are significantly better than one would expect from random chance. While predicting viral evolution is still quite far from a solved problem, the surprising performance of this simple model gives hope that the accuracy of predicting viral genome evolution can be further increased by more sophisticated approaches.
- Research software . 2022Authors:Sandholtz, Sarah H; Drocco, Jeffrey A;Sandholtz, Sarah H; Drocco, Jeffrey A;Publisher: DOE CODE
The TargetID pipeline enables rapid identification and characterization of binding sites in SARS-CoV-2 proteins as well as the core chemical components with which these sites interact.
- Research software . 2021Authors:Cadena Pico, Jose E.; Soper, Braden C.; Ray, Pryadip; Mguyen, Chanh P.; Chan, Ryan;Cadena Pico, Jose E.; Soper, Braden C.; Ray, Pryadip; Mguyen, Chanh P.; Chan, Ryan;Publisher: DOE CODE
Background: Machine learning (ML) based risk stratification models of Electronic Health records (EHR) data may help to optimize treatment of COVID-19 patients, but are often limited by their lack of clinical interpretability and cost of laboratory tests. We develop a ML based tool for predicting adverse outcomes based on EHR data to optimize clinical utility under a given cost structure. This cohort study was performed using deidentified EHR data from COVID-19 patients from ProMedica Healthcare in northwest Ohio and southeastern Michigan. Methods: We tested performance of various ML approaches for predicting either increasing ventilatory support or mortality and the set of model features under a budget constraint was optimized via exhaustive search across all combinations of features. Results: The optimal sets of features for predicting ventilation under any budget constraint included demographics and comorbidities (DCM), basic metabolic panel (BMP), D-dimer, lactate dehydrogenase (LDH), erythrocyte sedimentation rate (ESR), CRP, brain natriuretic peptide (BNP), and procalcitonin and for mortality included DCM, BMP, complete blood count, D-dimer, LDH, CRP, BNP, procalcitonin and ferritin. Conclusions: This study presents a quick, accurate and cost-effective method to evaluate risk of deterioration for patients with SARS-CoV-2 infection at the time of clinical evaluation.
- Research software . 2021Authors:Higa, Kenneth; Ushizima, Daniela;Higa, Kenneth; Ushizima, Daniela;Publisher: DOE CODE
From a single data description file, this package generates a simple but complete RESTful web interface to a relational database, in the form of containers that run in a Docker environment. This initial version produces containers that are intended for use on the NERSC Spin platform. It was developed as a foundation for the smart catalog created for LDRD FY20 ACTS: Accelerating COVID-19 Testing with Screening.
Average popularityAverage popularity In bottom 99%Average influencePopularity: Citation-based measure reflecting the current impact.Average influence In bottom 99%Influence: Citation-based measure reflecting the total impact.add Add to ORCIDPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product. - Research software . 2021PythonAuthors:Safta, Cosmin; Ray, Jaideep; Blonigan, Patrick; Chowdhary, Kenny;Safta, Cosmin; Ray, Jaideep; Blonigan, Patrick; Chowdhary, Kenny;Publisher: DOE CODE
SAND2021-0565 O PRIME is a modeling framework designed for the real-time characterization and forecasting of partially observed epidemics. The method is designed to help guide medical resource allocation in the early epoch of the outbreak. Characterization is the estimation of infection spread parameters using daily counts of symptomatic patients. The estimation problem is posed as one of Bayesian inference and solved using a Markov Chain Monte Carlo technique. The framework can accommodate multiple epidemic waves and can help identify different disease dynamics at the regional, state, and country levels. Examples are provided using publicly available COVID-19 data. Sandia National Laboratories is a multimission laboratory managed and operated by National Technology & Engineering Solutions of Sandia, LLC, a wholly owned subsidiary of Honeywell International Inc., for the U.S. Department of Energy’s National Nuclear Security Administration under contract DE-NA0003525.
- Research software . 2020EnglishAuthors:Bauer, Travis;Bauer, Travis;Publisher: Sandia National Laboratories (SNL-NM), Albuquerque, NM (United States)
This is software lets one explore the data released as part of the COVID-19 Open Research Dataset Challenge. It downloads and analyzes the natural language text of the data set and then creates a 2D visualization that can be used to explore it. SAND2020-12185 M Sandia National Laboratories is a multimission laboratory managed and operated by National Technology & Engineering Solutions of Sandia, LLC, a wholly owned subsidiary of Honeywell International Inc., for the U.S. Department of Energy's National Nuclear Security Administration under contract DE-NA0003525.
Average popularityAverage popularity In bottom 99%Average influencePopularity: Citation-based measure reflecting the current impact.Average influence In bottom 99%Influence: Citation-based measure reflecting the total impact.add Add to ORCIDPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.
7 Research products, page 1 of 1
Loading
- Research software . 2023EnglishAuthors:Kumar, Neeraj; Bontha, Mridula; McNaughton, Andrew; Knutson, Carter; Pope, Jenna;Kumar, Neeraj; Bontha, Mridula; McNaughton, Andrew; Knutson, Carter; Pope, Jenna;Publisher: Not Available
3D-MolGNNRL, couples reinforcement learning (RL) to a deep generative model based on 3D-Scaffold to generate target candidates specific to a protein pocket building up atom by atom from the core scaffold. 3D-MolGNNRL provides an efficient way to optimize key features within a protein pocket using a parallel graph neural network model. The agent learns to build molecules in 3D space while optimizing the binding affinity, potency, and synthetic accessibility of the candidates generated for the SARS-CoV-2 Main protease
add Add to ORCIDPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product. - Research software . 2022Authors:Gans, Jason;Gans, Jason;Publisher: DOE CODE
Protective vaccines and reliable diagnostics are essential tools for controlling viral diseases. However, the efficacy of these tools can be diminished by mutations in viral genomes. The delay between the emergence of new viral strains and the redesign of vaccines and diagnostics allows for continued viral transmission. Is it possible to address this challenge by computationally predicting viral genome sequence evolution? Can we “future-proof” vaccines and diagnostics by targeting both current and anticipated future sequence variants? While predicting viral evolution is still an unsolved, “grand challenge” problem in biology, the large, and rapidly growing, number of SARS-CoV-2 genome sequences provide an opportunity to quantify the ability of machine learning to predict viral genome sequence evolution. Towards this end, we have developed a simple computational model for predicting viral evolution at the level of individual nucleotides. The key metric for quantifying the per-base, prediction accuracy for viral evolution is the Mann-Whitney U statistic (or, equivalently, the area under the receiver operator curve). Since the Mann-Whitney U statistic is not a differentiable function, existing deep leaning packages (like Pytorch and Keras/TensorFlow) are not useful, as they require that the accuracy metric/objective function be analytically differentiable with respect to the model parameters. To overcome this challenge, we have implemented custom software, “FutureTense”, that can train a machine learning model by maximizing the non-differentiable Mann-Whitney U statistic. This software trains a machine learning model by exploring along the direction of the discrete gradient of the Mann-Whitney U statistic in the model parameter space. Parallel computing and genome sequence-specific optimizations are used to accelerate model training. The resulting machine learning model learns the observed high C->U mutation rates in the SARS-CoV-2 genome (which are potentially induced by host defenses) and provides prediction accuracies that are significantly better than one would expect from random chance. While predicting viral evolution is still quite far from a solved problem, the surprising performance of this simple model gives hope that the accuracy of predicting viral genome evolution can be further increased by more sophisticated approaches.
- Research software . 2022Authors:Sandholtz, Sarah H; Drocco, Jeffrey A;Sandholtz, Sarah H; Drocco, Jeffrey A;Publisher: DOE CODE
The TargetID pipeline enables rapid identification and characterization of binding sites in SARS-CoV-2 proteins as well as the core chemical components with which these sites interact.
- Research software . 2021Authors:Cadena Pico, Jose E.; Soper, Braden C.; Ray, Pryadip; Mguyen, Chanh P.; Chan, Ryan;Cadena Pico, Jose E.; Soper, Braden C.; Ray, Pryadip; Mguyen, Chanh P.; Chan, Ryan;Publisher: DOE CODE
Background: Machine learning (ML) based risk stratification models of Electronic Health records (EHR) data may help to optimize treatment of COVID-19 patients, but are often limited by their lack of clinical interpretability and cost of laboratory tests. We develop a ML based tool for predicting adverse outcomes based on EHR data to optimize clinical utility under a given cost structure. This cohort study was performed using deidentified EHR data from COVID-19 patients from ProMedica Healthcare in northwest Ohio and southeastern Michigan. Methods: We tested performance of various ML approaches for predicting either increasing ventilatory support or mortality and the set of model features under a budget constraint was optimized via exhaustive search across all combinations of features. Results: The optimal sets of features for predicting ventilation under any budget constraint included demographics and comorbidities (DCM), basic metabolic panel (BMP), D-dimer, lactate dehydrogenase (LDH), erythrocyte sedimentation rate (ESR), CRP, brain natriuretic peptide (BNP), and procalcitonin and for mortality included DCM, BMP, complete blood count, D-dimer, LDH, CRP, BNP, procalcitonin and ferritin. Conclusions: This study presents a quick, accurate and cost-effective method to evaluate risk of deterioration for patients with SARS-CoV-2 infection at the time of clinical evaluation.
- Research software . 2021Authors:Higa, Kenneth; Ushizima, Daniela;Higa, Kenneth; Ushizima, Daniela;Publisher: DOE CODE
From a single data description file, this package generates a simple but complete RESTful web interface to a relational database, in the form of containers that run in a Docker environment. This initial version produces containers that are intended for use on the NERSC Spin platform. It was developed as a foundation for the smart catalog created for LDRD FY20 ACTS: Accelerating COVID-19 Testing with Screening.
Average popularityAverage popularity In bottom 99%Average influencePopularity: Citation-based measure reflecting the current impact.Average influence In bottom 99%Influence: Citation-based measure reflecting the total impact.add Add to ORCIDPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product. - Research software . 2021PythonAuthors:Safta, Cosmin; Ray, Jaideep; Blonigan, Patrick; Chowdhary, Kenny;Safta, Cosmin; Ray, Jaideep; Blonigan, Patrick; Chowdhary, Kenny;Publisher: DOE CODE
SAND2021-0565 O PRIME is a modeling framework designed for the real-time characterization and forecasting of partially observed epidemics. The method is designed to help guide medical resource allocation in the early epoch of the outbreak. Characterization is the estimation of infection spread parameters using daily counts of symptomatic patients. The estimation problem is posed as one of Bayesian inference and solved using a Markov Chain Monte Carlo technique. The framework can accommodate multiple epidemic waves and can help identify different disease dynamics at the regional, state, and country levels. Examples are provided using publicly available COVID-19 data. Sandia National Laboratories is a multimission laboratory managed and operated by National Technology & Engineering Solutions of Sandia, LLC, a wholly owned subsidiary of Honeywell International Inc., for the U.S. Department of Energy’s National Nuclear Security Administration under contract DE-NA0003525.
- Research software . 2020EnglishAuthors:Bauer, Travis;Bauer, Travis;Publisher: Sandia National Laboratories (SNL-NM), Albuquerque, NM (United States)
This is software lets one explore the data released as part of the COVID-19 Open Research Dataset Challenge. It downloads and analyzes the natural language text of the data set and then creates a 2D visualization that can be used to explore it. SAND2020-12185 M Sandia National Laboratories is a multimission laboratory managed and operated by National Technology & Engineering Solutions of Sandia, LLC, a wholly owned subsidiary of Honeywell International Inc., for the U.S. Department of Energy's National Nuclear Security Administration under contract DE-NA0003525.
Average popularityAverage popularity In bottom 99%Average influencePopularity: Citation-based measure reflecting the current impact.Average influence In bottom 99%Influence: Citation-based measure reflecting the total impact.add Add to ORCIDPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.