Pandemic drugs at pandemic speed: infrastructure for accelerating COVID-19 drug discovery with hybrid machine learning- and physics-based simulations on high-performance computers

@article{Bhati2021PandemicDA,
  title={Pandemic drugs at pandemic speed: infrastructure for accelerating COVID-19 drug discovery with hybrid machine learning- and physics-based simulations on high-performance computers},
  author={Agastya P. Bhati and Shunzhou Wan and Dario Alf{\`e} and Austin R. Clyde and Mathis Bode and Li Tan and Mikhail Titov and Andr{\'e} Merzky and Matteo Turilli and Shantenu Jha and R. Highfield and Walter Rocchia and Nicola Scafuri and Sauro Succi and Dieter August Kranzlm{\"u}ller and Gerald Mathias and David Wifling and Yann Donon and Alberto Di Meglio and Sofia Vallecorsa and Heng Ma and Anda Trifan and Arvind Ramanathan and Thomas S. Brettin and Alexander Partin and Fangfang Xia and Xiaotan Duan and Rick L. Stevens and Peter V. Coveney},
  journal={Interface Focus},
  year={2021},
  volume={11}
}
The race to meet the challenges of the global pandemic has served as a reminder that the existing drug discovery process is expensive, inefficient and slow. There is a major bottleneck screening the vast number of potential small molecules to shortlist lead compounds for antiviral drug development. New opportunities to accelerate drug discovery lie at the interface between machine learning methods, in this case, developed for linear accelerators, and physics-based methods. The two in silico… 

Figures and Tables from this paper

Structures of the SARS-CoV-2 spike glycoprotein and applications for novel drug development

Novel insights are provided for the efficient discovery of promising drug candidates against the S protein and contribute to the development of broad-spectrum anti-coronavirus drugs to fight against SARS-CoV-2.

Coupling streaming AI and HPC ensembles to achieve 100–1000× faster biomolecular simulations

The results establish DeepDriveMD as a high-performance framework for ML-driven HPC simulation scenarios, that supports diverse MD simulation and ML back-ends, and which enables new scientific insights by improving the length and time scales accessible with current computing capacity.

Large Scale Study of Ligand-Protein Relative Binding Free Energy Calculations: Actionable Predictions from Statistically Robust Protocols

The findings from a large dataset comprising over 500 ligand transformations spanning over 300 ligands binding to a diverse set of 14 different protein targets are reported which furnish statistically robust results on the accuracy, precision and reproducibility of RBFE calculations.

Alchemical Free Energy Estimators and Molecular Dynamics Engines: Accuracy, Precision, and Reproducibility

The binding free energy between a ligand and its target protein is an essential quantity to know at all stages of the drug discovery pipeline. Assessing this value computationally can offer insight

Data-Driven Analytics Leveraging Artificial Intelligence in the Era of COVID-19: An Insightful Review of Recent Developments

An insightful overview of the role of data-driven analytics leveraging AI in the era of COVID-19 and the challenges involved in applying AI to the available data and privacy issues that can arise from personal data transitioning into cyberspace are discussed.

Applying Physics-Informed Enhanced Super-Resolution Generative Adversarial Networks to Finite-Rate-Chemistry Flows and Predicting Lean Premixed Gas Turbine Combustors

The modeling approach of PIESRGAN is modified to accurately account for the challenges in the context of laminar finite-rate-chemistry flows and a reduced PiesRGAN-based model is presented that solves only the major species on a reconstructed �áeld and employs PIERSGAN lookup for the remaining species, utilizing staggering in time.

Applying Physics-Informed Enhanced Super-Resolution Generative Adversarial Networks to Turbulent Non-Premixed Combustion on Non-Uniform Meshes and Demonstration of an Accelerated Simulation Workflow

This paper extends the methodology to use physics-informed enhanced super-resolution generative adversarial networks (PIESR-GANs) for LES subfilter modeling in turbulent flows with finite-rate chemistry

Asynchronous Execution of Heterogeneous Tasks in AI-coupled HPC Workflows

An analysis of an important class of heterogeneous work, viz., AI-driven HPC workflows, to investigate asynchronous task execution requirements and properties and proposes key metrics that can be used to determine qualitative benefits when employing asynchronous execution.

Large Scale Study of Ligand–Protein Relative Binding Free Energy Calculations: Actionable Predictions from Statistically Robust Protocols

The findings from a large data set comprising over 500 ligand transformations spanning over 300 ligands binding to a diverse set of 14 different protein targets which furnish statistically robust results on the accuracy, precision, and reproducibility of RBFE calculations are reported.

References

SHOWING 1-10 OF 88 REFERENCES

Targeting SARS-CoV-2 with AI- and HPC-enabled Lead Generation: A First Data Release

This data release encompasses structural information on the 4.2 B molecules enriched with pre-computed data to enable exploration and application of image-based deep learning methods, and 2D and 3D molecular descriptors to speed development of machine learning models.

IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads

Development and deployment of computational infrastructure at scale integrates multiple artificial intelligence and simulation-based approaches to overcome this fundamental limitation of the drug discovery process.

Scalable HPC and AI Infrastructure for COVID-19 Therapeutics

This work describes several methods that integrate artificial intelligence and simulation-based approaches, and the design of computational infrastructure to support these methods at scale, and discusses their implementation and characterize their performance.

Scalable HPC & AI infrastructure for COVID-19 therapeutics

Several methods that integrate artificial intelligence and simulation-based approaches, and the design of computational infrastructure to support these methods at scale are described.

Reaction-Based Enumeration, Active Learning, and Free Energy Calculations To Rapidly Explore Synthetically Tractable Chemical Space and Optimize Potency of Cyclin-Dependent Kinase 2 Inhibitors

A new computational technique, referred to as 'PathFinder', that uses retrosynthetic analysis followed by combinatorial synthesis to generate novel compounds in synthetically accessible chemical space to accelerate the discovery of novel chemical matter in drug discovery campaigns is reported.

AI-Driven Multiscale Simulations Illuminate Mechanisms of SARS-CoV-2 Spike Dynamics

A generalizable AI-driven workflow is developed that leverages heterogeneous HPC resources to explore the time-dependent dynamics of molecular systems and demonstrates how AI can accelerate conformational sampling across different systems and pave the way for the future application of such methods to additional studies in SARS-CoV-2 and other molecular systems.

AI-driven multiscale simulations illuminate mechanisms of SARS-CoV-2 spike dynamics

A generalizable AI-driven workflow is developed that leverages heterogeneous HPC resources to explore the time-dependent dynamics of molecular systems and presents several novel scientific discoveries, including the elucidation of the spike’s full glycan shield and the characterization of the flexible interactions between the spike and the human ACE2 receptor.

[Artificial Intelligence in Drug Discovery].

A database to integrate genome data and clinical information obtained by clinical genome analysis and a computational support system for clinical interpretation of variants using AI and the development of a binding affinity prediction system for mutated proteins and drugs by molecular dynamics simulation using supercomputer "Kei".

Fast Identification of Possible Drug Treatment of Coronavirus Disease-19 (COVID-19) through Computational Drug Repurposing Study

The findings of this study can facilitate rational drug design targeting the SARS-CoV-2 main protease, including carfilzomib, eravacycline, valrubicin, lopinavir, and elbasvir.

Accelerated antimicrobial discovery via deep generative models and molecular dynamics simulations

Two antimicrobial peptides have low toxicity, as validated in vitro and in mice, and it is shown using live-cell confocal imaging that the bactericidal mode of action of the peptides involves the formation of membrane pores.
...