• Corpus ID: 228083817

Utilising Graph Machine Learning within Drug Discovery and Development

@article{Gaudelet2020UtilisingGM,
  title={Utilising Graph Machine Learning within Drug Discovery and Development},
  author={Thomas Gaudelet and Ben Day and Arian Jamasb and Jyothish Soman and Cristian Regep and Gertrude Liu and Jeremy B. R. Hayter and Richard J Vickers and Charlie Roberts and Jian Tang and David Roblin and Tom L. Blundell and Michael M. Bronstein and Jake P. Taylor-King},
  journal={ArXiv},
  year={2020},
  volume={abs/2012.05716}
}
Graph Machine Learning (GML) is receiving growing interest within the pharmaceutical and biotechnology industries for its ability to model biomolecular structures, the functional relationships between them, and integrate multi-omic datasets - amongst other data types. Herein, we present a multidisciplinary academic-industrial review of the topic within the context of drug discovery and development. After introducing key terms and modelling approaches, we move chronologically through the drug… 

Figures from this paper

A Review of Biomedical Datasets Relating to Drug Discovery: A Knowledge Graph Perspective
TLDR
This review presents a comparative analysis of existing public drug discovery KGs and a evaluation of selected motivating case studies from the literature, and raises numerous and unique challenges and issues associated with the domain and its datasets, whilst also highlighting key future research directions.
Understanding and bridging the gaps in current GNN performance optimizations
TLDR
An in-depth examination of the state-of-the-art GNN frameworks is provided, revealing five major gaps in the current frameworks in optimizing GNN performance, especially in handling the special complexities of GNN over traditional graph or DNN operations.
Multiphysical graph neural network (MP-GNN) for COVID-19 drug design.
TLDR
The proposed multiphysical graph neural network (MP-GNN) model can outperform all existing models as far as the authors know, and is used in coronavirus disease 2019 drug design and demonstrates the great potential of the model for the screening of potential drugs for SARS-CoV-2.
G RAPH A NISOTROPIC D IFFUSION FOR M OLECULES
TLDR
This work presents a new GNN architecture called Graph Anisotropic Diffusion, which alters-nates between linear diffusion, for which a closed-form solution is available, and local anisotropic filters to obtain efficient multi-hop an isotropic kernels.
Graph Anisotropic Diffusion
TLDR
This work presents a new GNN architecture called Graph Anisotropic Diffusion, which alters-nates between linear diffusion, for which a closed-form solution is available, and local anisotropic filters to obtain efficient multi-hop an isotropic kernels.
Graph Neural Networks in Particle Physics: Implementations, Innovations, and Challenges
TLDR
A range of graph neural networks capabilities that allow a wide variety of high- and low-level physical features to be attached to measurements and, by the same token, to best match unique GNN capabilities to unique HEP obstacles are presented.
Understanding the Performance of Knowledge Graph Embeddings in Drug Discovery
Implications of Topological Imbalance for Representation Learning on Biomedical Knowledge Graphs
TLDR
The results highlight the importance of data modeling choices, and the need for practitioners to be mindful of these issues when interpreting model outputs and during KG composition, and provide support for the observation that KGE models can be more influenced by the frequency of entities rather than any biological information encoded within the relations.
Biological Insights Knowledge Graph: an integrated knowledge graph to support drug development
TLDR
The main design choices in implementation of the BIKG graph are described and different aspects of its life cycle are discussed: from graph construction to exploitation.
...
...

References

SHOWING 1-10 OF 263 REFERENCES
Machine learning‐driven protein engineering: a case study in computational drug discovery
TLDR
A platform that combines high-throughput display and selection data generation with ML and enables multi-parameter optimisation across multiple important protein characteristics is presented.
Toward heterogeneous information fusion: bipartite graph convolutional networks for in silico drug repurposing
TLDR
BiFusion is proposed, a bipartite graph convolution network model for DR through heterogeneous information fusion that combines insights of multiscale pharmaceutical information by constructing a multirelational graph of drug–protein, disease–protein and PPIs.
A multiple kernel learning algorithm for drug-target interaction prediction
TLDR
KronRLS-MKL, which models the drug-target interaction problem as a link prediction task on bipartite networks, allows the integration of multiple heterogeneous information sources for the identification of new interactions, and can also work with networks of arbitrary size.
A Literature-Based Knowledge Graph Embedding Method for Identifying Drug Repurposing Opportunities in Rare Diseases
TLDR
A knowledge graph embedding method is applied that explicitly models the uncertainty associated with literature-derived relationships and uses link prediction to generate drug repurposing hypotheses and is capable of generating novel repurpose hypotheses, which are independently validate using external literature sources and protein interaction networks.
Relation-weighted Link Prediction for Disease Gene Identification
TLDR
A novel machine learning method is proposed that identifies disease genes on such graphs by leveraging recent advances in network biology and graph representation learning, and it is empirically demonstrated that the algorithms outperform its closest state-of-the-art competitor in disease gene identification.
An omics perspective on drug target discovery platforms
TLDR
A survey and compare a set of popular DTD platforms that utilize multiple data sources and omics-driven knowledge bases (either directly or indirectly) for identifying drug targets and describes of omics technologies and related data repositories which are important for DTD tasks.
PotentialNet for Molecular Property Prediction
TLDR
The PotentialNet family of graph convolutions, specifically designed for and achieve state-of-the-art performance for protein–ligand binding affinity, is presented and a cross-validation strategy based on structural homology clustering is introduced that can more accurately measure model generalizability, which crucially distinguishes the aims of machine learning for drug discovery from standard machine learning tasks.
In silico methods for drug repurposing and pharmacology
TLDR
It is argued that methods such as matrix factorization that can integrate data within and across diverse data types have the potential to improve predictive performance and provide a fuller picture of a drug's pharmacological action.
Integrated Multi-Omics Analyses in Oncology: A Review of Machine Learning Methods and Tools
TLDR
Recent data-driven methodologies that have been developed and applied to respond major challenges of stratified medicine in oncology, including patients' phenotyping, biomarker discovery, and drug repurposing are explored.
Neural networks for link prediction in realistic biomedical graphs: a multi-dimensional evaluation of graph embedding-based approaches
TLDR
Investigation of how inputs from four node representation algorithms affect performance of a neural link predictor on random- and time-sliced biomedical graphs of real-world sizes containing information relevant to DTI, PPI and LBD showed that neural network methods performed well on links between nodes with no previous common neighbours; potentially the most interesting links.
...
...