• Corpus ID: 238583214

Iterative Refinement Graph Neural Network for Antibody Sequence-Structure Co-design

@article{Jin2021IterativeRG,
  title={Iterative Refinement Graph Neural Network for Antibody Sequence-Structure Co-design},
  author={Wengong Jin and Jeremy Wohlwend and Regina Barzilay and T. Jaakkola},
  journal={ArXiv},
  year={2021},
  volume={abs/2110.04624}
}
Antibodies are versatile proteins that bind to pathogens like viruses and stimulate the adaptive immune system. The specificity of antibody binding is determined by complementarity-determining regions (CDRs) at the tips of these Y-shaped proteins. In this paper, we propose a generative model to automatically design the CDRs of antibodies with enhanced binding specificity or neutralization capabilities. Previous generative approaches formulate protein design as a structure-conditioned sequence… 

Figures and Tables from this paper

Benchmarking deep generative models for diverse antibody sequence design
TLDR
Three recently proposed deep generative frameworks for protein design are considered: (AR) the sequence-based autoregressive generative model, (GVP) the precise structure-based graph neural network, and Fold2Seq that leverages a fuzzy and scale-free representation of a three-dimensional fold, while enforcing structure-to-sequence consistency.
AntBO: Towards Real-World Automated Antibody Design with Combinatorial Bayesian Optimisation
TLDR
The results across 188 antigens demonstrate the benefit of AntBO in designing CDRH3 regions with diverse biophysical properties, and in under 200 protein designs, AntBO can suggest antibody sequences that outperform the best binding sequence drawn from 6.9 million experimentally obtained CD RH3s and a commonly used genetic algorithm baseline.
In silico proof of principle of machine learning-based antibody design at unconstrained scale
TLDR
This work established a priori feasibility and the theoretical foundation of high-throughput ML-based mAb design and found that a deep generative model, trained exclusively on antibody sequence data, can be used to design conformational epitope-specific antibodies, matching, or exceeding the training dataset in affinity and developability parameter value variety.
A Deep SE(3)-Equivariant Model for Learning Inverse Protein Folding
TLDR
A generative SE(3)-equivariant model is developed which significantly improves upon existing autoregressive methods and captures functional aspects of the underlying protein by accurately predicting the effects of point mutations through testing on Deep Mutational Scanning datasets.
A 3D Generative Model for Structure-Based Drug Design Step 1: 3D Generative Model Design 2: Sampling Step 3: Training Molecule Design Data: at 30% sequence identity,
TLDR
A 3D generative model that generates molecules given a designated 3D protein binding site that exhibits high binding affinity to specific targets and good drug properties such as drug-likeness even if the model is not explicitly optimized for them is proposed.
A 3D Molecule Generative Model for Structure-Based Drug Design
TLDR
A 3D generative model that generates molecules given a designated 3D protein binding site that exhibits high binding affinity to specific targets and good drug properties such as drug-likeness even if the model is not explicitly optimized for them is proposed.
Accelerating Bayesian Optimization for Biological Sequence Design with Denoising Autoencoders
TLDR
In the experiments, LaMBO outperforms genetic optimizers and does not require a large pretraining corpus, demonstrating that Bayesian optimization is practical and effective for biological sequence design.
Independent SE(3)-Equivariant Models for End-to-End Rigid Protein Docking
TLDR
A novel pairwise-independent SE(3)-equivariant graph matching network is designed to predict the rotation and translation to place one of the proteins at the right docked position relative to the second protein.
END-TO-END RIGID PROTEIN DOCKING
TLDR
A novel pairwise-independent SE(3)-equivariant graph matching network is designed to predict the rotation and translation to place one of the proteins at the right docked position relative to the second protein.
Learning inverse folding from millions of predicted structures
TLDR
A sequence-to-sequence transformer with invariant geometric input processing layers achieves 51% native sequence recovery on structurally held-out backbones with 72% recovery for buried residues, an overall improvement of almost 10 percentage points over existing methods.
...
1
2
...

References

SHOWING 1-10 OF 52 REFERENCES
Antibody Complementarity Determining Region Design Using High-Capacity Machine Learning
TLDR
A machine learning method is presented that can design human Immunoglobulin G (IgG) antibodies with target affinities that are superior to candidates from phage display panning experiments within a limited design budget and shows that data from disparate antibody campaigns can be combined by machine learning to improve antibody specificity.
Fast and flexible design of novel proteins using graph neural networks
TLDR
A deep graph neural network, ProteinSolver, can solve protein design by phrasing it as a constraint satisfaction problem (CSP), and develops a network that is accurately able to solve the related and straightforward problem of Sudoku puzzles.
Protein Design and Variant Prediction Using Autoregressive Generative Models
TLDR
This work introduces a deep generative model adapted from natural language processing for prediction and design of diverse functional sequences without the need for alignments and successfully design and test a diverse 105-nanobody library that shows better expression than a 1000-fold larger synthetic library.
Antibody design using LSTM based deep generative model from phage display library for affinity maturation
TLDR
A long short term memory network—a widely used deep generative model—based sequence generation and prioritization procedure to efficiently discover antibody sequences with higher affinity is employed to affinity maturation of antibodies against kynurenine.
In silico proof of principle of machine learning-based antibody design at unconstrained scale
TLDR
This work establishes a priori feasibility and the theoretical foundation of high-throughput ML-based mAb design and shows increased generation quality of low-N-based machine learning models via transfer learning.
RosettaAntibodyDesign (RAbD): A general framework for computational antibody design
TLDR
This work rigorously benchmarked RAbD on a set of 60 diverse antibody–antigen complexes, using two design strategies—optimizing total Rosetta energy and optimizing interface energy alone and utilized two novel metrics for measuring success in computational protein design.
Design of proteins presenting discontinuous functional sites using deep learning
TLDR
This work uses the trRosetta residual neural network, which maps input sequences to predicted inter-residue distances and orientations, to compute a loss function which simultaneously rewards recapitulation of a desired structural motif and the ideality of the surrounding scaffold, and generates diverse structures harboring the desired binding interface by optimizing this loss function by gradient descent.
Unified rational protein engineering with sequence-based deep representation learning
TLDR
Deep learning is applied to unlabeled amino-acid sequences to distill the fundamental features of a protein into a statistical representation that is semantically rich and structurally, evolutionarily and biophysically grounded and broadly applicable to unseen regions of sequence space.
OptCDR: a general computational method for the design of antibody complementarity determining regions for targeted epitope binding.
TLDR
The results demonstrate that OptCDR can efficiently generate diverse antibody libraries of a pre-specified size with promising antigen affinity potential as exemplified by computationally derived binding metrics.
AbDesign: An algorithm for combinatorial backbone design guided by natural conformations and sequences
TLDR
A combinatorial backbone and sequence optimization algorithm called AbDesign is described, which leverages the large number of sequences and experimentally determined molecular structures of antibodies to construct new antibody models, dock them against target surfaces and optimize their sequence and backbone conformation for high stability and binding affinity.
...
1
2
3
4
5
...