Unified rational protein engineering with sequence-based deep representation learning
@article{Alley2019UnifiedRP, title={Unified rational protein engineering with sequence-based deep representation learning}, author={E. C. Alley and Grigory Khimulya and Surojit Biswas and Mohammed AlQuraishi and G. Church}, journal={Nature Methods}, year={2019}, pages={1-8} }
Rational protein engineering requires a holistic understanding of protein function. Here, we apply deep learning to unlabeled amino-acid sequences to distill the fundamental features of a protein into a statistical representation that is semantically rich and structurally, evolutionarily and biophysically grounded. We show that the simplest models built on top of this unified representation (UniRep) are broadly applicable and generalize to unseen regions of sequence space. Our data-driven… Expand
Topics from this paper
95 Citations
Evolutionary context-integrated deep sequence modeling for protein engineering
- Computer Science, Biology
- 2020
- 5
- PDF
Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences
- Biology
- 2019
- 103
- Highly Influenced
- PDF
Deep learning-based prediction of protein structure using learned representations of multiple sequence alignments
- Computer Science, Biology
- 2020
- PDF
Generating functional protein variants with variational autoencoders
- Medicine
- PLoS computational biology
- 2021
- PDF
Unsupervised protein embeddings outperform hand-crafted sequence and structure features at predicting molecular function.
- Computer Science, Medicine
- Bioinformatics
- 2020
Unsupervised protein embeddings outperform hand-crafted sequence and structure features at predicting molecular function
- Biology, Computer Science
- 2020
- 3
Neural networks to learn protein sequence-function relationships from deep mutational scanning data
- Computer Science, Biology
- 2020
- 1
- PDF
Semi-Supervised Learning of Protein Secondary Structure from Single Sequences
- Computer Science
- 2020
- PDF
References
SHOWING 1-10 OF 83 REFERENCES
Deep Recurrent Neural Network for Protein Function Prediction from Sequence
- Biology, Computer Science
- ArXiv
- 2017
- 49
- PDF
Deep Semantic Protein Representation for Annotation, Discovery, and Engineering
- Computer Science, Biology
- 2018
- 11
- PDF
Continuous Distributed Representation of Biological Sequences for Deep Proteomics and Genomics
- Biology, Medicine
- PloS one
- 2015
- 339
- PDF
ProteinNet: a standardized data set for machine learning of protein structure
- Computer Science, Biology
- BMC Bioinformatics
- 2019
- 28
- PDF
Computational protein design: a review.
- Physics, Medicine
- Journal of physics. Condensed matter : an Institute of Physics journal
- 2017
- 30
Navigating the protein fitness landscape with Gaussian processes
- Biology, Computer Science
- Proceedings of the National Academy of Sciences
- 2012
- 126
- PDF