PconsC4: fast, free, easy, and accurate contact predictions

Mirco Michel, David Ménendez Hurtado, Arne Elofsson
Motivation: Residue contact prediction was revolutionized recently by the introduction of direct coupling analysis (DCA). Further improvements, in particular for small families, have been obtained by the combination of DCA and deep learning methods. However, existing deep learning contact prediction methods often rely on a number of external programs and are therefore computationally expensive.

Results: Here, we introduce a novel contact predictor, PconsC4, which performs on par with state of the…
4 Citations

FilterDCA: Interpretable supervised contact prediction using inter-domain coevolution

FilterDCA is introduced, a simple, transparent and therefore fully interpretable inter-domain contact predictor, which uses the results of coevolutionary Direct Coupling Analysis in combination with explicitly constructed filters reflecting typical contact patterns in a training set of known protein structures, and which improves the accuracy of predicted contacts significantly.

DEEPCON: Protein Contact Prediction using Dilated Convolutional Neural Networks with Dropout

The proposed ConvNet architectures predict contacts with significantly more precision than the architectures used in several state-of-the-art methods.

DeepMSA: constructing deep multiple sequence alignment to improve contact prediction and fold-recognition for distant-homology proteins

DeepMSA is a new open-source method for sensitive MSA construction that draws homologous sequences and alignments from multiple whole-genome and metagenome databases through complementary hidden Markov model (HMM) algorithms.

Homology modeling in the time of collective and artificial intelligence

References

Improved Contact Predictions Using the Recognition of Protein Like Contact Patterns

PconsC2 is a novel method that uses a deep learning approach to identify protein-like contact patterns and thereby improve contact predictions; it is reported to be superior both to earlier methods based on statistical inference and to state-of-the-art methods using machine learning.

Large-scale structure prediction by improved contact predictions and model quality assessment

The PconsFold2 pipeline, which uses contact predictions from PconsC3, the CONFOLD folding algorithm and model quality estimations to predict the structure of a protein, is presented, and it is shown that the model quality estimation significantly increases the number of models that can be reliably identified.

Predicting accurate contacts in thousands of Pfam domain families using PconsC3

PconsC3, a fast and improved method for protein contact predictions that can be used for families with even 100 effective sequence members, significantly outperforms direct coupling analysis (DCA) methods, independent of family size, secondary structure content, contact range, or the number of selected contacts.

DNCON2: improved protein contact prediction using two-level deep convolutional neural networks

The improved performance of DNCON2 is attributed to the inclusion of short- and medium-range contacts in training, a two-level approach to prediction, the use of state-of-the-art optimization and activation functions, and a novel deep learning architecture that allows each filter in a convolutional layer to access all the input features of a protein of arbitrary length.

Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model

A new deep learning method that predicts contacts by integrating both evolutionary coupling (EC) and sequence conservation information through an ultra-deep neural network formed by two deep residual neural networks that greatly outperforms existing methods and leads to much more accurate contact-assisted folding.

Mutual information without the influence of phylogeny or entropy dramatically improves residue contact prediction

A rapid, simple and general method based on information theory that accurately estimates the level of background mutual information for each pair of positions in a given protein family, and correctly identifies substantially more coevolving positions in protein families than any existing method.

Direct-coupling analysis of residue coevolution captures native contacts across many protein families

The findings suggest that contacts predicted by DCA can be used as a reliable guide to facilitate computational predictions of alternative protein conformations, protein complex formation, and even the de novo prediction of protein domain structures, contingent on the existence of a large number of homologous sequences which are being rapidly made available due to advances in genome sequencing.

Deep transfer learning in the assessment of the quality of protein models

A deep neural network architecture is introduced to predict model quality using significantly fewer input features than state-of-the-art methods and the possibility of applying transfer learning on databases of known protein structures is shown.

Fast and Accurate Multivariate Gaussian Modeling of Protein Families: Predicting Residue Contacts and Protein-Interaction Partners

The quality of inference is comparable or superior to that achieved by mean-field approximations to inference with discrete variables, as done by direct-coupling analysis for the prediction of residue-residue contacts in proteins and the identification of protein-protein interaction partners in bacterial signal transduction.

Pythran: enabling static optimization of scientific Python programs

Pythran is an open source static compiler that turns modules written in a subset of the Python language into native ones that take advantage of modern C++11 features such as variadic templates, type inference, move semantics and perfect forwarding, as well as classical idioms such as expression templates.