Can correct protein models be identified?

  title={Can correct protein models be identified?},
  author={Bj{\"o}rn Wallner and Arne Elofsson},
  journal={Protein Science},
The ability to separate correct models of protein structures from less correct models is of the greatest importance for protein structure prediction methods. Several studies have examined the ability of different types of energy function to detect the native, or native‐like, protein structure from a large set of decoys. In contrast to earlier studies, we examine here the ability to detect models that only show limited structural similarity to the native structure. These correct models are… 

How well can the accuracy of comparative protein structure models be predicted?

A protocol optimized specifically for predicting the RMSD and NO3.5Å errors of a model in the absence of its native structure, which quantifies the error in an absolute sense, thus helping to determine whether or not the model is suitable for intended applications.

Improved prediction of protein model quality

An algorithm to estimate the quality of a predicted protein structure using a consensus approach that uses LGA alignments and aggregation of raw distances and shows that beyond a binary inclusion/exclusion decision, learning from the data a set of weights by which the structures of the different servers can be aggregated can further improve performance.

TR 08-002 Improved prediction of protein model quality

The results show that the use of LGA alignments and aggregation of raw distances is the primary reason for its performance advantage, and that beyond a binary inclusion/exclusion decision, learning from the data a set of weights by which the structures of the different servers can be aggregated can further improve performance.

Improved protein model quality assessments by changing the target function

An updated version of ProQ3D is provided that predicts local and global model quality estimates based on different quality estimates, and finds that the c ntact-based methods are easier to predict and that predictors trained on these measures provide some advantages when it comes to identifying the best model.

A composite score for predicting errors in protein structure models

The most accurate score is based on a combination of the DOPE non‐hydrogen atom statistical potential; surface, contact, and combined statistical potentials from MODPIPE; and two PSIPRED/DSSP scores, which can be applied to select the final model in various modeling problems, including fold assignment, target–template alignment, and loop modeling.

Evaluating the absolute quality of a single protein model using structural features and support vector machines

A model evaluation method to assess the absolute quality of a single protein model using only structural features with support vector machine regression that performs favorably when compared with the other methods used on the same dataset.

Sub-AQUA: real-value quality assessment of protein structure models.

Quality assessment methods are developed, which predict real value of the global and local quality of protein structure models, and a two-step approach, in which the global RMSD predicted in the first step is further combined with the other terms, can dramatically increase the accuracy.

Absolute quality evaluation of protein model structures using statistical potentials with respect to the native and reference states

A standardized measure of protein structure is proposed, called native‐likeness, by interpolating the score of a model structure between the native and reference state scores defined for each protein, with better accuracy than the raw score.

Predicting Local Quality of a Sequence-Structure Alignment

Results indicate that local quality can be predicted well; local sequence evolutionary information (i.e. sequence similarity) is the major factor in predicting local quality; and structural information such as solvent accessibility and secondary structure helps to improve the prediction performance.

Improved estimation of structure predictor quality

The work presented here provides the basis for the construction of a regression model trained on data from existing structure prediction servers, finding that a constrained regression approach shows consistently good performance.



Assessment of protein models with three-dimensional profiles

It is shown that an effective test of the accuracy of a 3D protein model is a comparison of the model to its own amino-acid sequence, using a3D profile, computed from the atomic coordinates of the structure 3D profiles of correct protein structures match their own sequences with high scores, in contrast,3D profiles for protein models known to be wrong score poorly.

A study of quality measures for protein threading models

If the intent is to reproduce the type of scoring done by the manual assessor in in CASP3, the best approach might be to use a combination of alignment independent and alignment dependent measures, as used in several recent studies.

Discrimination of the native from misfolded protein models with an energy function including implicit solvation.

The present results suggest that molecular mechanics (physics-based) energy functions, complemented by a simple model for the solvation free energy, should be tested for use in the inverse folding problem, and supports their use in studies of the effective energy surface of proteins in solution.

An improved pair potential to recognize native protein folds

A novel method to improve a simple pair potential of mean force, derived from experimentally determined protein structures, in such a way that it recognizes native protein folds with high reliability based on the use of mutation data matrices to overcome difficulties arising from the poor statistics of small sample sizes.

Statistical potentials for fold assessment

Four types of a residue‐level statistical potential were optimized, including distance‐dependent, contact, ϕ/Ψ dihedral angle, and accessible surface statistical potentials, providing a basis for an optimal use of Statistical potentials in fold assessment.

MaxSub: an automated measure for the assessment of protein structure prediction quality

This work investigates the suitability of a fully automated, simple, objective, quantitative and reproducible method that can be used in the automatic assessment of models in the upcoming CAFASP2 experiment and concludes that MaxSub is a suitable method for the automatic evaluation of models.

Factors affecting the ability of energy functions to discriminate correct from incorrect folds.

This work examines the factors that seem to be important for accurate discrimination of correct structures in each of the test sets, and notes that extremely simple functions are often as effective as more complex functions.

Identifying native‐like protein structures using physics‐based potentials

The utility of physics‐based force fields in identifying native‐like conformations in a variety of preconstructed structural databases is demonstrated and proteins in this study are shown to be more stable than their misfolded counterparts and readily identified based on energetic as well as dynamic criteria.

On the design and analysis of protein folding potentials

Pairwise interaction models to recognize native folds are designed and analyzed and it is conjecture that it is not possible to find a folding potential, resolved to 1 angstrom, which is a sum of pair interactions.

The complexity and accuracy of discrete state models of protein structure.

An efficient algorithm is developed for the near optimal fitting of arbitrary lattice and off-lattice models of polypeptide chains to their true X-ray structures and it is shown that low complexity models, which take into account the uneven distribution of residue conformations in real proteins, can represent X-Ray structures as accurately as more complex models,which do not.