Context-Aware Prediction of Pathogenicity of Missense Mutations Involved in Human Disease

@article{Feinauer2017ContextAwarePO,
  title={Context-Aware Prediction of Pathogenicity of Missense Mutations Involved in Human Disease},
  author={Christoph Feinauer and Martin Weigt},
  journal={bioRxiv},
  year={2017}
}
Amino-acid substitutions are implicated in a wide range of human diseases, many of which are lethal. Distinguishing such mutations from polymorphisms without significant effect on human health is a necessary step in understanding the etiology of such diseases. Computational methods can be used to select interesting mutations within a larger set, to corroborate experimental findings and to elucidate the cause of the deleterious effect. In this work, we show that taking into account the sequence… 
Predicting the impact of Lynch syndrome-causing missense mutations from structural calculations
TLDR
It is demonstrated, for the first time, that saturation mutagenesis, biophysical modeling and co-variation analysis, performed in silico, can predict the abundance, metabolic stability, and function of proteins inside living cells.
Rhapsody: predicting the pathogenicity of human missense variants
TLDR
A new interface (Rhapsody) is presented that enables fully automated assessment of pathogenicity, incorporating both sequence coevolution data and structure- and dynamics-based features, and is shown to outperform well-established and/or advanced prediction tools.
Classifying disease-associated variants using measures of protein activity and stability
TLDR
This work shows how recent developments in both experiments and computational modelling can be used to determine whether a missense variant is likely to be pathogenic and compares two different multiplexed experiments and two computational methods to classify variant effects in PTEN.
Rhapsody: Pathogenicity prediction of human missense variants based on protein sequence, structure and dynamics
TLDR
A significant extension is presented that integrates coevolutionary data from Pfam database and a new interface is introduced (Rhapsody) that enables fully automated assessment of pathogenicity and is shown to outperform well-established and/or advanced prediction tools.
Functional characterization of single amino acid variants
TLDR
PMut2017 is a new general-purpose predictor based on SwissVar and showed improved performance, specially for those proteins that were more commonly misclassified by PMut, and PMut-S, a suite of 215 protein-specific predictors, proved that it is possible to train specific predictors using a unique automated pipeline and match the results of most gene specific predictions released to date.
Towards predictive biophysical and mechanistic models for disease-causing protein variants
TLDR
This work focuses on the role of changes in protein stability as a driver for disease, and how experiments, biophysical models and computation are together providing a framework for understanding and predicting how mutations affect cellular protein stability.
Multiplexed assays reveal effects of missense variants in MSH2 and cancer predisposition
TLDR
This work has shown that sequencing of genomes of missense variants that change protein sequences only understands the phenotypic and clinical consequences of a minute fraction of these, and this lack of knowledge has direct consequences for clinical action.
Toward mechanistic models for genotype–phenotype correlations in phenylketonuria using protein stability calculations
TLDR
It is proposed that many disease‐linked PAH variants are structurally destabilized, which in turn leads to proteasomal degradation and insufficient amounts of cellular PAH protein.
Toward Inferring Potts Models for Phylogenetically Correlated Sequence Data
TLDR
This work proposes an inference scheme that takes the phylogeny of a protein family into account in order to correct biases in estimating the frequencies of amino acids and shows that a Potts model inferred using these corrected frequencies performs better in predicting contacts and fitness effect of mutations.
...
...

References

SHOWING 1-10 OF 46 REFERENCES
Mutation effects predicted from sequence co-variation
TLDR
This work presents EVmutation, an unsupervised statistical method for predicting the effects of mutations that explicitly captures residue dependencies between positions and shows that it outperforms methods that do not account for epistasis.
Performance of mutation pathogenicity prediction methods on missense variants
TLDR
The overall best performing methods in this study were SNPs&GO and MutPred, with accuracies reaching 0.82 and 0.81, respectively.
The Evaluation of Tools Used to Predict the Impact of Missense Variants Is Hindered by Two Types of Circularity
TLDR
It is shown that comparative evaluations of predictors that do not address two types of circularity may erroneously conclude that circularity confounded tools are most accurate among all tools, and may even outperform optimized combinations of tools.
A general framework for estimating the relative pathogenicity of human genetic variants
TLDR
The ability of CADD to prioritize functional, deleterious and pathogenic variants across many functional categories, effect sizes and genetic architectures is unmatched by any current single-annotation method.
Coevolutionary Landscape Inference and the Context-Dependence of Mutations in Beta-Lactamase TEM-1
TLDR
A novel inference scheme for mutational landscapes, which is based on the statistical analysis of large alignments of homologs of the protein of interest, is developed, able to capture epistatic couplings between residues, and therefore to assess the dependence of mutational effects on the sequence context where they appear.
PANTHER-PSEP: predicting disease-causing genetic variants using position-specific evolutionary preservation
TLDR
On standard benchmarks for distinguishing disease-associated from neutral variation in humans, PSEP outperforms not only previous tools that utilize evolutionary conservation, but also several highly used tools that include multiple other sources of information as well.
Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm
TLDR
This protocol describes the use of the 'Sorting Tolerant From Intolerant' (SIFT) algorithm in predicting whether an AAS affects protein function.
Predicting Functional Effect of Human Missense Mutations Using PolyPhen‐2
PolyPhen‐2 (Polymorphism Phenotyping v2), available as software and via a Web server, predicts the possible impact of amino acid substitutions on the stability and function of human proteins using
Analysis of 589,306 genomes identifies individuals resilient to severe Mendelian childhood diseases
TLDR
A comprehensive screen of 874 genes in 589,306 genomes led to the identification of 13 adults harboring mutations for 8 severe Mendelian conditions, with no reported clinical manifestation of the indicated disease.
De novo mutations in human genetic disease
TLDR
Recent findings suggesting that de novo mutations play a prominent part in rare and common forms of neurodevelopmental diseases, including intellectual disability, autism and schizophrenia are discussed.
...
...