Predicting protein function from sequence and structure

@article{Lee2007PredictingPF,
  title={Predicting protein function from sequence and structure},
  author={David A. Lee and Oliver Redfern and Christine A. Orengo},
  journal={Nature Reviews Molecular Cell Biology},
  year={2007},
  volume={8},
  pages={995-1005}
}
While the number of sequenced genomes continues to grow, experimentally verified functional annotation of whole genomes remains patchy. Structural genomics projects are yielding many protein structures that have unknown function. Nevertheless, subsequent experimental investigation is costly and time-consuming, which makes computational methods for predicting protein function very attractive. There is an increasing number of noteworthy methods for predicting protein function from sequence and… 

Figures and Topics from this paper

Current challenges in genome annotation through structural biology and bioinformatics.
TLDR
A central component to this process is to use experimentally determined structures, which provide a means to detect homology that is not discernable from just the sequence and permit the consequences of genomic variation to be realized at the molecular level.
Inferring Protein Function from Structure
A major goal of molecular biology is to understand functions of all genes in nature. Accordingly, it is of great importance to improve large functional genomics and proteomics experiments. However,
Biochemical functional predictions for protein structures of unknown or uncertain function
TLDR
The efforts to address the challenge of annotating the function at the molecular level of uncharacterized proteins by evaluating and testing the different approaches to predicting the function of proteins of unknown function are reviewed.
Structure-based Methods for Computational Protein Functional Site Prediction
  • B. K. Dukka
  • Computer Science, Medicine
    Computational and structural biotechnology journal
  • 2013
TLDR
A number of methods to predict protein functional site especially focusing on protein ligand binding site prediction are discussed, including structural based approaches and sequence-and-structure based approaches for protein functional sites.
Exploiting Protein Structures to Predict Protein Functions
TLDR
This chapter discusses a range of methods which aim to recognise enzyme active sites and predict protein-ligand interactions, and focuses on algorithms developed as part of the CATH database of structural domains.
Exploring the structure and function paradigm.
TLDR
The meaning of structural novelty is reassessed and its relevance to the complexity of the structure-function paradigm is examined, as well as new sequence-based methods for partitioning large, diverse superfamilies into biologically meaningful clusters.
Analysis of protein function and its prediction from amino acid sequence
TLDR
It is found that the transfer of GO terms by pairwise sequence alignments is only moderately accurate, showing a surprisingly small influence of sequence identity (SID) in a broad range (30–100%).
Protein Function Prediction.
TLDR
Current strategies used to predict protein function focused on high-throughput sequence analysis, as for example, inference based on sequence similarity, sequence composition, structure, and protein-protein interaction are explored.
A Survey of Computational Methods for Protein Function Prediction
TLDR
These methods and the state of protein function prediction are reviewed, emphasizing recent algorithmic developments, remaining challenges, and prospects for future research.
A large-scale evaluation of computational protein function prediction
TLDR
Today's best protein function prediction algorithms substantially outperform widely used first-generation methods, with large gains on all types of targets, and there is considerable need for improvement of currently available tools.
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 126 REFERENCES
Predicting function: from genes to genomes and back.
TLDR
This review focuses on the added value that is provided by completely sequenced genomes in function prediction, and various levels of sequence annotation and function prediction are discussed, ranging from genomic sequence to that of complex cellular processes.
Predicting protein function from sequence and structural data.
TLDR
Several automated servers that integrate evidence from multiple sources have been released this year and particular improvements have been seen with methods utilizing the Gene Ontology functional annotation schema.
Computational protein function prediction: Are we making progress?
TLDR
It is concluded that the task is far from complete and that a critical assessment of the performance of function prediction programs is necessary to make true progress in computational function prediction.
Protein surface analysis for function annotation in high‐throughput structural genomics pipeline
TLDR
A novel method, named pvSOAR (pocket and void Surface of Amino Acid Residues), of comparing the protein surfaces of geometrically defined pockets and voids was developed and was able to detect previously unrecognized and novel functional relationships between surface features of proteins.
Prediction of human protein function from post-translational modifications and localization features.
We have developed an entirely sequence-based method that identifies and integrates relevant features that can be used to assign proteins of unknown function to functional classes, and enzyme
Effective function annotation through catalytic residue conservation.
TLDR
This work introduces a computational approach to accurately identify and segregate related proteins into those with a functional similarity and those where function differs, and should find a wide range of applications, including the interpretation of genomics/proteomics data and the prioritization of targets for high-throughput structure determination.
Assigning protein functions by comparative genome analysis: protein phylogenetic profiles.
TLDR
It is shown that proteins having matching or similar profiles strongly tend to be functionally linked, and this method of phylogenetic profiling allows us to predict the function of uncharacterized proteins.
Intrinsic errors in genome annotation.
TLDR
The number of potential errors in the prediction of detailed functions is higher than is usually believed in the annotation of microbial genomes.
Prediction of active sites for protein structures from computed chemical properties
TLDR
The power of theoretical microscopic titration curves, when coupled with a statistical analysis, provides a method for high-throughput screening for identification of catalytic sites and binding sites with high accuracy and precision.
Detecting protein function and protein-protein interactions from genome sequences.
TLDR
Searching sequences from many genomes revealed 6809 putative protein-protein interactions in Escherichia coli and 45,502 in yeast, and many members of these pairs were confirmed as functionally related; computational filtering further enriches for interactions.
...
1
2
3
4
5
...