Prediction of protein function from protein sequence and structure

@article{Whisstock2003PredictionOP,
  title={Prediction of protein function from protein sequence and structure},
  author={James C. Whisstock and Arthur M. Lesk},
  journal={Quarterly Reviews of Biophysics},
  year={2003},
  volume={36},
  pages={307 - 340}
}
1. Introduction 308 2. Plan of this article 312 3. Natural mechanisms of development of novel protein functions 313 3.1 Divergence 313 3.2 Recruitment 316 3.3 ‘Mixing and matching’ of domains, including duplication/oligomerization, and domain swapping or fusion 316 4. Classification schemes for protein functions 317 4.1 General schemes 317 4.2 The EC classification 318 4.3 Combined classification schemes 319 4.4 The Gene Ontology Consortium 321 5. Methods for assigning protein function 321 5.1… 
Mining Conserved Structures of Enzymes from Functional Hierarchical Classification
TLDR
This study chooses alternative to figure out function-related local structure without using protein binding site information of protein-ligand complexes, and proposes neighborhood residue sphere (NRS), which is a sphere with 10 A radius of each residue with the combination of sequence and spatial information.
Inference of protein function from protein structure.
TLDR
Pro know extracts features from the protein such as 3D fold, sequence, motif, and functional linkages and relates them to function via the ProKnow knowledgebase of features, which links features to annotated functions via annotation profiles, which suggests that ProKnow is a useful resource in functional assessments of novel proteins.
Classification of Intrinsically Disordered Regions and Proteins
TLDR
Characterization of unannotated and uncharacterized protein segments is expected to lead to the discovery of novel functions as well as provide important insights into existing biological processes and is likely to shed new light on molecular mechanisms of diseases that are not yet fully understood.
Protein Function Prediction Using Motifs , Sequence Features , Alignment Scores and Classifier Combination
Protein function prediction [1] is one of the most important problems in bioinformatics. Advances in solutions of this problem could lead to better understanding of how some diseases occur and how to
Prediction of enzyme function based on 3D templates of evolutionarily important amino acids
TLDR
The data suggest that knowledge of evolutionarily important residues improves functional annotation among distant enzyme homologs, and the ETA method should prove a useful, large scale, and general adjunct to combine with other methods to decipher protein function in the structural proteome.
Quantitative sequence-function relationships in proteins based on gene ontology
TLDR
The results show that the proportion of divergent functions decreases dramatically above a threshold of sequence similarity at about 50% residue identity, which describes general features of the evolution of protein function, and serves as a guide to the reliability of annotation transfer.
Predicting functional family of novel enzymes irrespective of sequence similarity: a statistical learning approach.
TLDR
Here SVM is tested for functional family assignment of two groups of enzymes, suggesting that it is potentially useful for facilitating functional study of novel proteins.
Automated functional classification of experimental and predicted protein structures
TLDR
The results indicate that the FSSA algorithm has better accuracy when compared to homology-based approaches for functional classification of both experimental and predicted protein structures, in part due to its use of local, as opposed to global, information for classifying function.
Protein structure and function prediction using kernel methods
Experimental methods to determine the structure and function of proteins have not been able to keep up with the high-throughput sequencing technologies. As a result, there is an over abundance of
Quality assessment of protein model-structures based on structural and functional similarities
TLDR
GOBA - Gene Ontology-Based Assessment is a novel Protein Model Quality Assessment Program that estimates the compatibility between a model-structure and its expected function and is able to discriminate between good and bad model-structures.
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 159 REFERENCES
Protein folds and families: sequence and structure alignments
TLDR
Dali and HSSP are derived databases organizing protein space in the structurally known regions that provide a map of the currently known regions of the protein universe that is useful for the analysis of folding principles, for the evolutionary unification of protein families and for maximizing the information return from experimental structure determination.
The CATH database: an extended protein family resource for structural and functional genomics
TLDR
The philosophy behind CATHEDRAL is the recognition of recurrent folds already classified in CATH, which will help to increase the frequency of CATH updates and adaptation of a recently developed method for rapid structure comparison for domain boundary assignment.
Automated structure-based prediction of functional sites in proteins: applications to assessing the validity of inheriting protein function from homology in genome annotation and to protein docking.
TLDR
It is shown that one can employ a strategy that uses a structure-based prediction of protein functional sites to assess the reliability of functional inheritance and automated and benchmarked a method based on the evolutionary trace approach.
Identification of homology in protein structure classification
TLDR
The discrimination between analogy and homology of close structural neighbors will lead to functional predictions while avoiding overprediction, allowing fast, self-consistent and complete classification of large numbers of protein structures.
Analysis and prediction of functional sub-types from protein sequence alignments.
TLDR
A method for analysis and prediction of functional sub-types from multiple protein sequence alignments and describes the derivation of a set of sub-type groupings derived from an automated parsing of alignments from PFAM and the SWISSPROT database.
Practical limits of function prediction
TLDR
A critical view of the theoretical and practical bases for the practice of assigning a potential function to a protein on the basis of sequence similarity to proteins whose function has been experimentally investigated is presented.
Protein folds, functions and evolution.
TLDR
The evolution of proteins and their functions is reviewed from a structural perspective in the light of the current database, finding that the number of new topologies is still increasing, although 25 new structures are now determined for each new topology.
Structural similarity to link sequence space: New potential superfamilies and implications for structural genomics
TLDR
This work uses the Structural Classification of Proteins (SCOP) database to link sequence alignments from the SMART and Pfam databases, and extends the method of Murzin (1993b) to assign statistical significance to sequence identities found after structural alignment and thus suggests the best link between diverse sequence families.
The SBASE protein domain library, release 9.0: an online resource for protein domain identification
TLDR
The ninth release of the SBASE library of protein domain sequences contains 320 000 annotated structural, functional, ligand-binding and topogenic segments of proteins clustered into over 3481 domain groups and 483 protein families.
PIR-ALN: a database of protein sequence alignments
TLDR
PIR-ALN includes 529 alignments that can be used to develop patterns not represented in PROSITE, Blocks, PRINTS and Pfam databases and can beUsed to standardize and spread annotation to newer members by homology, as well as to understand the modular architecture of multidomain proteins.
...
1
2
3
4
5
...