Protein superfamilles and domain superfolds

@article{Orengo1994ProteinSA,
  title={Protein superfamilles and domain superfolds},
  author={Christine A. Orengo and David T. Jones and Janet M. Thornton},
  journal={Nature},
  year={1994},
  volume={372},
  pages={631-634}
}
As the protein sequence and structure databases expand rapidly a better understanding of the relationships between proteins is required. A classification is considered that extends the sequence-based superfamilies to include proteins with similar function and three-dimensional structures but no sequence similarity. So far there are only nine protein folds known to recur in proteins having neither sequence nor functional similarity. These folds dominate the structure database, representing more… Expand

Topics from this paper

From protein structure to function.
Several databases of protein structural families now exist-organised according to both evolutionary relationships and common folding arrangements. Although these lag behind sequence databases inExpand
Structural classification of proteins: new superfamilies.
  • A. Murzin
  • Biology, Medicine
  • Current opinion in structural biology
  • 1996
TLDR
Those new superfamilies that include proteins of general interest are reviewed, including Sonic hedgehog, macrophage migration inhibitory factor, nuclear transport factor-2, double stranded RNA binding domain, GroES, the proteasome, new ATP-hydrolyzing ligases and flavoproteins. Expand
Do transmembrane protein superfolds exist?
A reliable and widely used transmembrane protein structure prediction algorithm was applied to five representative genomic sequence data sets in order to re‐examine the hypothesis that in contrast toExpand
The Classification of Protein Domains.
TLDR
This chapter discusses the approaches and methods that are frequently used in the classification of proteins, with a specific emphasis on the classified protein domains and shows how the use of domain family annotations to assign structural and functional information is enhancing the authors' understanding of genomes. Expand
Structure comparison and protein structure classifications
TLDR
This chapter reviews the most popular methods and how they are combined with sequence comparison to recognize protein homologs and approaches to automatic domain boundary assignment algorithms in parallel with the construction of protein structure classifications. Expand
PASS2: an automated database of protein alignments organised as structural superfamilies
BackgroundThe functional selection and three-dimensional structural constraints of proteins in nature often relates to the retention of significant sequence similarity between proteins of similarExpand
A common sequence-associated physicochemical feature for proteins of beta-trefoil family
TLDR
It is observed that their amino acid sequences, despite being considerably divergent from each other, can be accounted for by matching to a repetition of three physicochemically similar segments, consistent with the three-fold pseudo-symmetry in tertiary structures of these proteins. Expand
The CATH protein family database: A resource for structural and functional annotation of genomes
TLDR
Recent developments to the CATH domain database of protein structural families are described which have facilitated genome annotation and which have also revealed important caveats that must be considered when transferring functional data between homologous proteins. Expand
Searching for functional sites in protein structures.
TLDR
This work has shown that function prediction techniques have been applied to the identification of enzyme catalytic triads and DNA-binding motifs using evolutionary trace methods, methods that involve the calculation and assessment of maximal superpositions, and methods based on graph theory. Expand
Genome-wide survey of remote homologues for protein domain superfamilies of known structure reveals unequal distribution across structural classes.
TLDR
The distribution of remote homologues across different classes, folds and superfamilies was studied and reveals that sequences are unequally distributed across structural classes. Expand
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 26 REFERENCES
Identification and classification of protein fold families.
TLDR
Analysis of sequence and structure conservation within the larger families shows the globins to be the most highly conserved family and the TIM barrels the most weakly conserved. Expand
A new approach to protein fold recognition
TLDR
A new approach to fold recognition, whereby sequences are fitted directly onto the backbone coordinates of known protein structures, using a given sequence as a guide for the matching of sequences to backbone coordinates. Expand
From genome sequences to protein function
TLDR
This work has shown that evolutionary relationships can be exploited to predict the function of many other proteins from their amino acid sequence, and the techniques for such predictions are becoming increasingly sophisticated and are now an essential part of genome analysis. Expand
A data bank merging related protein structures and sequences.
TLDR
Wedding the primary and tertiary structural data resulted in an 8-fold increase of data bank sequence entries over those associated with the known three-dimensional architectures alone. Expand
A database of protein structure families with common folding motifs
TLDR
The database makes explicitly visible architectural similarities in the known part of the universe of protein folds and may be useful for understanding protein folding and for extracting structural modules for protein design. Expand
Structural relationships of homologous proteins as a fundamental principle in homology modeling
TLDR
The main goal of the present work is to provide tools for the assessment of accuracy of modeling at a given level of sequence homology, and it is shown that both the topological differences of the protein backbones and the relative positions of corresponding side chains diverge with decreasing sequence identity. Expand
Families and the structural relatedness among globular proteins
  • D. Yee, K. Dill
  • Biology, Medicine
  • Protein science : a publication of the Protein Society
  • 1993
TLDR
It is found that protein families are not tightly knit entities, by using an analogy to distributions of Euclidean distances. Expand
Comparison of conformational characteristics in structurally similar protein pairs
TLDR
This study presents further quantitative evidence that structure is remarkably well conserved in detail, as well as at the topological level, even when the sequences do not show similarity that is significant statistically. Expand
Database of homology‐derived protein structures and the structural meaning of sequence alignment
TLDR
A database of homology‐derived secondary structure of proteins (HSSP) is produced by aligning to each protein of known structure all sequences deemed homologous on the basis of the threshold curve, effectively increasing the number of known protein structures by a factor of five to more than 1800. Expand
Protein structure alignment.
TLDR
A new method of comparing protein structures is described, based on distance plot analysis, which uses the dynamic programming optimization technique, which is widely used in the comparison of protein sequences and thus unifies the techniques of protein structure and sequence comparison. Expand
...
1
2
3
...