Automated assignment of SCOP and CATH protein structure classifications from FSSP scores

@article{Getz2002AutomatedAO,
  title={Automated assignment of SCOP and CATH protein structure classifications from FSSP scores},
  author={Gad Getz and Michele Vendruscolo and David Sachs and Eytan Domany},
  journal={Proteins: Structure},
  year={2002},
  volume={46}
}
We present an automated procedure to assign CATH and SCOP classifications to proteins whose FSSP score is available. CATH classification is assigned down to the topology level, and SCOP classification is assigned to the fold level. Because the FSSP database is updated weekly, this method makes it possible to update CATH and SCOP with the same frequency. Our predictions have a nearly perfect success rate when ambiguous cases are discarded. These ambiguous cases are intrinsic in any protein…
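The abstract does not spell out the assignment rule, so the following is only a minimal sketch of the general idea (score-based assignment with ambiguous cases discarded), not the authors' procedure. It assumes a simple nearest-neighbour rule: a query inherits the class of its best-scoring, already-classified neighbour, and is left unassigned when no neighbour scores high enough or when two different classes score almost equally well. The function name `assign_class` and the parameters `z_min` and `margin`, including their default values, are hypothetical and chosen for illustration only.

```python
from typing import Dict, Optional


def assign_class(z_scores: Dict[str, float],
                 known_classes: Dict[str, str],
                 z_min: float = 2.0,
                 margin: float = 1.0) -> Optional[str]:
    """Return a class label for the query, or None if the case is ambiguous.

    z_scores      -- similarity score of the query against each neighbour
    known_classes -- class label (e.g. a CATH topology or SCOP fold) of
                     neighbours that are already classified
    z_min, margin -- illustrative thresholds (assumptions, not from the paper)
    """
    # Keep the best score observed for each candidate class.
    best_per_class: Dict[str, float] = {}
    for protein, z in z_scores.items():
        label = known_classes.get(protein)
        if label is None or z < z_min:
            continue  # unclassified neighbour or score too weak to use
        if z > best_per_class.get(label, float("-inf")):
            best_per_class[label] = z

    if not best_per_class:
        return None  # no usable neighbour: discard as ambiguous

    # Rank candidate classes by their best score, highest first.
    ranked = sorted(best_per_class.items(), key=lambda kv: kv[1], reverse=True)

    # Two different classes scoring almost equally well: discard as ambiguous.
    if len(ranked) > 1 and ranked[0][1] - ranked[1][1] < margin:
        return None
    return ranked[0][0]


# Hypothetical neighbours labelled with made-up class identifiers.
known = {"a": "3.40.50", "b": "3.40.50", "c": "1.10.10"}
clear_case = assign_class({"a": 12.0, "b": 8.0, "c": 3.0}, known)
close_call = assign_class({"a": 5.0, "c": 4.6}, known)
```

In this sketch, discarding a case simply means returning `None`; raising `margin` or `z_min` trades coverage for a lower risk of wrong assignments.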
SCOPmap: Automated assignment of protein structures to evolutionary superfamilies
TLDR
An algorithm to map protein domains to an existing structural classification scheme has been applied to the SCOP database; it is useful for assigning domains in newly solved structures to appropriate superfamilies and for identifying evolutionary links between different superfamilies.
Assessment of Protein Domain Classifications: SCOP, CATH, Dali and EVEREST
Background: SCOP is a manual classification of protein domain structures. CATH is a classification of protein domain structures created through a combination of manual and automatic methods. The Dali
Towards an automatic classification of protein structural domains based on structural similarity
TLDR
Different clustering procedures give rise to different levels of agreement between automatic and manual protein classifications, and visual examination of hard-to-classify proteins confirms that global structural similarity of domains is not the only criterion used in the SCOP classification.
FSSP to SCOP and CATH (F2CS) Prediction Server
Summary: The F2CS server provides access to the software, F2CS2.00, that implements an automated prediction method of SCOP and CATH classifications of proteins, based on their FSSP Z-scores (Getz et
Automated protein classification using consensus decision
TLDR
A novel technique for automatically generating the SCOP classification of a protein structure with high accuracy by combining the decisions of multiple methods using the consensus of a committee (or an ensemble) classifier is proposed.
F2CS: FSSP to CATH and SCOP prediction server
The F2CS server provides access to the software, F2CS2.00, which implements an automated prediction method of SCOP and CATH classifications of proteins, based on their FSSP Z-scores.
Impact of structure space continuity on protein fold classification
TLDR
A likelihood method is developed to classify a domain into the existing folds of CATH or SCOP using both query-fold structure similarities and within-fold structure heterogeneities, which indicates that the importance of considering structure space continuity in fold classification depends on the specific question asked.
3-State Protein Secondary Structure Prediction based on SCOPe Classes
Improving the accuracy of protein secondary structure prediction has been an important task in bioinformatics since it is not only the starting point in obtaining tertiary structure in hierarchical
Development of a New Protein Classification Scheme
TLDR
The analysis of the existing protein structure classification databases reveals that there are consistencies among major databases such as CATH, SCOP, and FSSP, and new techniques for analysis and classification of proteins need to be tested for consistency before adoption.

References

SHOWING 1-10 OF 53 REFERENCES
A systematic comparison of protein structure classifications: SCOP, CATH and FSSP.
TLDR
A systematic comparison of these databases has been carried out to determine their overall agreement in classifying protein structures, and information is extracted from a consensus database encompassing agreements between SCOP, CATH and FSSP.
CATH--a hierarchic classification of protein domain structures.
TLDR
Analysis of the structural families generated by CATH reveals the prominent features of protein structure space and a database of well-characterised protein structure families will facilitate the assignment of structure-function/evolution relationships to both known and newly determined protein structures.
Dali/FSSP classification of three-dimensional protein folds
TLDR
The FSSP database presents a continuously updated structural classification of three-dimensional protein folds that define useful test sets and a standard of truth for assessing the correctness of sequence-sequence or sequence-structure alignments.
SCOP: a structural classification of proteins database
TLDR
The Structural Classification of Proteins (SCOP) database provides a detailed and comprehensive description of the relationships of known protein structures that provide the basis of the ASTRAL sequence libraries that can be used as a source of data to calibrate sequence search algorithms and for the generation of statistics on, or selections of, protein structures.
Contemporary approaches to protein structure classification
TLDR
The most popular structure comparison methods are summarized, their use for database searching is shown, and some of the most advanced attempts to develop comprehensive protein structure classifications are described.
The FSSP database of structurally aligned protein fold families.
TLDR
The FSSP database currently contains an extended structural family for each of 330 representative protein chains, and all such comparisons are based purely on the 3D co-ordinates of the proteins and are derived by automatic structure comparison programs.
The ASTRAL compendium for protein structure and sequence analysis
TLDR
The ASTRAL compendium provides several databases and tools to aid in the analysis of protein structures, particularly through the use of their sequences, and summarizes the overall characteristics of a protein structure.
A fully automatic evolutionary classification of protein folds: Dali Domain Dictionary version 3
TLDR
The Dali Domain Dictionary is extended to match the traditional four hierarchical levels corresponding to: (i) supersecondary structural motifs (attractors in fold space), (ii) the topology of globular domains (fold types), (iii) remote homologues (functional families) and (iv) homologues with sequence identity above 25% (sequence families).
Towards a covering set of protein family profiles.
  • A. Heger, L. Holm
  • Biology, Medicine
    Progress in biophysics and molecular biology
  • 2000
TLDR
Emerging concepts and caveats of global sequence clustering are reviewed and explicit multiple alignments coupled with neighbourhood analysis lead to domain segmentation, and hierarchical unification helps to resolve conflicts and validate clusters.
The CATH Dictionary of Homologous Superfamilies (DHS): a consensus approach for identifying distant structural homologues.
TLDR
The value of the CATH Dictionary of Homologous Superfamilies, for assessing functional variability and identifying distant evolutionary relationships, is illustrated using the pyridoxal-5'-phosphate (PLP) binding aspartate aminotransferase superfamily.