Predicting protein quaternary structure by pseudo amino acid composition

@article{Chou2003PredictingPQ,
  title={Predicting protein quaternary structure by pseudo amino acid composition},
  author={Kuo-Chen Chou and Yu-Dong Cai},
  journal={Proteins: Structure},
  year={2003},
  volume={53}
}
In the protein universe, many proteins are composed of two or more polypeptide chains, generally referred to as subunits, that associate through noncovalent interactions and, occasionally, disulfide bonds. With the number of protein sequences entering into data banks rapidly increasing, we are confronted with a challenge: how to develop an automated method to identify the quaternary attribute for a new polypeptide chain (i.e., whether it is formed just as a monomer, or as a dimer, trimer, or… 
Prediction of Protein Quaternary Structural Type with Functional Domain and Pseudo Amino Acid Composition
  • X. Xiao, Pu Wang
  • Computer Science
    2009 3rd International Conference on Bioinformatics and Biomedical Engineering
  • 2009
TLDR
A prediction system for protein quaternary structural type is developed in which it is incorporated functional domain composition (FunD) and pseudo-amino acid composition (PseAA) and the overall accuracy achieved is more than 89% in the Jack-knife test.
Predicting the quaternary structure attribute of a protein by hybridizing functional domain composition and pseudo amino acid composition
TLDR
With the avalanche of protein sequences generated in the post-genomic age, it is highly desirable to develop an automated method by which crystallographic scientists can rapidly and effectively identify which quaternary attribute a particular protein chain has according to its sequence information.
Using Chou’s pseudo amino acid composition to predict protein quaternary structure: a sequence-segmented PseAAC approach
TLDR
A sequence-segmented PseAAC is introduced to represent protein samples and multiclass SVM classifier modules were adopted to classify protein quaternary structures, indicating that the new approach is quite promising for predicting protein quatenary structure.
Predicting Protein Quaternary Structure with Multi-scale Energy of Amino Acid Factor Solution Scores and Their Combination
TLDR
The rates of correct identification suggest that the individual primary sequence of an oligomeric protein do contain the information of its quaternary structure, and the results of multi-scale energy of Factor 1 solution scores indirectly prove that biologically relevant complex formation is driven predominantly by the hydrophobic effect.
Application of Function Domain and Pseudo Amino Acid Composition to Predict Hetero-Oligomer Protein Structural Types
  • X. Xiao, Pu Wang
  • Computer Science
    2010 4th International Conference on Bioinformatics and Biomedical Engineering
  • 2010
TLDR
Using machine learning approach, the Fuzzy Nearest Neighbor Algorithm, a prediction system for protein quaternary structural type is developed in which it incorporated functional domain composition (FunD) and pseudo-amino acid composition (PseAA) and the overall accuracy achieved is more than 80% in the Jack-knife test.
Protein Data Condensation for Effective Quaternary Structure Classification
TLDR
This work proposes a protein quaternary structure classification method exploiting the functional domain composition of proteins based on a nearest neighbor condensation technique in order to reduce both the portion of dataset to be stored and the number of comparisons to carry out.
Classification of protein quaternary structure by functional domain composition
TLDR
Compared with the amino acid composition method and Blast, the results indicate that the domain composition approach may be a more effective and promising high-throughput method in dealing with this complicated problem in bioinformatics.
Quat-2L: a web-server for predicting protein quaternary structural attributes
TLDR
A 2-layer predictor called “Quat-2L” was developed for predicting the quaternary structural attribute of a protein according to its sequence information alone, and success rates were derived by the jackknife cross-validation tests on the stringent benchmark data set.
...
...

References

SHOWING 1-10 OF 48 REFERENCES
Prediction of the amount of secondary structure in a globular protein from its aminoacid composition.
TLDR
These relations were tested using 18 proteins of known structure, but omitting the protein to be predicted, using multiple regression to obtain relationships for predicting the amount of secondary structure in a protein molecule from a knowledge of its aminoacid composition.
Understanding the recognition of protein structural classes by amino acid composition
TLDR
The analysis shows that proteins belonging to the four distinct folding classes exhibit significant differences in their distributions of non‐bonded contacts, which more directly explains the success in predicting structural class from amino acid composition.
Prediction of Protein Structural Classes from Amino Acid Compositions
TLDR
Proteins are simple and yet quite complex biological macromolecules, and the link between the simple and the complex in that the protein’s conformation is coded in its amino acid sequence.
Prediction of protein cellular attributes using pseudo‐amino acid composition
TLDR
A remarkable improvement in prediction quality has been observed by using the pseudo‐amino acid composition and its mathematical framework and biochemical implication may also have a notable impact on improving the prediction quality of other protein features.
A key driving force in determination of protein structural classes.
  • K. Chou
  • Biology
    Biochemical and biophysical research communications
  • 1999
TLDR
It was observed that the correct rates in recognizing protein structural classes by U ((2)) are significantly higher than those by U((0)) and U((1)), indicating that an algorithm that can more completely incorporate the interaction contributions will yield better recognition quality, and hence demonstrate that the interaction among the components of amino acid composition is an important driving force in determining the structural class of a protein during the sequence folding process.
Prediction of the Secondary Structure Contents of Globular Proteins Based on Three Structural Classes
TLDR
The new algorithm is an improved multiple linear regression method, taking into account the nonlinear and coupling terms of the frequencies of different amino acids and the length of the protein, with merits of simplicity and convenience for use, as well as high prediction accuracy.
Prediction of protein secondary structure content.
TLDR
An algorithm was developed for predicting the content of protein secondary structure elements that was based on a new amino-acid-composition, in which the sequence coupling effects are explicitly included through a series of conditional probability elements.
Discrimination of intracellular and extracellular proteins using amino acid composition and residue-pair frequencies.
TLDR
Segregation by sequence was found to be a more reliable procedure for distinguishing intra/extracellular proteins than methods using structural class.
Prediction of protein structural classes.
  • K. Chou, C. Zhang
  • Computer Science
    Critical reviews in biochemistry and molecular biology
  • 1995
TLDR
The very high success rate for both the training- set proteins and the testing-set proteins, which has been further validated by a simulated analysis and a jackknife analysis, indicates that it is possible to predict the structural class of a protein according to its amino acid composition if an ideal and complete database can be established.
...
...