Protein local 3D structure prediction by Super Granule Support Vector Machines (Super GSVM)


Understanding the relationship between the protein sequence and the 3D structure is a major research area in bioinformatics. The prediction of complete protein tertiary structure based only on sequence information is still an impractical work. This paper aims at revealing the hidden knowledge of the sequence motifs and the local tertiary structure. In this paper, we propose a Super Granule Support Vector Machine (Super GSVM) model to obtain the high quality protein sequence motifs and to predict local tertiary structure information based on purely sequence information. The proposed model overcomes the innate shortcoming of using the SVM on such a large data set, which is the inherent computational complexity involved in training support vectors for huge datasets including half million of samples. The satisfactory prediction results show the Super GSVM model generates decent protein sequence clusters and has the ability to capture the hidden sequence-to-structure information. This model also has a strong potential in the application of SVMs on other research areas with huge datasets.

DOI: 10.1186/1471-2105-10-S11-S15

Extracted Key Phrases

5 Figures and Tables

Citations per Year

208 Citations

Semantic Scholar estimates that this publication has 208 citations based on the available data.

See our FAQ for additional information.

Cite this paper

@inproceedings{Chen2009ProteinL3, title={Protein local 3D structure prediction by Super Granule Support Vector Machines (Super GSVM)}, author={Bernard Chen and Matthew Johnson}, booktitle={BMC Bioinformatics}, year={2009} }