MultiLoc: prediction of protein subcellular localization using N-terminal targeting sequences, sequence motifs and amino acid composition

@article{Hglund2006MultiLocPO,
  title={MultiLoc: prediction of protein subcellular localization using N-terminal targeting sequences, sequence motifs and amino acid composition},
  author={Annette H{\"o}glund and Pierre D{\"o}nnes and Torsten Blum and Hans Werner Adolph and Oliver Kohlbacher},
  journal={Bioinformatics},
  year={2006},
  volume={22 10},
  pages={
          1158-65
        }
}
MOTIVATION Functional annotation of unknown proteins is a major goal in proteomics. A key annotation is the prediction of a protein's subcellular localization. Numerous prediction techniques have been developed, typically focusing on a single underlying biological aspect or predicting a subset of all possible localizations. An important step is taken towards emulating the protein sorting process by capturing and bringing together biologically relevant information, and addressing the clear need… 

Figures and Tables from this paper

Protein subcellular localization prediction based on compartment-specific features and structure conservation
TLDR
The overall accuracy of combining the structural homology approach is further improved, which suggests that structural conservation could be a useful indicator for inferring localization in addition to sequence homology.
MultiLoc2: integrating phylogeny and Gene Ontology terms improves subcellular protein localization prediction
TLDR
MultiLoc2 is an extensive high-performance subcellular protein localization prediction system that outperforms other prediction systems in two benchmarks studies and yields higher accuracies compared to its previous version.
MultiLoc 2 and SherLoc 2 : improved prediction of subcellular protein localization
TLDR
This work significantly extends and improves upon previous high-accuracy location prediction methods and provides a low-resolution version of MultiLoc2, which is specialized on discriminating globular proteins from secreted proteins and, consequently, predicts only up to five key locations.
EuLoc: a web-server for accurately predict protein subcellular localization in eukaryotes by incorporating various features of sequence segments into the general form of Chou’s PseAAC
TLDR
This work develops a hybrid method for computationally predicting the subcellular localization of eukaryotic protein that incorporates the Hidden Markov Model (HMM) method, homology search approach and the support vector machines (SVM) method by fusing several new features into Chou’s pseudo-amino acid composition.
A novel approach for protein subcellular location prediction using amino acid exposure
TLDR
A new algorithm for subcellular location prediction that uses residue exposure signatures and a novel approach to address the multiclass classification problem is developed.
DeepLoc: prediction of protein subcellular localization using deep learning
TLDR
This work presents a prediction algorithm using deep neural networks to predict protein subcellular localization relying only on sequence information, outperforming current state‐of‐the‐art algorithms, including those relying on homology information.
Plant-mSubP: a computational framework for the prediction of single- and multi-target protein subcellular localization using integrated machine-learning approaches
TLDR
The results indicate that the proposed approach is comparable to the existing methods in single localization prediction and outperforms all other existing tools when compared for dual-label proteins.
Subcellular Localization Prediction through Boosting Association Rules
TLDR
The approach does not use biological knowledge such as the sorting signals or homologues, but use just protein sequence information, and divides a protein sequence into short k-mer sequence fragments which can be mapped to word features in document classification.
A Comparative Study on Feature Extraction from Protein Sequences for Subcellular Localization Prediction
TLDR
Comparisons are conducted among amino acid composition approach, amino acid tuple approach, voting scheme, and a new characteristic representation of proteins proposed in this paper, and it is demonstrated that there are considerable potentials on improving prediction accuracy by exploiting protein sequences.
SherLoc: high-accuracy prediction of protein subcellular localization by integrating text and protein sequence data
TLDR
SherLoc is introduced, a new comprehensive system for predicting the localization of eukaryotic proteins that integrates several types of sequence and text-based features and selects its text sources and features, and integrates those with sequence- based features.
...
...

References

SHOWING 1-10 OF 58 REFERENCES
Support vector machine approach for protein subcellular localization prediction
TLDR
Support Vector Machine has been introduced to predict the subcellular localization of proteins from their amino acid compositions and can be a complementary method to other existing methods based on sorting signals.
ESLpred: SVM-based method for subcellular localization of eukaryotic proteins using dipeptide composition and PSI-BLAST
TLDR
Support vector machine (SVM) has been used to predict the subcellular location of eukaryotic proteins from their different features such as amino acid composition, dipeptide composition and physico-chemical properties, and an online web server ESLpred was developed.
Predicting subcellular localization via protein motif co-occurrence.
TLDR
A Bayesian network localization predictor called PSLT that is based on the combinatorial presence of InterPro motifs and specific membrane domains in human proteins generates a likelihood of localization to all organelles and allows to predict multicompartmental proteins.
Mimicking cellular sorting improves prediction of subcellular localization.
Predicting subcellular localization of proteins based on their N-terminal amino acid sequence.
TLDR
A neural network-based tool, TargetP, for large-scale subcellular location prediction of newly identified proteins has been developed and it is estimated that 10% of all plant proteins are mitochondrial and 14% chloroplastic, and that the abundance of secretory proteins, in both Arabidopsis and Homo, is around 10%.
Predicting subcellular localization of proteins by hybridizing functional domain composition and pseudo‐amino acid composition
TLDR
The Intimate Sorting Algorithm (ISort predictor) was developed for predicting the protein subcellular location by hybridizing the functional domain composition and the pseudo‐amino acid composition and it is confirmed that such a hybrid approach may become a very useful high‐throughput tool in the area of bioinformatics, proteomics, as well as molecular cell biology.
Prediction of protein subcellular locations by support vector machines using compositions of amino acids and amino acid pairs
TLDR
A method based on support vector machines to predict the subcellular location of a given protein based on its amino acid, amino acid pair, and gapped amino acids pair compositions showed an improvement in prediction accuracy over the algorithm based on the amino acid composition only.
Prediction of protein subcellular locations using fuzzy k-NN method
TLDR
Fuzzy k-nearest neighbors (k-NN) algorithm has been introduced to predict proteins' subcellular locations from their dipeptide composition and the overall predictive accuracy about 80% has been achieved in a jackknife test.
Predicting 22 protein localizations in budding yeast.
  • Yu-Dong CaiK. Chou
  • Computer Science
    Biochemical and biophysical research communications
  • 2004
...
...