Predicting Protein Subcellular Localization: Past, Present, and Future

  title={Predicting Protein Subcellular Localization: Past, Present, and Future},
  author={Pierre D{\"o}nnes and Annette H{\"o}glund},
  journal={Genomics, Proteomics \& Bioinformatics},
  pages={209 - 215}

Tables from this paper

'Unite and conquer': enhanced prediction of protein subcellular localization by integrating multiple specialized tools
An approach is developed that enhances the prediction accuracy of mitochondrial proteins by uniting the strength of specialized tools by combining machine-learning based integration with biological expert knowledge, which alleviates the conundrum of how to choose between conflicting predictions.
Prediction of subcellular locations of proteins: Where to proceed?
This review of prediction algorithms for the subcellular localization sites of input protein sequences focuses on the biological grounds of the prediction methods rather than the algorithmic issues because it believes the former will be more fruitful for future development.
Advance in predicting subcellular localization of multi-label proteins and its implication for developing multi-target drugs.
  • K. Chou
  • Biology
    Current medicinal chemistry
  • 2019
Focused in this review are those methods, which have the capacity to deal with multi-label proteins that may simultaneously exist in two or more subcellular location sites, which are vitally important for finding multi-target drugs, a current hot trend in drug development.
Using N-terminal targeting sequences, amino acid composition, and sequence motifs for predicting protein subcellular localizations
This work presents a novel Support Vector Machine (SVM)-based approach for predicting protein subcellular localization, which integrates information about N-terminal targeting sequences, amino acid composition, and protein sequence motifs and shows better or comparable results to methods that are specialized on fewer localizations or for one organism.
Computational prediction of subcellular localization.
This chapter discusses issues of prediction of protein localization sites, and closes with an example illustrating the use of the WoLF PSORT web server for localization prediction.
Protein subcellular localization prediction using artificial intelligence technology.
  • R. NairB. Rost
  • Biology, Computer Science
    Methods in molecular biology
  • 2008
This chapter reviews some of the most important methods for the prediction of subcellular localization, and indicates that these methods are beginning to play an important role in directing experimental research.
Proteomics methods for subcellular proteome analysis
The elucidation of the subcellular distribution of proteins under different conditions is a major challenge in cell biology. This challenge is further complicated by the multicompartmental and
Protein-protein interaction as a predictor of subcellular location
This work evaluates the potential of using PPI annotation in determining subcellular location (SCL) of proteins in human, mouse, fly and yeast, and develops an approach to predict the SCL of a protein based on the S CL of its interaction partners, given sufficient confidence in the interaction itself.
GP4: an integrated Gram-Positive Protein Prediction Pipeline for subcellular localization mimicking bacterial sorting
GP4 is presented, a protein subcellular localization meta-predictor mainly for Firmicutes, but also Actinobacteria, based on the combination of multiple tools, each specific for different sorting signals and compartments, with improved cell-wall protein prediction.
MultiLoc: prediction of protein subcellular localization using N-terminal targeting sequences, sequence motifs and amino acid composition
This work presents a novel SVM-based approach for predicting subcellular localization, which integrates N-terminal targeting sequences, amino acid composition and protein sequence motifs, and shows better or comparable results to methods that are specialized on fewer localizations or for one organism.


Predicting subcellular localization via protein motif co-occurrence.
A Bayesian network localization predictor called PSLT that is based on the combinatorial presence of InterPro motifs and specific membrane domains in human proteins generates a likelihood of localization to all organelles and allows to predict multicompartmental proteins.
Support vector machine approach for protein subcellular localization prediction
Support Vector Machine has been introduced to predict the subcellular localization of proteins from their amino acid compositions and can be a complementary method to other existing methods based on sorting signals.
Predicting subcellular localization of proteins based on their N-terminal amino acid sequence.
A neural network-based tool, TargetP, for large-scale subcellular location prediction of newly identified proteins has been developed and it is estimated that 10% of all plant proteins are mitochondrial and 14% chloroplastic, and that the abundance of secretory proteins, in both Arabidopsis and Homo, is around 10%.
Predicting protein cellular localization using a domain projection method.
The co-occurrence of domain families in eukaryotic proteins is investigated to predict protein cellular localization and this method is complementary to approaches that use amino-acid composition or identify sorting sequences; these methods may be combined to further enhance prediction accuracy.
Predicting subcellular localization of proteins using machine-learned classifiers
Five machine-learning classifiers for predicting subcellular localization of proteins from animals, plants, fungi, Gram-negative bacteria and Gram-positive bacteria are constructed, which are the most accurate sub cellular predictors across the widest set of organisms ever published.
Predicting subcellular localization of proteins for Gram‐negative bacteria by support vector machines based on n‐peptide compositions
This method uses the support vector machines trained by multiple feature vectors based on n‐peptide compositions to predict subcellular localization for Gram‐negative bacteria, and achieves the highest prediction rate ever reported.
Prediction of protein subcellular locations using fuzzy k-NN method
Fuzzy k-nearest neighbors (k-NN) algorithm has been introduced to predict proteins' subcellular locations from their dipeptide composition and the overall predictive accuracy about 80% has been achieved in a jackknife test.
ESLpred: SVM-based method for subcellular localization of eukaryotic proteins using dipeptide composition and PSI-BLAST
Support vector machine (SVM) has been used to predict the subcellular location of eukaryotic proteins from their different features such as amino acid composition, dipeptide composition and physico-chemical properties, and an online web server ESLpred was developed.
Prediction of protein subcellular locations by support vector machines using compositions of amino acids and amino acid pairs
A method based on support vector machines to predict the subcellular location of a given protein based on its amino acid, amino acid pair, and gapped amino acids pair compositions showed an improvement in prediction accuracy over the algorithm based on the amino acid composition only.
Localizing proteins in the cell from their phylogenetic profiles.
Application of a computational method for identifying subcellular locations of proteins from the phylogenetic distribution of the homologs of organellar proteins reveals that nucleus-encoded proteins previously known to be destined for mitochondria fall into three groups: prokaryote-derived, eukaryote -derived, and organism-specific (i.e., found only in the organism under study).