LOCATE: a mammalian protein subcellular localization database

  title={LOCATE: a mammalian protein subcellular localization database},
  author={Josefine Sprenger and J. Lynn Fink and Seetha Karunaratne and Kelly Hanson and Nicholas A. Hamilton and Rohan D. Teasdale},
  journal={Nucleic Acids Research},
  pages={D230 - D233}
LOCATE is a curated, web-accessible database that houses data describing the membrane organization and subcellular localization of mouse and human proteins. Over the past 2 years, the data in LOCATE have grown substantially. The database now contains high-quality localization data for 20% of the mouse proteome and general localization annotation for nearly 36% of the mouse proteome. The proteome annotated in LOCATE is from the RIKEN FANTOM Consortium Isoform Protein Sequence sets which contains… 

Figures and Tables from this paper

COMPARTMENTS: unification and visualization of protein subcellular localization evidence
The COMPARTMENTS resource is presented, which integrates all sources listed above as well as the results of automatic text mining, and all localization evidence is mapped onto common protein identifiers and Gene Ontology terms.
LocSigDB: a database of protein localization signals
LocSigDB is the most comprehensive compendium of protein localization signals for eight distinct subcellular locations and is linked to the proteins in UniProt database along with the organism information that contain the same amino acid pattern as the given signal.
MetazSecKB: the human and animal secretome and subcellular proteome knowledgebase
The Gene Ontology and protein family domain analysis of human secreted proteins revealed that these proteins play important roles in regulation of human structure development, signal transduction, immune systems and many other biological processes.
Network analysis of human protein location
The findings indicate that the metabolic network adds value to the information in the PPI network for the localisation process of proteins in human subcellular compartments, as the MLPI network has evolved to maintain high substrate specificity for proteins.
LocDB: experimental annotations of localization for Homo sapiens and Arabidopsis thaliana
Over 40% of the proteins in LocDB have multiple localization annotations providing a better platform for development of new multiple localization prediction methods with higher coverage and accuracy.
Bioimage-based protein subcellular location prediction: a comprehensive review
This paper systematically reviewed the recent progresses in the field of automated image-based protein sub cellular location prediction, and classified them into four categories including growing of bioimage databases, description of subcellular location distribution patterns, classification methods, and applications of the prediction systems.
The integrated latest database and three techniques were employed for studying the subcellular localization which documents the increase in the accuracy of the prediction, by 87.711 % with J48, 81.67% with random forest, and 88.125% with BF Tree based on the features discussed by comparing the techniques over others.
ComPPI: a cellular compartment-specific database for protein–protein interaction network analysis
Due to its novel features, ComPPI is useful for the analysis of experimental results in biochemistry and molecular biology, as well as for proteome-wide studies in bioinformatics and network science helping cellular biology, medicine and drug design.
Critical evaluation of web-based prediction tools for human protein subcellular localization
A systematic evaluation of several publicly available subcellular localization prediction methods on various benchmark data sets finds that mLASSO-Hum and pLoc-mHum provide a statistically significant improvement in performance, as measured by the value of accuracy, relative to the other methods.
Towards defining the nuclear proteome
This work reports direct experimental evidence that the nuclear proteome consists of at least 14% of the entire proteome, and determines the stringency and types of lines of evidence researchers consider to infer the size and complement of thenuclear proteome.


LOCATE: a mouse protein subcellular localization database
We present here LOCATE, a curated, web-accessible database that houses data describing the membrane organization and subcellular localization of proteins from the FANTOM3 Isoform Protein Sequence
Evaluation and comparison of mammalian subcellular localization prediction methods
No individual method had a sufficient level of sensitivity across both evaluation sets that would enable reliable application to hypothetical proteins, and all methods showed lower performance on the LOCATE dataset and variable performance on individual subcellular localizations was observed.
Subcellular Localization of Mammalian Type II Membrane Proteins
This approach combines mining of published literature to identify sub cellular localization data and a high‐throughput, polymerase chain reaction (PCR)‐based approach to experimentally characterize subcellular localization of type II membrane proteins.
Location proteomics: a systems approach to subcellular location.
  • R. Murphy
  • Biology
    Biochemical Society transactions
  • 2005
Preliminary work suggests the feasibility of expressing each unique pattern as a generative model that can be incorporated into comprehensive models of cell behaviour, and automated, objective high-resolution descriptions of protein location patterns within cells.
MemO: A Consensus Approach to the Annotation of a Protein's Membrane Organization
The MemO pipeline represents an integrated strategy for the application of state-of-the-art bioinformatics tools to the annotation of protein membrane organization, a property which adds biological context to the large quantities of protein sequence information available.
Differential Use of Signal Peptides and Membrane Domains Is a Common Occurrence in the Protein Output of Transcriptional Units
The generation of protein isoforms that are targeted to multiple subcellular locations represents a major functional consequence of transcript variation within the mouse transcriptome.
The Mouse Genome Database (MGD): from genes to mice—a community resource for mouse biology
Improvements in MGD discussed here include the enhancement of phenotype resources, the re-development of the International Mouse Strain Resource, IMSR, the update of mammalian orthology datasets and the electronic publication of classic books in mouse genetics.
LIFEdb: a database for functional genomics experiments integrating information from external sources, and serving as a sample tracking system
LIFEdb is implemented to link information regarding novel human full-length cDNAs generated and sequenced by the German cDNA Consortium with functional information on the encoded proteins produced in functional genomics and proteomics approaches.
MICheck: a web tool for fast checking of syntactic annotations of bacterial genomes
A new web program, MICheck (MIcrobial genome Checker), that enables rapid verification of sets of annotated genes and frameshifts in previously published bacterial genomes and can be seen as a preliminary step before the functional re-annotation step to check quickly for missing or wrongly annotated gene annotations.
The Transcriptional Landscape of the Mammalian Genome
Detailed polling of transcription start and termination sites and analysis of previously unidentified full-length complementary DNAs derived from the mouse genome provide a comprehensive platform for the comparative analysis of mammalian transcriptional regulation in differentiation and development.