Mass-spectrometry-based draft of the human proteome

@article{Wilhelm2014MassspectrometrybasedDO,
  title={Mass-spectrometry-based draft of the human proteome},
  author={Mathias Wilhelm and Judith Schlegl and Hannes Hahne and Amin Moghaddas Gholami and M. Lieberenz and Mikhail M. Savitski and Emanuel Ziegler and Lars Butzmann and Siegfried Gessulat and Harald Marx and Toby Mathieson and Simone Lemeer and Karsten Schnatbaum and Ulf Reimer and Holger Wenschuh and Martin Mollenhauer and Julia B. Slotta-Huspenina and Joos-Hendrik Boese and Marcus Bantscheff and Anja Gerstmair and Franz Faerber and Bernhard Kuster},
  journal={Nature},
  year={2014},
  volume={509},
  pages={582-587}
}
Proteomes are characterized by large protein-abundance differences, cell-type- and time-dependent expression patterns and post-translational modifications, all of which carry biological information that is not accessible by genomics or transcriptomics. Here we present a mass-spectrometry-based draft of the human proteome and a public, high-performance, in-memory database for real-time analysis of terabytes of big data, called ProteomicsDB. The information assembled from human tissues, cell… Expand
Systematic detection of functional proteoform groups from bottom-up proteomic datasets
TLDR
A novel, data-driven strategy to assign peptides to unique functional proteoform groups based on peptide correlation patterns across large bottom-up proteomic datasets is presented, which enabled the systematic detection and evaluation of assembly specific proteoforms at an unprecedented scale. Expand
Proteomic Profiling of the Human Tissue and Biological Fluid Proteome.
TLDR
This work performed label-free liquid chromatography coupled to tandem MS (LC-MS/MS) to profile the normal human proteome and generated tandem mass spectra corresponding to 13,028 unique human protein-coding genes, which did not accomplish complete proteome coverage. Expand
ProteomeGenerator: A framework for comprehensive proteomics based on de novo transcriptome assembly and high-accuracy peptide mass spectral matching
TLDR
ProteomeGenerator is reported, a framework for de novo and reference-assisted proteogenomic database construction and analysis based on sample-specific transcriptome sequencing and high-resolution andhigh-accuracy mass spectrometry proteomics, demonstrating high-confidence identification of non-canonical protein isoforms arising from alternative transcriptional start sites, intron retention, and cryptic exon splicing and improved accuracy of genome-scale proteome discovery. Expand
ProteomeGenerator: A Framework for Comprehensive Proteomics Based on de Novo Transcriptome Assembly and High-Accuracy Peptide Mass Spectral Matching.
TLDR
ProteomeGenerator, a framework for de novo and reference-assisted proteogenomic database construction and analysis based on sample-specific transcriptome sequencing and high-accuracy mass spectrometry proteomics, is reported, demonstrating high-confidence identification of non-canonical protein isoforms arising from alternative transcriptional start sites, intron retention, and cryptic exon splicing as well as improved accuracy of genome-scale proteome discovery. Expand
Multiplexed Quantitative Proteomics for High-Throughput Comprehensive Proteome Comparisons of Human Cell Lines.
TLDR
The use of multiplexed quantitative proteomics using isobaric labeling with tandem mass tags (TMT) for the simultaneous quantitative analysis of five cancer cell proteomes in biological duplicates in one mass spectrometry experiment is described. Expand
Proteomics beyond large-scale protein expression analysis.
TLDR
This work states that an explosion of novel proteomics workflows has significantly expanded proteomics beyond the analysis of protein expression, providing new tools for multi-level 'omics' analysis and for the modeling of biological networks in the context of systems biology studies. Expand
Proteogenomics: Integrating Next-Generation Sequencing and Mass Spectrometry to Characterize Human Proteomic Variation.
TLDR
The field of human proteogenomics is reviewed, with an emphasis on its history, current implementations, the types of proteomic variations it reveals, and several important applications. Expand
The Human Protein Atlas – an important resource for basic and clinical research
  • C. Lindskog
  • Biology, Medicine
  • Expert review of proteomics
  • 2016
TLDR
The transcriptomics data generated by the Human Protein Atlas project were compared with other available human transcriptome resources focusing on protein-coding genes, and it is suggested that next generation sequencing of the transcriptome is an attractive tool for indirect measurements of protein expression. Expand
Mass-spectrometry-based near-complete draft of the Saccharomyces cerevisiae proteome
TLDR
This work generates the largest yeast proteome dataset, including 5610 identified proteins using a strategy based on optimized sample preparation and high-resolution mass spectrometry, which achieves near complete coverage of the yeast ORFs. Expand
The quest of the human proteome and the missing proteins: digging deeper.
TLDR
The possible reasons for the "missing proteins" in shotgun MS workflow were identified and debated by experts as low tissue expression, lack of enzymatic digestion site, or protein lost during extraction, among other contributing factors. Expand
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 47 REFERENCES
Deep proteome and transcriptome mapping of a human cancer cell line
TLDR
Comparisons of the proteome and the transcriptome, and analysis of protein complex databases and GO categories, suggest that deep coverage of the functional transcriptome andThe proteome of a single cell type is achieved. Expand
Mapping Intact Protein Isoforms in Discovery Mode Using Top Down Proteomics
TLDR
Identification of 1,043 gene products from human cells that are dispersed into more than 3,000 protein species created by post-translational modification, RNA splicing and proteolysis is shown, using a new four-dimensional separation system. Expand
Quantitative and qualitative proteome characteristics extracted from in-depth integrated genomics and proteomics analysis.
TLDR
The multilevel analysis identified a genomic variant in the promoter of the most differentially expressed gene Cyp17a1, a previously reported top hit in genome-wide association studies for human hypertension, as a potential contributor to the hypertension phenotype in SHR rats. Expand
Initial Quantitative Proteomic Map of 28 Mouse Tissues Using the SILAC Mouse*
TLDR
A computational framework is described with which to correlate proteome profiles with physiological functions of the tissue and it is shown that physiologically related tissues clustered together and that highly expressed proteins represented the characteristic tissue functions. Expand
Comparative Proteomic Analysis of Eleven Common Cell Lines Reveals Ubiquitous but Varying Expression of Most Proteins*
TLDR
This work analyzes 11 human cell lines using an LTQ-Orbitrap family mass spectrometer with a “high field” Orbitrap mass analyzer with improved resolution and sequencing speed to construct a broad coverage “super-SILAC” quantification standard. Expand
Analysis of the Human Tissue-specific Expression by Genome-wide Integration of Transcriptomics and Antibody-based Proteomics*
TLDR
A quantitative transcriptomics analysis (RNA-Seq) is used to classify the tissue-specific expression of genes across a representative set of all major human organs and tissues and combined this analysis with antibody-based profiling of the same tissues. Expand
Profiling Core Proteomes of Human Cell Lines by One-dimensional PAGE and Liquid Chromatography-Tandem Mass Spectrometry*S
TLDR
Repeated analysis of peptides derived from the same gel band showed that the reproducibility of nanocapillary liquid chromatography-MS/MS of such complex mixtures is about 60–70% suggesting that a particular analytical experiment would need to be repeated about three times to arrive at a representative estimate of the set of highly abundant proteins in a given proteome. Expand
Computational prediction of proteotypic peptides for quantitative proteomics
TLDR
Using >600,000 peptide identifications generated by four proteomic platforms, it is shown that characteristic physicochemical properties of these peptides were used to develop a computational tool that can predict proteotypic peptides for any protein from any organism, for a given platform, with >85% cumulative accuracy. Expand
Identification of missing proteins in the neXtProt database and unregistered phosphopeptides in the PhosphoSitePlus database as part of the Chromosome-centric Human Proteome Project.
TLDR
The in-depth phosphoproteomic study represents a significant contribution to C-HPP and identifies 3,033 "missing proteins", i.e., proteins that currently lack evidence by mass spectrometry, in the neXtProt database and 12,852 unknown phosphorylation sites not registered in the PhosphoSitePlus database. Expand
Global proteome analysis of the NCI-60 cell line panel.
TLDR
A quantitative proteome and kinome profile of the NCI-60 panel covering, in total, 10,350 proteins (including 375 protein kinases) and including a core cancer proteome of 5,578 proteins that were consistently quantified across all tissue types is presented. Expand
...
1
2
3
4
5
...