Current trends in computational inference from mass spectrometry-based proteomics

  title={Current trends in computational inference from mass spectrometry-based proteomics},
  author={Bobbie-Jo M. Webb-Robertson and William R. Cannon},
  journal={Briefings in bioinformatics},
  volume={8 5},
Mass spectrometry offers a high-throughput approach to quantifying the proteome associated with a biological sample and hence has become the primary approach of proteomic analyses. Computation is tightly coupled to this advanced technological platform as a required component of not only peptide and protein identification, but quantification and functional inference, such as protein modifications and interactions. Proteomics faces several key computational challenges such as identification of… 

Figures from this paper

Developing Well-Annotated Species-Specific Protein Databases Using Comparative Proteogenomics.

Current proteomic and proteogenomic methods will be reviewed, and an example of a comparative proteomics method using lake trout liver samples will be described.

Protein inference based on peptides identified from tandem mass spectra

A new method to estimate the accuracy of peptide identification with logistic regression (LR) and exemplify it based on Sequest scores is proposed, which can robustly assign accurate probabilities to peptides and has a very high discrimination power, higher than that of PeptideProphet.

Data Preprocessing and Filtering in Mass Spectrometry Based Proteomics

The current main principles underlying the preprocessing of mass spectrometry data are reviewed and an overview of the publicly available tools are given.

Protein Inference by Assembling Peptides Identified from Tandem Mass Spectra

This review surveys the advantages and disadvantages of several state-of-the-art statistical models which applied parsimony principle, proteotypic peptides, and peptide detectability and gives a brief introduction of peptide identification.

Computational Approaches to Peptide Identification via Tandem MS

This chapter will provide an overview of the methods that common, popular software tools employ to search protein sequence databases to provide the non-expert reader with sufficient background to appreciate the choices they can make.

PROTEOMER: A workflow‐optimized laboratory information management system for 2‐D electrophoresis‐centered proteomics

A proteomic database custom tailored to the authors' needs aiming at connecting MS protein identification information to 2‐DE derived protein expression profiles and its high cross‐referencing capacity, enabling integration of a wide range of experimental data.

A Critical Review of Bottom-Up Proteomics: The Good, the Bad, and the Future of This Field

This work aims to describe a bottom-up proteomics workflow from sample preparation to data analysis, including all of its benefits and pitfalls, and describes potential improvements in this type of proteochemistry workflow for the future.

Improving peptide identification sensitivity in shotgun proteomics by stratification of search space.

A scheme that categorizes qualified peptides into three tiers: tryptic, semitryptic, and nontryptic is designed, which can significantly improve retrieval performance compared to those of search strategies that assign equal Bonferroni correction factors to allqualified peptides.

Chemical rule-based filtering of MS/MS spectra

A novel chemical rule-based filtering algorithm, termed CRF, which makes use of the predictable patterns (rules) of collision-induced peptide fragmentation to improve performance by 15-20% in terms of peptide annotation using search engines, such as X!Tandem.



Challenges in mass spectrometry‐based proteomics

The development of “functional proteomics” combines functional characterization, like regulation, localization and modification, with the identification of proteins for deeper insight into cellular functions, and quantification of protein levels within cells is becoming a focus of interest.

Computational prediction of proteotypic peptides for quantitative proteomics

Using >600,000 peptide identifications generated by four proteomic platforms, it is shown that characteristic physicochemical properties of these peptides were used to develop a computational tool that can predict proteotypic peptides for any protein from any organism, for a given platform, with >85% cumulative accuracy.

Mass spectrometry-based proteomics in the life sciences

  • C. Lane
  • Biology
    Cellular and Molecular Life Sciences CMLS
  • 2005
An overview of the current applications of mass spectrometry-based proteomics to the cellular and molecular life sciences is presented, and the challenges that exist for research in the future are discussed.

Toward a high-throughput approach to quantitative proteomic analysis: Expression-dependent protein identification by mass spectrometry

Mass Spectrometry and Protein Analysis

Recent advances in mass spectrometry instrumentation are reviewed in the context of current and emerging research strategies in protein science.

Characterizing phosphoproteins and phosphoproteomes using mass spectrometry.

  • M. Goshe
  • Biology
    Briefings in functional genomics & proteomics
  • 2006
This review addresses some of the strategies and limitations involving the use of MS to map and quantify changes in protein phosphorylation sites for samples that range from a single protein to an entire proteome, and presents several compelling reasons as to why comprehensive phosphorylated site analysis has proven to be so elusive.

The Need for Guidelines in Publication of Peptide and Protein Identification Data

In an effort to ensure that high-quality, significant data are entering the proteomics literature, Molecular & Cellular Proteomics (MCP) is introducing guidelines for authors planning to submit manuscripts containing large numbers of proteins identified primarily by LC-MS/MS.

An accurate mass tag strategy for quantitative and high‐throughput proteome measurements

A global strategy that extends the sensitivity, dynamic range, comprehensiveness, and throughput of proteomic measurements based upon the use of peptide “accurate mass tags” (AMTs) produced by global protein enzymatic digestion is described and demonstrated.

Systematic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometry

Comparison of the HMS-PCI data set with interactions reported in the literature revealed an average threefold higher success rate in detection of known complexes compared with large-scale two-hybrid studies.

An integrated mass spectrometric and computational framework for the analysis of protein interaction networks

This work describes a strategy that analyzes protein complexes through the integration of label-free, quantitative mass spectrometry and computational analysis that addresses several limitations of current approaches for protein complexes.