Reconstruction of the Protein-Protein Interaction Network for Protein Complexes Identification by Walking on the Protein Pair Fingerprints Similarity Network

  title={Reconstruction of the Protein-Protein Interaction Network for Protein Complexes Identification by Walking on the Protein Pair Fingerprints Similarity Network},
  author={Bo Xu and Yu Liu and Chi Lin and Jie Dong and Xiaoxia Liu and Zengyou He},
  journal={Frontiers in Genetics},
Identifying protein complexes from protein-protein interaction networks (PPINs) is important to understand the science of cellular organization and function. However, PPINs produced by high-throughput studies have high false discovery rate and only represent snapshot interaction information. Reconstructing higher quality PPINs is essential for protein complex identification. Here we present a Multi-Level PPINs reconstruction (MLPR) method for protein complexes detection. From existing PPINs, we… 

Identifying Protein Complexes With Clear Module Structure Using Pairwise Constraints in Protein Interaction Networks

A novel semi-supervised protein complex detection model based on non-negative matrix tri-factorization is proposed, which not only considers topological structure of a PPI network but also makes full use of available high quality known protein pairs with must-link constraints.

Small protein complex prediction algorithm based on protein–protein interaction network segmentation

A novel method, called BOPS, which enumerates the connected subset of each small PPINs, identifies potential protein complexes based on cohesion and removes those that are similar, and a weighted Homo sapiens PPIN is constructed, and BOPS gets the best result in it.

Protein‐protein interaction networks as miners of biological discovery

Experimental methods for identifying PPI pairs, including yeast two‐hybrid (Y2H), mass spectrometry (MS), co‐localization, and co‐immunoprecipitation are reviewed, which aid biological discovery through identifying hub genes and dynamic changes in the network.

Protein Complexes Identification with Family-Wise Error Rate Control

A new detection method SSF that is capable of controlling the FWER of each reported protein complex is proposed that can achieve the highest precision and outperforms three competing methods in terms of normalized mutual information (NMI) and F1 score in most cases.

Reconstruction of Nannochloropsis oculata protein-protein interaction network for growth and triacylglycerol production

Computational biological pathway reconstruction positively supported network perturbations to optimize microalgae lipid productivity and actual experimentation validated the protein-protein network.

Network Analysis of Heart Infarction Mice Transcriptomes

This work combines the differential gene expression data from mice transcriptomes that arose from a heart infarction experiment with the protein-protein interaction databases STRING and BioGRID to identify functional modules of genes that play a role in the healing process after a heart attack.

Big data, integrative omics and network biology.

An Augmented High-Dimensional Graphical Lasso Method to Incorporate Prior Biological Knowledge for Global Network Learning

It is demonstrated that AhGlasso improves protein network inference compared to the Netgsa approach by incorporating PPI information and outperforms weighted graphical Lasso-based algorithms with respect to computational time in simulated large-scale data settings while achieving better or comparable prediction accuracy of node connections.

Motifs in Big Networks: Methods and Applications

A comprehensive survey on motifs in the context of big networks, introducing the definition of motifs and other related concepts and examining methods for motif discovery, motif counting, and motif clustering.

Systems Medicine Design based on Systems Biology Approaches and Deep Neural Network for Gastric Cancer

This study proposed a systems medicine design procedure to identify essential biomarkers and find corresponding drugs for GC, and suggested potential multiple-molecule drugs efficiently.



Ontology integration to identify protein complex in protein interaction networks

A novel semantic similarity method, which use Gene Ontology (GO) annotations to measure the reliability of protein-protein interactions, which is applied to the protein interaction network of Sacchromyces cerevisiae and identifies many well known complexes.

Protein Complex Identification by Integrating Protein-Protein Interaction Evidence from Multiple Sources

This work combines PPI information from 6 different sources and obtained a reconstructed PPI network for yeast through machine learning and concludes that incorporating PPI Information from other sources can improve the effectiveness of protein complex identification.

Computational approaches for detecting protein complexes from protein interaction networks: a survey

The state-of-the-art techniques for computational detection of protein complexes are reviewed, some promising research directions in this field are discussed, and experimental results with yeast protein interaction data show that the interaction subgraphs discovered by various computational methods matched well with actual protein complexes.

Protein complex prediction based on simultaneous protein interaction network

The evaluation results show that the proposed method outperforms the simple PPIN-based method in terms of removing false positive proteins in the formation of complexes and shows that excluding competition between MEIs can be effective for improving prediction accuracy in general computational approaches involving protein interactions.

Complex discovery from weighted PPI networks

An algorithm called CMC (clustering-based on maximal cliques) is developed to discover complexes from the weighted PPI network and is shown to be an effective approach to protein complex prediction from protein interaction network.

A core-attachment based method to detect protein complexes in PPI networks

A novel core-attachment based method (COACH) which detects protein complexes in two stages and includes attachments into these cores to form biologically meaningful structures, which shows that COACH performs significantly better than the state-of-the-art techniques.

A Max-Flow-Based Approach to the Identification of Protein Complexes Using Protein Interaction and Microarray Data

The emergence of high-throughput technologies leads to abundant protein-protein interaction (PPI) data and microarray gene expression profiles, and provides a great opportunity for the identification

Clustering and Summarizing Protein-Protein Interaction Networks: A Survey

This issue is examined by classifying, discussing, and comparing a wide ranging approaches proposed by the bioinformatics community to cluster PPI networks, which can enable us to make sense out of the information contained in large PPI Networks by generating multi-level functional summaries.

Modifying the DPClus algorithm for identifying protein complexes based on new topological structures

A new topological structure for protein complexes is proposed, which is a combination of subgraph diameter (or average vertex distance) and subgraph density, which makes it possible to identify dense subgraphs in protein interaction networks, many of which correspond to known protein complexes.

An automated method for finding molecular complexes in large protein interaction networks

A novel graph theoretic clustering algorithm, "Molecular Complex Detection" (MCODE), that detects densely connected regions in large protein-protein interaction networks that may represent molecular complexes is described.