Multidimensional persistence in biomolecular data

  title={Multidimensional persistence in biomolecular data},
  author={Kelin Xia and Guowei Wei},
  journal={Journal of Computational Chemistry},
  pages={1502 - 1520}
  • Kelin Xia, G. Wei
  • Published 24 December 2014
  • Biology
  • Journal of Computational Chemistry
Persistent homology has emerged as a popular technique for the topological simplification of big data, including biomolecular data. Multidimensional persistence bears considerable promise to bridge the gap between geometry and topology. However, its practical and robust construction has been a challenge. We introduce two families of multidimensional persistence, namely pseudomultidimensional persistence and multiscale multidimensional persistence. The former is generated via the repeated… 

Multiresolution persistent homology for excessively large biomolecular datasets.

The topological persistence of a virus capsid with 273 780 atoms is successfully analyzed which would otherwise be inaccessible to the normal point cloud method and unreliable by using coarse-grained multiscale persistent homology.

Multiscale Persistent Functions for Biomolecular Structure Characterization

The multiscale persistent entropy (MPE) model is introduced and a special protein structure index (PSI) is proposed, for the first time, to describe the “regularity” of protein structures.

Persistent Cohomology for Data With Multicomponent Heterogeneous Information

It is found that the proposed framework outperforms or at least matches the state-of-the-art methods in the protein-ligand binding affinity prediction from massive biomolecular datasets without resorting to any deep learning formulation.

Pseudo-Multidimensional Persistence and Its Applications

This new approach is able to differentiate between topologically equivalent geometric objects and offers insight into the study of the Kuramoto–Sivashinsky partial differential equation and Lissajous knots.

A quantitative structure comparison with persistent similarity

A new method called persistent similarity, which is based on a newly-invented method in algebraic topology, known as persistent homology, is proposed, which can be used to describe the intrinsic similarities and differences between the structures very well.

Persistent similarity for biomolecular structure comparison

A new method called persistent similarity, which is based on a newly-invented method in algebraic topology, known as persistent homology, is proposed, which can be used to describe the intrinsic similarities anderences between the structures very well and delivers one of the best results for isomer total curvature energy prediction.

Representability of algebraic topology for biomolecules in machine learning based scoring and virtual screening

It is demonstrated that the present topological learning outperforms other existing methods in protein-ligand binding affinity prediction and ligand-decoy discrimination.

Using persistent homology and dynamical distances to analyze protein binding

It is shown that the majority of active site residues and allosteric pathway residues are located in the vicinity of the most persistent loop in the corresponding filtered Vietoris-Rips complex, a finding that was not observed in the classical anisotropic network model.

Weighted persistent homology for biomolecular data analysis

The LWPH based principal component analysis (PCA) model can identify two configurational states of DNA structures in ion liquid environment, which can be revealed only by the complicated helical coordinate system.

Evolutionary homology on coupled dynamical systems with applications to protein flexibility analysis

Numerical results for the B-factor prediction of a benchmark set of 364 proteins indicate that the proposed evolutionary homology (EH) outperforms all the other state-of-the-art methods in the field.



Persistent homology analysis of protein structure, flexibility, and folding

  • Kelin XiaG. Wei
  • Biology
    International journal for numerical methods in biomedical engineering
  • 2014
The topology–function relationship of proteins is revealed, for the first time, and an excellent consistence between the persistent homology prediction and molecular dynamics simulation is found.

Objective-oriented Persistent Homology

The present work offers the first example to design objective-oriented persistent homology to enhance or preserve desirable features in the original data during the filtration process and then automatically detect or extract the corresponding topological traits from the data.

Persistent homology for the quantitative prediction of fullerene stability

Novel quantitative predictions of the energy and stability of fullerene molecules are reported, the very first attempt in using persistent homology in this context.

A topological measurement of protein compressibility

In this paper we partially clarify the relation between the compressibility of a protein and its molecular geometric structure. To identify and understand the relevant topological features within a

Multiscale geometric modeling of macromolecules II: Lagrangian representation

The resulting fine resolution representations of a biomolecular system enable the detailed analysis of solvent–solute interaction, and ion channel dynamics, whereas the coarse resolution representations highlight the compatibility of protein‐ligand bindings and possibility of protein–protein interactions.

Geometric modeling of subcellular structures, organelles, and multiprotein complexes

A suite of computational tools for volumetric data processing, information extraction, surface mesh rendering, geometric measurement, and curvature estimation of biomolecular complexes is introduced and a comprehensive protocol for the geometric modeling of subcellular structures, organelles, and multiprotein complexes is offered.

The Persistence Space in Multidimensional Persistent Homology

The persistence space of a vector-valued continuous function is introduced to generalize the concept of persistence diagram in this sense and a method to visualize topological features of a shape via persistence spaces is presented.

Persistent Brain Network Homology From the Perspective of Dendrogram

A novel multiscale framework that models all brain networks generated over every possible threshold and is based on persistent homology and its various representations such as the Rips filtration, barcodes, and dendrograms to quantify various persistent topological features at different scales in a coherent manner.

Topological Measurement of Protein Compressibility via Persistence Diagrams

We exploit recent advances in computational topology to study the compressibility of various proteins found in the Protein Data Bank (PDB). Our fundamental tool is the persistence diagram, a