Persistent topology for cryo‐EM data analysis

  title={Persistent topology for cryo‐EM data analysis},
  author={Kelin Xia and Guowei Wei},
  journal={International Journal for Numerical Methods in Biomedical Engineering},
  pages={n/a - n/a}
  • Kelin Xia, G. Wei
  • Published 7 December 2014
  • Biology
  • International Journal for Numerical Methods in Biomedical Engineering
In this work, we introduce persistent homology for the analysis of cryo‐electron microscopy (cryo‐EM) density maps. We identify the topological fingerprint or topological signature of noise, which is widespread in cryo‐EM data. For low signal‐to‐noise ratio (SNR) volumetric data, intrinsic topological features of biomolecular structures are indistinguishable from noise. To remove noise, we employ geometric flows that are found to preserve the intrinsic topological fingerprints of cryo‐EM… 
Multidimensional persistence in biomolecular data
The utility, robustness, and efficiency of the proposed topological methods are demonstrated via protein folding, protein flexibility analysis, the topological denoising of cryoelectron microscopy data, and the scale dependence of nanoparticles.
Multiresolution persistent homology for excessively large biomolecular datasets.
The topological persistence of a virus capsid with 273 780 atoms is successfully analyzed which would otherwise be inaccessible to the normal point cloud method and unreliable by using coarse-grained multiscale persistent homology.
Persistent Homology Analysis of Biomolecular Data
Persistent homology, a new branch of algebraic topology, embeds multiscale geometric information into topological invariants to achieve an interplay between geometry and topology.
A quantitative structure comparison with persistent similarity
A new method called persistent similarity, which is based on a newly-invented method in algebraic topology, known as persistent homology, is proposed, which can be used to describe the intrinsic similarities and differences between the structures very well.
Persistent similarity for biomolecular structure comparison
A new method called persistent similarity, which is based on a newly-invented method in algebraic topology, known as persistent homology, is proposed, which can be used to describe the intrinsic similarities anderences between the structures very well and delivers one of the best results for isomer total curvature energy prediction.
Using persistent homology and dynamical distances to analyze protein binding
It is shown that the majority of active site residues and allosteric pathway residues are located in the vicinity of the most persistent loop in the corresponding filtered Vietoris-Rips complex, a finding that was not observed in the classical anisotropic network model.
Multiscale Persistent Functions for Biomolecular Structure Characterization
The multiscale persistent entropy (MPE) model is introduced and a special protein structure index (PSI) is proposed, for the first time, to describe the “regularity” of protein structures.
Weighted persistent homology for biomolecular data analysis
The LWPH based principal component analysis (PCA) model can identify two configurational states of DNA structures in ion liquid environment, which can be revealed only by the complicated helical coordinate system.
Persistent homology analysis of ion aggregations and hydrogen-bonding networks.
  • Kelin Xia
  • Materials Science
    Physical chemistry chemical physics : PCCP
  • 2018
A newly-invented mathematical method called persistent homology is introduced, for the first time, to quantitatively analyze the intrinsic topological properties of ion aggregation systems and hydrogen-bonding networks, finding that for both models, KSCN systems demonstrate much more dramatic variations in their local circle structures with a concentration increase.
Persistent Cohomology for Data With Multicomponent Heterogeneous Information
It is found that the proposed framework outperforms or at least matches the state-of-the-art methods in the protein-ligand binding affinity prediction from massive biomolecular datasets without resorting to any deep learning formulation.


Improved Efficiency in Cryo-EM Secondary Structure Topology Determination from Inaccurate Data
A method that combines the concept of dynamic graph with the previous work of using constrained shortest path to identify the topology of the secondary structures is presented, showing a reduction of 34.55% of run-time as comparison to the naïve way of handling the inaccuracies.
Persistent homology analysis of protein structure, flexibility, and folding
  • Kelin XiaG. Wei
  • Biology
    International journal for numerical methods in biomedical engineering
  • 2014
The topology–function relationship of proteins is revealed, for the first time, and an excellent consistence between the persistent homology prediction and molecular dynamics simulation is found.
Topological De-Noising: Strengthening the Topological Signal
The topological de-noising algorithm allows for the recovery of topological information that is inaccessible by thresholding methods and is presented as an easy to implement, computationally efficient pre-processing algorithm to prepare noisy point cloud data sets for topological data analysis.
Multiscale geometric modeling of macromolecules I: Cartesian representation
A topological measurement of protein compressibility
In this paper we partially clarify the relation between the compressibility of a protein and its molecular geometric structure. To identify and understand the relevant topological features within a
Deriving Topology and Sequence Alignment for the Helix Skeleton in Low-Resolution protein Density Maps
This work developed a method to predict the topology and sequence alignment for the skeleton helices of protein complexes using the Rosetta ab initio structure prediction method, and analyzed the use of the skeletons as a clustering tool for the decoy structures generated by Rosetta.
Persistent homology for the quantitative prediction of fullerene stability
Novel quantitative predictions of the energy and stability of fullerene molecules are reported, the very first attempt in using persistent homology in this context.
Ab Initio Modeling of the Herpesvirus VP26 Core Domain Assessed by CryoEM Density
This work developed and applied a hybrid modeling approach, utilizing cryoEM density and ab initio modeling to produce a structural model for the core domain of a herpesvirus structural protein, VP26, based on the 8.5-Å resolution herpes simplex virus type 1 (HSV-1) capsid CryoEM structure and mutational data.