• Corpus ID: 1631247

BioMM: Biologically-informed Multi-stage Machine learning for identification of epigenetic fingerprints

  title={BioMM: Biologically-informed Multi-stage Machine learning for identification of epigenetic fingerprints},
  author={Junfang Chen and Emanuel Schwarz},
  journal={arXiv: Quantitative Methods},
The identification of reproducible biological patterns from high-dimensional data is a bottleneck for understanding the biology of complex illnesses such as schizophrenia. To address this, we developed a biologically informed, multi-stage machine learning (BioMM) framework. BioMM incorporates biological pathway information to stratify and aggregate high-dimensional biological data. We demonstrate the utility of this method using genome-wide DNA methylation data and show that it substantially… 
1 Citations

Tables from this paper

Leveraging TCGA gene expression data to build predictive models for cancer drug response
Primary tumor gene expression is a good predictor of cancer drug response and investment in larger datasets containing both patient gene expression and drug response is needed to support future work of machine learning models.


A System‐Level Pathway‐Phenotype Association Analysis Using Synthetic Feature Random Forest
A system‐level pathway analysis approach, synthetic feature random forest (SF‐RF), which is designed to detect pathway‐phenotype associations without making assumptions about the relationships among SNPs or pathways is proposed.
DNA methylation age of human tissues and cell types
It is proposed that DNA methylation age measures the cumulative effect of an epigenetic maintenance system, and can be used to address a host of questions in developmental biology, cancer and aging research.
Diagnostic classification of schizophrenia by neural network analysis of blood-based gene expression signatures
An integrated genetic-epigenetic analysis of schizophrenia: evidence for co-localization of genetic associations and differential DNA methylation
This study represents the first systematic integrated analysis of genetic and epigenetic variation in schizophrenia, introducing a methodological approach that can be used to inform epigenome-wide association study analyses of other complex traits and diseases.
DNA methylation in schizophrenia: progress and challenges of epigenetic studies
Epigenetic studies of schizophrenia patients using postmortem brains or peripheral tissues are reviewed, focusing mainly on DNA methylation, to contribute to understanding of schizophrenia etiology and provide novel opportunities for the development of therapeutic drugs.
Identification of Genetic and Epigenetic Marks Involved in Population Structure
The results suggest that the interrelationship between genetic and epigenetic population structure is mediated via complex multiple gene interactions in shared biological processes, through possibly, SNP-dependent modulation and ID2 repressor function.
Biological Insights From 108 Schizophrenia-Associated Genetic Loci
Associations at DRD2 and several genes involved in glutamatergic neurotransmission highlight molecules of known and potential therapeutic relevance to schizophrenia, and are consistent with leading pathophysiological hypotheses.
Tobacco Smoking Leads to Extensive Genome-Wide Changes in DNA Methylation
The results of this study confirm the broad effect of tobacco smoking on the human organism, but also show that quitting tobacco smoking presumably allows regaining the DNA methylation state of never smokers.
Epigenetic mechanisms in schizophrenia.
Pathway-wide association study identifies five shared pathways associated with schizophrenia in three ancestral distinct populations
Empirical support is provided for schizophrenia as a pathway disorder and it is suggested that schizophrenia is not only a polygenic but likely also a poly-pathway disorder characterized by both genetic and pathway heterogeneity.