Bivariate Causal Discovery and Its Applications to Gene Expression and Imaging Data Analysis

  title={Bivariate Causal Discovery and Its Applications to Gene Expression and Imaging Data Analysis},
  author={Rong Jiao and Nan Lin and Zixin Hu and David A. Bennett and Li Jin and Momiao Xiong},
  journal={Frontiers in Genetics},
The mainstream of research in genetics, epigenetics, and imaging data analysis focuses on statistical association or exploring statistical dependence between variables. Despite their significant progresses in genetic research, understanding the etiology and mechanism of complex phenotypes remains elusive. Using association analysis as a major analytical platform for the complex data analysis is a key issue that hampers the theoretic development of genomic science and its application in practice… 

Tables from this paper

Shared Causal Paths underlying Alzheimer’s dementia and Type 2 Diabetes
Novel causal inference methods for genetic studies of AD and T2DM and pipelines for systematic multi-omic casual analysis to infer multilevel omics causal networks for the discovery of common paths from genetic variants to AD andT2DM are developed.
Causal Analysis of Health Interventions and Environments for Influencing the Spread of COVID-19 in the United States of America
The results showed that the majority of non-pharmaceutical interventions had a large effect on slowing the transmission and reducing deaths, and that health interventions were still needed to contain COVID-19.
KOMPOS: Connecting Causal Knots in Large Nonlinear Time Series with Non-Parametric Regression Splines
This work proposes KOMPOS, a novel algorithmic framework that combines a powerful concept from causal discovery of additive noise models with graphical ones and demonstrates the superiority of the proposed algorithm over existing causal discovery methods, especially for the challenging cases of autocorrelated and non-stationary time series.
A Review of Causality for Learning Algorithms in Medical Image Analysis
It is found that causal analysis has the potential to mitigate critical problems for clinical translation but that uptake and clinical downstream research has been limited so far.
Causal Discovery using Compression-Complexity Measures
Population Vaccine Effectiveness and its Implication for Control of the Spread of COVID-19 in the US
A population vaccine efficacy which integrated individual vaccine efficacy, vaccine prioritization strategy and NPI to measure and monitor the control of the spread of COVID-19 was proposed and demonstrated that the proportions of adults in the population across 50 states were significantly associated with the forecasted ending date of the CO VID-19.


A quadratically regularized functional canonical correlation analysis for identifying the global structure of pleiotropy with NGS data
A new statistic method referred to as a quadratically regularized functional CCA (QRFCCA) for association analysis which combines three approaches: (1) quadratic regularized matrix factorization, (2) functional data analysis and (3) canonical correlation analysis (CCA).
Identification of genes associated with dissociation of cognitive performance and neuropathological burden: Multistep analysis of genetic, epigenetic, and transcriptional data
ENC1 and UNC5C are identified as genes with convergent genetic, epigenetic, and transcriptomic evidence supporting a potential role in the dissociation of cognition and neuropathology in an aging population, and the understanding of the TMEM106B haplotype that is protective against TDP-43 proteinopathy is expanded.
Distinguishing Cause from Effect Using Observational Data: Methods and Benchmarks
Empirical results on real-world data indicate that certain methods are indeed able to distinguish cause from effect using only purely observational data, although more benchmark data would be needed to obtain statistically significant conclusions.
WGCNA: an R package for weighted correlation network analysis
The WGCNA R software package is a comprehensive collection of R functions for performing various aspects of weighted correlation network analysis that includes functions for network construction, module detection, gene selection, calculations of topological properties, data simulation, visualization, and interfacing with external software.
Big Data in Omics and Imaging: Integrated Analysis and Causal Inference.
This book does not fulfill the expectations of a textbook since graduate students will have to consult secondary literature for a full understanding of the theory, but academic instructors may find it helpful as a source of possible topics to include in their own classes.
A Linear Non-Gaussian Acyclic Model for Causal Discovery
This work shows how to discover the complete causal structure of continuous-valued data, under the assumptions that (a) the data generating process is linear, (b) there are no unobserved confounders, and (c) disturbance variables have non-Gaussian distributions of non-zero variances.
Score-based causal learning in additive noise models
This work uses a penalized likelihood as a score for model selection and proves consistency of the penalized maximum likelihood estimator when making smoothness assumptions on the probability density space.
Functional Principal Component Analysis and Randomized Sparse Clustering Algorithm for Medical Image Analysis
The results demonstrate that the randomized feature selection method coupled with functional principal component analysis substantially outperforms the current sparse clustering algorithms in image cluster analysis.
Nonlinear causal discovery with additive noise models
It is shown that the basic linear framework can be generalized to nonlinear models and, in this extended framework, nonlinearities in the data-generating process are in fact a blessing rather than a curse, as they typically provide information on the underlying causal system and allow more aspects of the true data-Generating mechanisms to be identified.
Causal Inference Using the Algorithmic Markov Condition
This work explains why a consistent reformulation of causal inference in terms of algorithmic complexity implies a new inference principle that takes into account also the complexity of conditional probability densities, making it possible to select among Markov equivalent causal graphs.