• Corpus ID: 218571445

The scalable Birth-Death MCMC Algorithm for Mixed Graphical Model Learning with Application to Genomic Data Integration

  title={The scalable Birth-Death MCMC Algorithm for Mixed Graphical Model Learning with Application to Genomic Data Integration},
  author={Nanwei Wang and Laurent Briollais and H{\'e}l{\`e}ne Massam},
Recent advances in biological research have seen the emergence of high-throughput technologies with numerous applications that allow the study of biological mechanisms at an unprecedented depth and scale. A large amount of genomic data is now distributed through consortia like The Cancer Genome Atlas (TCGA), where specific types of biological information on specific type of tissue or cell are available. In cancer research, the challenge is now to perform integrative analyses of high-dimensional… 
Continuous-Time Birth-Death MCMC for Bayesian Regression Tree Models
A new search algorithm which is based on a continuous-time birth-death Markov process which explores the model space by jumping between parameter spaces corresponding to different tree structures and can dramatically improve the convergence and mixing properties of the MCMC algorithm.


TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data
An R/Bioconductor package called TCGAbiolinks is developed to address bioinformatics challenges of the Cancer Genome Atlas data by using a guided workflow to allow users to query, download and perform integrative analyses of TCGA data.
A Local Poisson Graphical Model for Inferring Networks From Sequencing Data
A novel method for inferring gene networks from sequencing data: the Local Poisson Graphical Model, which assumes a Local Markov property where each variable conditional on all other variables is Poisson distributed and develops a neighborhood selection algorithm to fit the model locally.
Mixed Graphical Models via Exponential Families
This work develops a novel class of mixed graphical models by specifying that each node-conditional distribution is a member of a possibly different univariate exponential family.
Bayesian Structure Learning in Sparse Gaussian Graphical Models
A novel and efficient Bayesian framework for Gaussian graphical model determination which is a trans-dimensional Markov Chain Monte Carlo (MCMC) approach based on a continuous-time birth-death process and gives a principled and, in practice, sensible approach for structure learning.
BDgraph: An R Package for Bayesian Structure Learning in Graphical Models
The R package BDgraph is introduced which performs Bayesian structure learning for general undirected graphical models (decomposable and non-decomPOSable) with continuous, discrete, and mixed variables with Bayesian improvements.
mgm: Structure Estimation for Time-Varying Mixed Graphical Models in high-dimensional Data
The R-package mgm is presented for the estimation of both stationary and time-varying mixed graphical models and mixed vector autoregressive models in high-dimensional data and provides performance benchmarks and applications to two medical datasets.
Supervised risk predictor of breast cancer based on intrinsic subtypes.
D diagnosis by intrinsic subtype adds significant prognostic and predictive information to standard parameters for patients with breast cancer.
High-Dimensional Mixed Graphical Models
A novel graphical model for mixed data is proposed, simple enough to be suitable for high-dimensional data, yet flexible enough to represent all possible graph structures, and can be easily extended to general discrete variables.
The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups
The results provide a novel molecular stratification of the breast cancer population, derived from the impact of somatic CNAs on the transcriptome, and identify novel subgroups with distinct clinical outcomes, which reproduced in the validation cohort.
Extended Bayesian information criteria for model selection with large model spaces
This paper re-examine the Bayesian paradigm for model selection and proposes an extended family of Bayesian information criteria, which take into account both the number of unknown parameters and the complexity of the model space.