• Corpus ID: 236950538

Identifiable Energy-based Representations: An Application to Estimating Heterogeneous Causal Effects

  title={Identifiable Energy-based Representations: An Application to Estimating Heterogeneous Causal Effects},
  author={Yao Zhang and Jeroen Berrevoets and Mihaela van der Schaar},
Conditional average treatment effects (CATEs) allow us to understand the effect heterogeneity across a large population of individuals. However, typical CATE learners assume all confounding variables are measured in order for the CATE to be identifiable. This requirement can be satisfied by collecting many variables, at the expense of increased sample complexity for estimating CATEs. To combat this, we propose an energy-based model (EBM) that learns a low-dimensional representation of the… 
1 Citations

Figures and Tables from this paper

Causal machine learning for healthcare and precision medicine

Important challenges present in healthcare applications such as processing high-dimensional and unstructured data, generalization to out-of-distribution samples and temporal relationships, that despite the great effort from the research community remain to be solved are discussed.



Robust inference of conditional average treatment effects using dimension reduction.

This article proposes a double dimension reduction method, which reduces the curse of dimensionality as much as possible while keeping the nonparametric merit, and identifies the central mean subspace of the conditional average treatment effect using dimension reduction.

Metalearners for estimating heterogeneous treatment effects using machine learning

A metalearner, the X-learner, is proposed, which can adapt to structural properties, such as the smoothness and sparsity of the underlying treatment effect, and is shown to be easy to use and to produce results that are interpretable.

Optimal doubly robust estimation of heterogeneous causal effects

A two-stage doubly robust CATE estimator is studied and a generic model-free error bound is given and it is shown that this estimator can be oracle efficient under even weaker conditions, if used with a specialized form of sample splitting and careful choices of tuning parameters.

High-Dimensional Feature Selection for Sample Efficient Treatment Effect Estimation

A common objective function involving outcomes across treatment cohorts with nonconvex joint sparsity regularization that is guaranteed to recover $S$ with high probability under a linear outcome model for $Y$ and subgaussian covariates for each of the treatment cohort is proposed.

Learning Overlapping Representations for the Estimation of Individualized Treatment Effects

A deep kernel regression algorithm and posterior regularization framework that substantially outperforms the state-of-the-art on a variety of benchmarks data sets and demonstrates the dependence on domain overlap and the need for invertible latent maps is developed.

Quasi-oracle estimation of heterogeneous treatment effects

This paper develops a general class of two-step algorithms for heterogeneous treatment effect estimation in observational studies that have a quasi-oracle property, and implements variants of this approach based on penalized regression, kernel ridge regression, and boosting, and find promising performance relative to existing baselines.

Estimating individual treatment effect: generalization bounds and algorithms

A novel, simple and intuitive generalization-error bound is given showing that the expected ITE estimation error of a representation is bounded by a sum of the standard generalized-error of that representation and the distance between the treated and control distributions induced by the representation.

Sufficient dimension reduction for average causal effect estimation

It is proved that a large covariate set can be reduced to a lower dimensional representation which captures the complete information for adjustment in causal effect estimation, and an algorithm is developed that employs a supervised kernel dimension reduction method to learn a lowerdimensional representation from the original covariate space.

An Identifiable Double VAE For Disentangled Representations

This work proposes a novel VAE-based generative model with theoretical guarantees on identifiability, and obtains its conditional prior over the latents by learning an optimal representation, which imposes an additional strength on their regularization.

Double/Debiased Machine Learning for Treatment and Structural Parameters

This work revisits the classic semiparametric problem of inference on a low dimensional parameter θ_0 in the presence of high-dimensional nuisance parameters η_0 and proves that DML delivers point estimators that concentrate in a N^(-1/2)-neighborhood of the true parameter values and are approximately unbiased and normally distributed, which allows construction of valid confidence statements.