Identifiable Energy-based Representations: An Application to Estimating Heterogeneous Causal Effects
@article{Zhang2021IdentifiableER, title={Identifiable Energy-based Representations: An Application to Estimating Heterogeneous Causal Effects}, author={Yao Zhang and Jeroen Berrevoets and Mihaela van der Schaar}, journal={ArXiv}, year={2021}, volume={abs/2108.03039} }
Conditional average treatment effects (CATEs) allow us to understand the effect heterogeneity across a large population of individuals. However, typical CATE learners assume all confounding variables are measured in order for the CATE to be identifiable. This requirement can be satisfied by collecting many variables, at the expense of increased sample complexity for estimating CATEs. To combat this, we propose an energy-based model (EBM) that learns a low-dimensional representation of the…
Figures and Tables from this paper
One Citation
Causal machine learning for healthcare and precision medicine
- Computer ScienceRoyal Society Open Science
- 2022
Important challenges present in healthcare applications such as processing high-dimensional and unstructured data, generalization to out-of-distribution samples and temporal relationships, that despite the great effort from the research community remain to be solved are discussed.
References
SHOWING 1-10 OF 88 REFERENCES
Robust inference of conditional average treatment effects using dimension reduction.
- MathematicsStatistica Sinica
- 2022
This article proposes a double dimension reduction method, which reduces the curse of dimensionality as much as possible while keeping the nonparametric merit, and identifies the central mean subspace of the conditional average treatment effect using dimension reduction.
Metalearners for estimating heterogeneous treatment effects using machine learning
- Computer ScienceProceedings of the National Academy of Sciences
- 2019
A metalearner, the X-learner, is proposed, which can adapt to structural properties, such as the smoothness and sparsity of the underlying treatment effect, and is shown to be easy to use and to produce results that are interpretable.
Optimal doubly robust estimation of heterogeneous causal effects
- Mathematics, Computer Science
- 2020
A two-stage doubly robust CATE estimator is studied and a generic model-free error bound is given and it is shown that this estimator can be oracle efficient under even weaker conditions, if used with a specialized form of sample splitting and careful choices of tuning parameters.
High-Dimensional Feature Selection for Sample Efficient Treatment Effect Estimation
- Mathematics, Computer ScienceAISTATS
- 2021
A common objective function involving outcomes across treatment cohorts with nonconvex joint sparsity regularization that is guaranteed to recover $S$ with high probability under a linear outcome model for $Y$ and subgaussian covariates for each of the treatment cohort is proposed.
Learning Overlapping Representations for the Estimation of Individualized Treatment Effects
- Computer ScienceAISTATS
- 2020
A deep kernel regression algorithm and posterior regularization framework that substantially outperforms the state-of-the-art on a variety of benchmarks data sets and demonstrates the dependence on domain overlap and the need for invertible latent maps is developed.
Quasi-oracle estimation of heterogeneous treatment effects
- Computer Science, Mathematics
- 2017
This paper develops a general class of two-step algorithms for heterogeneous treatment effect estimation in observational studies that have a quasi-oracle property, and implements variants of this approach based on penalized regression, kernel ridge regression, and boosting, and find promising performance relative to existing baselines.
Estimating individual treatment effect: generalization bounds and algorithms
- Computer ScienceICML
- 2017
A novel, simple and intuitive generalization-error bound is given showing that the expected ITE estimation error of a representation is bounded by a sum of the standard generalized-error of that representation and the distance between the treated and control distributions induced by the representation.
Sufficient dimension reduction for average causal effect estimation
- Computer Science, MathematicsData Mining and Knowledge Discovery
- 2022
It is proved that a large covariate set can be reduced to a lower dimensional representation which captures the complete information for adjustment in causal effect estimation, and an algorithm is developed that employs a supervised kernel dimension reduction method to learn a lowerdimensional representation from the original covariate space.
An Identifiable Double VAE For Disentangled Representations
- Computer ScienceICML
- 2021
This work proposes a novel VAE-based generative model with theoretical guarantees on identifiability, and obtains its conditional prior over the latents by learning an optimal representation, which imposes an additional strength on their regularization.
Double/Debiased Machine Learning for Treatment and Structural Parameters
- Computer Science
- 2017
This work revisits the classic semiparametric problem of inference on a low dimensional parameter θ_0 in the presence of high-dimensional nuisance parameters η_0 and proves that DML delivers point estimators that concentrate in a N^(-1/2)-neighborhood of the true parameter values and are approximately unbiased and normally distributed, which allows construction of valid confidence statements.