Recursive partitioning for heterogeneous causal effects

@article{Athey2016RecursivePF,
  title={Recursive partitioning for heterogeneous causal effects},
  author={Susan Athey and Guido Imbens},
  journal={Proceedings of the National Academy of Sciences},
  year={2016},
  volume={113},
  pages={7353 - 7360}
}
  • S. Athey, G. Imbens
  • Published 5 April 2015
  • Mathematics
  • Proceedings of the National Academy of Sciences
In this paper we propose methods for estimating heterogeneity in causal effects in experimental and observational studies and for conducting hypothesis tests about the magnitude of differences in treatment effects across subsets of the population. We provide a data-driven approach to partition the data into subpopulations that differ in the magnitude of their treatment effects. The approach enables the construction of valid confidence intervals for treatment effects, even with many covariates… 

Tables from this paper

NON-PARAMETRIC INTERPRETABLE SCORE BASED ESTIMATION OF HETEROGENEOUS TREATMENT EFFECTS
TLDR
A non-parametric framework for estimating the Conditional Average Treatment Effect (CATE) function, which naturally stratifies treatment effects into subgroups over a 2d grid whose axis are the propensity and prognostic scores.
2D score based estimation of heterogeneous treatment effects
In the study of causal inference, statisticians show growing interest in estimating and analyzing heterogeneity in causal effects in observational studies. However, there usually exists a trade-off
Bayesian Regression Tree Models for Causal Inference: Regularization, Confounding, and Heterogeneous Effects
TLDR
The Bayesian causal forest model permits treatment effect heterogeneity to be regularized separately from the prognostic effect of control variables, making it possible to informatively "shrink to homogeneity".
Shrinkage Bayesian Causal Forests for Heterogeneous Treatment Effects Estimation*
TLDR
A sparsity-inducing version of Bayesian Causal Forests, a recently proposed nonparametric causal regression model that employs Bayesian Additive Regression Trees, is developed with an additional pair of priors allowing the model to adjust the weight of each covariate through the corresponding number of splits in the tree ensemble.
Sparse Bayesian Causal Forests for Heterogeneous Treatment Effects Estimation
This paper develops a sparsity-inducing version of Bayesian Causal Forests, a recently proposed nonparametric causal regression model that employs Bayesian Additive Regression Trees and is
The Econometrics of Randomized Experiments
Counterfactual Prediction Methods for Causal Inference in Observational Studies with Continuous Treatments
We develop methods for estimation, inference and optimization of causal effects from observational data with continuous treatments. We present a counterfactual prediction method based on the
Estimation and Inference of Heterogeneous Treatment Effects using Random Forests
  • Stefan Wager, S. Athey
  • Mathematics, Computer Science
    Journal of the American Statistical Association
  • 2018
TLDR
This is the first set of results that allows any type of random forest, including classification and regression forests, to be used for provably valid statistical inference and is found to be substantially more powerful than classical methods based on nearest-neighbor matching.
Effect or Treatment Heterogeneity? Policy Evaluation with Aggregated and Disaggregated Treatments
Binary treatments in empirical practice are often (i) ex-post aggregates of multiple treatments or (ii) can be disaggregated into multiple treatment versions after assignment. In such cases it is
A Generalized Framework for the Estimation of Causal Moderation Effects with Randomized Treatments and Non-Randomized Moderators
Researchers are often interested in analyzing conditional treatment effects. One variant of this is "causal moderation," which implies that intervention upon a third (moderator) variable would alter
...
...

References

SHOWING 1-10 OF 56 REFERENCES
Modeling heterogeneous treatment effects in large-scale experiments using Bayesian Additive Regression Trees
We present a methodology that largely automates the search for systematic treatment effect heterogeneity in large-scale experiments. We introduce a nonparametric estimator developed in statistical
Estimation and Inference of Heterogeneous Treatment Effects using Random Forests
  • Stefan Wager, S. Athey
  • Mathematics, Computer Science
    Journal of the American Statistical Association
  • 2018
TLDR
This is the first set of results that allows any type of random forest, including classification and regression forests, to be used for provably valid statistical inference and is found to be substantially more powerful than classical methods based on nearest-neighbor matching.
Efficient Estimation of Average Treatment Effects Using the Estimated Propensity Score
TLDR
It is shown that weighting with the inverse of a nonparametric estimate of the propensity Score, rather than the true propensity score, leads to efficient estimates of the various average treatment effects, whether the pre-treatment variables have discrete or continuous distributions.
Nonparametric Tests for Treatment Effect Heterogeneity
In this paper we develop two nonparametric tests of treatment effect heterogeneity. The first test is for the null hypothesis that the treatment has a zero average effect for all subpopulations
Estimating treatment effect heterogeneity in randomized program evaluation
TLDR
This paper proposes a method that adapts the Support Vector Machine classifier by placing separate sparsity constraints over the pre-treatment parameters and causal heterogeneity parameters of interest, and selects the most effective voter mobilization strategies from a large number of alternative strategies.
The central role of the propensity score in observational studies for causal effects
Abstract : The results of observational studies are often disputed because of nonrandom treatment assignment. For example, patients at greater risk may be overrepresented in some treatment group.
Heterogeneous Treatment Effects in Digital Experimentation
TLDR
A fast and scalable Bayesian nonparametric analysis of heterogeneity and its measurement in relation to observable covariates leads to a novel estimator of heterogeneity that is based around the distribution of covariates pooled across treatment groups.
A Simple Method for Estimating Interactions Between a Treatment and a Large Number of Covariates
TLDR
It is shown that coupled with an efficiency augmentation procedure, this method produces clinically meaningful estimators in a variety of settings and can be useful for practicing personalized medicine: determining from a large set of biomarkers, the subset of patients that can potentially benefit from a treatment.
Estimating causal effects of treatments in randomized and nonrandomized studies.
A discussion of matching, randomization, random sampling, and other methods of controlling extraneous variation is presented. The objective is to specify the benefits of randomization in estimating
Causal Inference for Statistics, Social, and Biomedical Sciences: An Introduction
Most questions in social and biomedical sciences are causal in nature: what would happen to individuals, or to groups, if part of their environment were changed? In this groundbreaking text, two
...
...