Retrospective Causal Inference with Machine Learning Ensembles: An Application to Anti-recidivism Policies in Colombia

  title={Retrospective Causal Inference with Machine Learning Ensembles: An Application to Anti-recidivism Policies in Colombia},
  author={Cyrus Samii and Laura Paler and Sarah Zukerman Daly},
  journal={Political Analysis},
  pages={434 - 456}
We present new methods to estimate causal effects retrospectively from micro data with the assistance of a machine learning ensemble. This approach overcomes two important limitations in conventional methods like regression modeling or matching: (i) ambiguity about the pertinent retrospective counterfactuals and (ii) potential misspecification, overfitting, and otherwise bias-prone or inefficient use of a large identifying covariate set in the estimation of causal effects. Our method targets… 
Relaxing Assumptions, Improving Inference: Utilizing Machine Learning for Valid Causal Inference∗
This work introduces a method that combines a machine learning approach to control for background covariates and learn patterns of interference with a regression to estimate the coefficient on the treatment variable of theoretical interest, allowing recovery of a causal effect.
Leveraging machine learning methods to estimate heterogeneous effects: father absence in China as an example
  • Ran Liu
  • Computer Science
    Chinese Sociological Review
  • 2021
Abstract Individuals differ in their personal and environmental characteristics, and the same treatment or condition may affect individuals in different ways or magnitudes. Heterogeneity in effects
Causal Inference through the Method of Direct Estimation
This work proposes a new approach, the method of direct estimation, that draws on both Bayesian sparse modeling and mediation analysis in order to obtain nonparametric estimates of treatment effects, and shows how the method can easily extend to structural estimators commonly used in a variety of disciplines.
Improved Multilevel Regression with Poststratification through Machine Learning (autoMrP)
This work proposes a systematic approach to estimating MrP models that addresses issues by employing a number of machine learning techniques and demonstrates that this approach outperforms a standard MrP model in which the choice of context-level variables has been informed by a rich tradition of public opinion research.
Estimating controlled direct effects through marginal structural models
Abstract When working with panel data, many researchers wish to estimate the direct effects of time-varying factors on future outcomes. However, when a baseline treatment affects both the confounders
How Effective Was the UK Carbon Tax? - A Machine Learning Approach to Policy Evaluation
Carbon taxes are commonly seen as a rational policy response to climate change, but little is known about their performance from an ex-post perspective. This paper analyzes the emissions and cost
How Effective Was the UK Carbon Tax?—A Machine Learning Approach to Policy Evaluation
Carbon taxes are commonly seen as a rational policy response to climate change, but little is known about their performance from an ex-post perspective. This paper analyzes the emissions and cost
Tree-Based Models for Political Science Data
A family of tree-based nonparametric techniques from the machine learning literature are introduced, arguing that, under specific circumstances, regression and classification tree models are an appropriate standard choice for analyzing high-dimensional data sets.
Estimating Heterogeneous Treatment Effects and the Effects of Heterogeneous Treatments with Ensemble Methods
It is shown how an ensemble of methods—weighted averages of estimates from individual models increasingly used in machine learning—accurately measure heterogeneous effects and how pooling models lead to superior performance to individual methods across diverse problems.
The Divided (But Not More Predictable) Electorate: A Machine Learning Analysis of Voting in American Presidential Elections
Partisan sorting by social groups is believed to increase affective polarization and decrease group-level leverage in representation. Mounting evidence suggests that social groups are increasingly


Machine Learning for Estimating Heterogeneous Causal Effects
This paper proposes several novel cross-validation criteria for this problem of estimating heterogeneity in causal effects in experimental or observational studies and demonstrates the conditions under which they perform better than standard methods for the problem of causal effects.
Balancing within the Margin: Causal Effect Estimation with Support Vector Machines∗
Matching and weighting methods are commonly used to reduce confounding bias in observational studies. Many existing methods are sensitive to user-provided inputs, provide little formal guidance in
A Framework for Dynamic Causal Inference in Political Science
Dynamic strategies are an essential part of politics. In the context of campaigns, for example, candidates continuously recalibrate their campaign strategy in response to polls and opponent actions.
A Machine-Learning Algorithm for Estimating and Ranking the Impact of Environmental Risk Factors in Exploratory Epidemiological Studies
This paper proposes an algorithm designed to address each of these limitations in turn by combining recent advances in the causal inference and multiple-testing literature along with modifications to traditional nonparametric inference methods.
Causal Inference With General Treatment Regimes
In this article we develop the theoretical properties of the propensity function, which is a generalization of the propensity score of Rosenbaum and Rubin. Methods based on the propensity score have
Kernel Regularized Least Squares: Reducing Misspecification Bias with a Flexible and Interpretable Machine Learning Approach
It is argued that the KRLS method is well-suited for social science inquiry because it avoids strong parametric assumptions, yet allows interpretation in ways analogous to generalized linear models while also permitting more complex interpretation to examine nonlinearities, interactions, and heterogeneous effects.
Population intervention models in causal inference.
This work proposes a new causal parameter, which is a natural extension of existing approaches to causal inference such as marginal structural models, and provides a class of inverse-probability-of-treatment-weighted and doubly-robust estimators of the causal parameters in these models.
Opiates for the Matches: Matching Methods for Causal Inference
In recent years, there has been a burst of innovative work on methods for estimating causal effects using observational data. Much of this work has extended and brought a renewed focus on old
Bayesian Nonparametric Modeling for Causal Inference
Researchers have long struggled to identify causal effects in nonexperimental settings. Many recently proposed strategies assume ignorability of the treatment assignment mechanism and require fitting
Estimation of Heterogeneous Treatment Effects from Randomized Experiments, with Application to the Optimal Planning of the Get-Out-the-Vote Campaign
Although a growing number of political scientists are conducting randomized experiments, many of them only report the average treatment effects and do not systematically explore the variation in