• Corpus ID: 237563270

Regression Discontinuity Design with Potentially Many Covariates

  title={Regression Discontinuity Design with Potentially Many Covariates},
  author={Yoichi Arai and Taisuke Otsu and Myung Hwan Seo},
Abstract. This paper studies the case of possibly high-dimensional covariates in the regression discontinuity design (RDD) analysis. In particular, we propose estimation and inference methods for the RDD models with covariate selection which perform stably regardless of the number of covariates. The proposed methods combine the local approach using kernel weights with l1-penalization to handle high-dimensional covariates, and the combination is new in the literature. We provide theoretical and… 

Tables from this paper

Covariate Adjustment in Regression Discontinuity Designs
The Regression Discontinuity (RD) design is a widely used non-experimental method for causal inference and program evaluation. While its canonical formulation only requires a score and an outcome
Inference in Regression Discontinuity Designs with High-Dimensional Covariates
We study regression discontinuity designs in which many covariates, possibly much more than the number of observations, are available. We consider a two-step algorithm which first selects the set of


Including Covariates in the Regression Discontinuity Design
This article proposes a fully nonparametric kernel method to account for observed covariates in regression discontinuity designs (RDD), which may increase precision of treatment effect estimation. It
Regression Discontinuity Designs Using Covariates
Abstract We study regression discontinuity designs when covariates are included in the estimation. We examine local polynomial estimators that include discrete or continuous covariates in an additive
The lasso for high dimensional regression with a possible change point
  • S. Lee, M. Seo, Youngki Shin
  • Mathematics, Medicine
    Journal of the Royal Statistical Society. Series B, Statistical methodology
  • 2016
A high dimensional regression model with a possible change point due to a covariate threshold is considered and the lasso estimator of regression coefficients as well as the threshold parameter is developed, showing that oracle inequalities can be established without pretesting the existence of the threshold effect.
Inference on Treatment Effects after Selection Amongst High-Dimensional Controls
This work develops a novel estimation and uniformly valid inference method for the treatment effect in this setting, called the "post-double-selection" method, which resolves the problem of uniform inference after model selection for a large, interesting class of models.
Regression discontinuity designs with unknown discontinuity points: Testing and estimation
The regression discontinuity design has become a common framework among applied economists for measuring treatment effects. A key restriction of the existing literature is the assumption that the
Rdrobust: Software for Regression-discontinuity Designs
We describe a major upgrade to the Stata (and R) rdrobust package, which provides a wide array of estimation, inference, and falsification methods for the analysis and interpretation of
High-dimensional econometrics and regularized GMM
This chapter presents key concepts and theoretical results for analyzing estimation and inference in high-dimensional models. High-dimensional models are characterized by having a number of unknown
Bootstrapping Lasso Estimators
In this article, we consider bootstrapping the Lasso estimator of the regression parameter in a multiple linear regression model. It is known that the standard bootstrap method fails to be
Optimal Data-Driven Regression Discontinuity Plots
Exploratory data analysis plays a central role in applied statistics and econometrics. In the popular regression-discontinuity (RD) design, the use of graphical analysis has been strongly advocated
Robust Nonparametric Confidence Intervals for Regression‐Discontinuity Designs
In the regression‐discontinuity (RD) design, units are assigned to treatment based on whether their value of an observed covariate exceeds a known cutoff. In this design, local polynomial estimators