Shape-constrained partial identification of a population mean under unknown probabilities of sample selection

Luke W. Miratrix, Stefan Wager, José R. Zubizarreta
arXiv: Statistics Theory
A prevailing challenge in the biomedical and social sciences is to estimate a population mean from a sample obtained with unknown selection probabilities. Using a well-known ratio estimator, Aronow and Lee (2013) proposed a method for partial identification of the mean by allowing the unknown selection probabilities to vary arbitrarily between two fixed extreme values. In this paper, we show how to leverage auxiliary shape constraints on the population outcome distribution, such as symmetry or… 
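The interval idea attributed above to Aronow and Lee (2013) can be illustrated with a small sketch. It assumes the "well-known ratio estimator" is the usual inverse-probability-weighted ratio sum(w_i * y_i) / sum(w_i) with w_i = 1 / p_i, and that every selection probability p_i lies in [p_min, p_max]. The function name `ratio_estimator_bounds` and its parameters are hypothetical, chosen for illustration; this is not the authors' implementation and it does not include the shape constraints that the present paper adds. It relies on the standard fact that a ratio of this form, optimized over box-constrained weights, attains its extremes at a threshold assignment on the sorted outcomes.

```python
from itertools import accumulate


def ratio_estimator_bounds(y, p_min, p_max):
    """Bounds on sum(w*y)/sum(w) when each w_i = 1/p_i and p_i is in [p_min, p_max].

    Illustrative sketch only (hypothetical helper, not from the paper).
    """
    if not (0 < p_min <= p_max <= 1):
        raise ValueError("need 0 < p_min <= p_max <= 1")
    if len(y) == 0:
        raise ValueError("need at least one observation")

    # Inverse-probability weights range over [1/p_max, 1/p_min].
    w_lo, w_hi = 1.0 / p_max, 1.0 / p_min

    ys = sorted(y)
    n = len(ys)
    prefix = [0.0] + list(accumulate(ys))  # prefix[k] = sum of the k smallest outcomes
    total = prefix[-1]

    lower, upper = float("inf"), float("-inf")
    for k in range(n + 1):
        small, large = prefix[k], total - prefix[k]
        # Upper bound candidate: small weight on the k smallest outcomes, large weight on the rest.
        upper = max(upper, (w_lo * small + w_hi * large) / (w_lo * k + w_hi * (n - k)))
        # Lower bound candidate: large weight on the k smallest outcomes, small weight on the rest.
        lower = min(lower, (w_hi * small + w_lo * large) / (w_hi * k + w_lo * (n - k)))
    return lower, upper
```

For example, `ratio_estimator_bounds([0, 1], 0.25, 0.5)` returns an interval of roughly (0.333, 0.667) around the unweighted mean 0.5, and the interval collapses to the sample mean when `p_min == p_max`.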

An Interval Estimation Approach to Sample Selection Bias

A widespread and largely unaddressed challenge in statistics is that non-random participation in study samples can bias the estimation of parameters of interest. To address this problem, we propose a

Sample-constrained partial identification with application to selection bias

Many partial identification problems can be characterized by the optimal value of a function over a set where both the function and set need to be estimated by empirical data. Despite some progress

An Interval Estimation Approach to Selection Bias in Observational Studies

A persistent challenge in observational studies is that non-random participation in study samples can result in biased estimates of parameters of interest. To address this problem, we present a

Sensitivity analysis for inverse probability weighting estimators via the percentile bootstrap

To identify the estimand in missing data problems and observational studies, it is common to base the statistical estimation on the ‘missingness at random’ and ‘no unmeasured confounder’ assumptions.

Design-Based Uncertainty for Quasi-Experiments

Social scientists are often interested in estimating causal effects in settings where all units in the population are observed (e.g. all 50 US states). Design-based approaches, which view the

Bounds and semiparametric inference in $L^\infty$- and $L^2$-sensitivity analysis for observational studies

Sensitivity analysis for the unconfoundedness assumption is a crucial component of observational studies. The marginal sensitivity model has become increasingly popular for this purpose due to its

Bounds on the conditional and average treatment effect with unobserved confounding factors

A loss minimization approach that quantifies bounds on the conditional average treatment effect (CATE) when unobserved confounders have a bounded effect on the odds of treatment selection, and a semi-parametric framework that extends/bounds the augmented inverse propensity weighted (AIPW) estimator for the ATE beyond the assumption that all confounders are observed.

Learning from a Biased Sample

Applying the distributionally robust optimization framework, a method for learning a decision rule that minimizes the worst-case risk incurred under a family of test distributions that can generate the training distribution under Γ-biased sampling is proposed, which is equivalent to an augmented convex risk minimization problem.

Sharp Sensitivity Analysis for Inverse Propensity Weighting via Quantile Balancing

Inverse propensity weighting (IPW) is a popular method for estimating treatment effects from observational data. However, its correctness relies on the untestable (and frequently implausible)

Sensitivity Analysis with the $R^2$-calculus

Causal inference necessarily relies upon untestable assumptions; hence, it is crucial to assess the robustness of obtained results to violations of identification assumptions. However, such



Interval estimation of population means under unknown but bounded probabilities of sample selection

Applying concepts from partial identification to the domain of finite population sampling, we propose a method for interval estimation of a population mean when the probabilities of sample selection

Semiparametric Exponential Families for Heavy-Tailed Data

We propose a semiparametric method for fitting the tail of a heavy-tailed population given a relatively small sample from that population and a larger sample from a related background population. We

Spectral Density Ratio Models for Multivariate Extremes

The modeling of multivariate extremes has received increasing recent attention because of its importance in risk assessment. In classical statistics of extremes, the joint distribution of two or more

Using specially designed exponential families for density estimation

in this paper have Y being portions of the real line or of the plane, but the methodology applies just as well to higher dimensionalities and to more complicated spaces. Estimates of g(y) are

Empirical likelihood ratio confidence intervals for a single functional

The empirical distribution function based on a sample is well known to be the maximum likelihood estimate of the distribution from which the sample was taken. In this paper the likelihood

On the theory of ratio estimates

Estimated variances, yielded by large sample approach, are adjusted by a proportional regression approach; subsequently, under the assumption of normality, exact statements on confidence intervals

Inference and Modeling with Log-concave Distributions

Log-concave distributions are an attractive choice for modeling and inference, for several reasons: The class of log-concave distributions contains most of the commonly used parametric distributions

Partial Identification of Probability Distributions

Missing Outcomes.- Instrumental Variables.- Conditional Prediction with Missing Data.- Contaminated Outcomes.- Regressions, Short and Long.- Response-Based Sampling.- Analysis of Treatment Response.-

On logarithmic concave measures and functions

The purpose of the present paper is to give a new proof for the main theorem proved in [3] and develop further properties of logarithmic concave measures and functions. Having in mind the

Convex Optimization

A comprehensive introduction to the subject of convex optimization shows in detail how such problems can be solved numerically with great efficiency.