• Corpus ID: 237385926

Optimal subgroup selection

  title={Optimal subgroup selection},
  author={Henry W. J. Reeve and Timothy I. Cannings and Richard J. Samworth},
In clinical trials and other applications, we often see regions of the feature space that appear to exhibit interesting behaviour, but it is unclear whether these observed phenomena are reflected at the population level. Focusing on a regression setting, we consider the subgroup selection challenge of identifying a region of the feature space on which the regression function exceeds a pre-determined threshold. We formulate the problem as one of constrained optimisation, where we seek a low… 
2 Citations

Figures from this paper

Variable selection, monotone likelihood ratio and group sparsity
3Department of Information Systems, Decision Sciences and Statistics ESSEC Business School 95000 Cergy, France e-mail: ndaoud@essec.edu Abstract: In the pivotal variable selection problem, we derive
Optimal nonparametric testing of Missing Completely At Random, and its connections to compatibility
An incompatibility index is used as a natural measure of ease of detectability, establish its key properties, and it is proved that the MCAR tests can attain the minimax separation rate according to this measure, up to logarithmic factors.


Subgroup identification from randomized clinical trial data.
This work considers the problem of identifying a subgroup of patients who may have an enhanced treatment effect in a randomized clinical trial, and it is desirable that the subgroup be defined by a limited number of covariates, and presents a method developed to find subgroups ofEnhanced treatment effect.
Subgroup selection in adaptive signature designs of confirmatory clinical trials
A simple and general characterization of the optimal subgroup that maximizes the power for demonstrating treatment efficacy or the expected gain based on a specified utility function is provided and a procedure for subgroup selection that involves prediction modelling, augmented inverse probability weighting and low dimensional maximization is proposed.
Subgroup identification in clinical trials via the predicted individual treatment effect
This work explores linear models and compares different estimation methods, such as maximum likelihood and the Lasso with and without randomized response, and evaluates via simulations the performance of using the predicted individual treatment effect to identify subgroups where a novel treatment leads to better outcomes compared to a control treatment.
Tutorial in biostatistics: data‐driven subgroup identification and analysis in clinical trials
This tutorial focuses on a general class of problems arising in data-driven subgroup analysis, namely, identification of biomarkers with strong predictive properties and patient subgroups with desirable characteristics such as improved benefit and/or safety.
Subgroup identification based on differential effect search—A recursive partitioning method for establishing response to treatment in patient subpopulations
This work develops a novel recursive partitioning method for identifying subgroups of subjects with enhanced treatment effects based on a differential effect search algorithm and provides guidance on key topics of interest that include generating multiple promising subgroups using different splitting criteria, choosing optimal values of complexity parameters via cross‐validation, and addressing Type I error rate inflation inherent in data mining applications using a resampling‐based method.
Minimax Optimal Level-Set Estimation
This paper describes a new methodology and associated theoretical analysis for rapid and accurate extraction of level sets of a multivariate function from noisy data through a novel error metric sensitive to both the error in the location of the level-set estimate and the deviation of the function from the critical level.
Responder identification in clinical trials with censored data
No improvement in the reporting of clinical trial subgroup effects in high-impact general medical journals
While the objective of this study is to determine if the use of subgroup analyses and multivariable risk indices has increased, whether statistical methodology has improved over time, and which study characteristics predict subgroup analysis, the results show a significant decrease in the reporting of sub group analyses using appropriate methods.
Machine learning analysis plans for randomised controlled trials: detecting treatment effect heterogeneity with strict control of type I error
A novel analysis strategy for detecting TEH in randomised data using ML methods, whilst ensuring proper control of the false positive discovery rate is presented, which finds that artesunate provides a differential benefit to patients with high numbers of circulating ring stage parasites.