Multi-stage optimal dynamic treatment regimes for survival outcomes with dependent censoring

  title={Multi-stage optimal dynamic treatment regimes for survival outcomes with dependent censoring},
  author={Hunyong Cho and Shannon T. Holloway and David J Couper and Michael R. Kosorok},
We propose a reinforcement learning method for estimating an optimal dynamic treatment regime for survival outcomes with dependent censoring. The estimator allows the failure time to be conditionally independent of censoring and dependent on the treatment decision times, supports a flexible number of treatment arms and treatment stages, and can maximize either the mean survival time or the survival probability at a certain time point. The estimator is constructed using generalized random… 

Figures and Tables from this paper



On estimation of optimal treatment regimes for maximizing t‐year survival probability

This work proposes two nonparametric estimators for the survival function of patients following a given treatment regime involving one or more decisions, i.e., the so-called value, and estimates an optimal regime by maximizing these estimator for the value over a prespecified class of regimes.

Semiparametric efficient estimation of survival distributions in two-stage randomisation designs in clinical trials with censored data

Two-stage randomisation designs are useful in the evaluation of combination therapies where patients are initially randomised to an induction therapy and then, depending upon their response and


This paper presents a novel Q-learning algorithm that is adjusted for censored data and allows a flexible number of stages, and shows that when the optimal Q-function belongs to the approximation space, the expected survival time for policies obtained by the algorithm converges to that of the optimal policy.

Estimating Optimal Dynamic Treatment Regimes With Survival Outcomes

A doubly robust, easy to implement method for estimating optimal DTRs with survival endpoints subject to right-censoring which requires solving a series of weighted generalized estimating equations.

Interactive Q-Learning for Quantiles

This work derives estimators of decision rules for optimizing probabilities and quantiles computed with respect to the response distribution for two-stage, binary treatment settings and illustrates the approach with data from a sequentially randomized trial where the primary outcome is remission of depression symptoms.

Recovery of Information and Adjustment for Dependent Censoring Using Surrogate Markers

A class of tests and estimators for the parameters of the Cox proportional hazards model, the accelerated failure time model, and a model for the effect of treatment on the mean of a response

Bayesian Nonparametric Estimation for Dynamic Treatment Regimes With Sequential Transition Times

A dataset arising from a clinical trial involving multi-stage chemotherapy regimes for acute leukemia is analyzed, and the proposed nonparametric Bayesian approach can substantially improve inference compared to existing methods.

Survival ensembles.

A unified and flexible framework for ensemble learning in the presence of censoring for right-censored data is proposed and a random forest algorithm and a generic gradient boosting algorithm are introduced for the construction of prognostic and diagnostic models.

Dynamic Regime Marginal Structural Mean Models for Estimation of Optimal Dynamic Treatment Regimes, Part I: Main Content

This article describes an approach to estimate the optimal dynamic treatment regime among a set of enforceable regimes, comprised by regimes defined by simple rules based on a subset of past information and discusses locally efficient, double-robust estimation of the model parameters and of the index of the optimal treatment regime in the set.

Tree-structured survival analysis.

In this note, tree-structured recursive partitioning schemes for classification, probability class estimation, and regression are adapted to cover censored survival analysis, applicable to more general situations than are those of the famous semi-parametric model of Cox.