Author pages are created from data sourced from our academic publisher partnerships and public sources.
An Optimistic Perspective on Offline Reinforcement Learning
- Rishabh Agarwal, D. Schuurmans, Mohammad Norouzi
- Computer Science · ICML
- 10 July 2019
AlgaeDICE: Policy Gradient from Arbitrary Experience
- Ofir Nachum, Bo Dai, Ilya Kostrikov, Yinlam Chow, Lihong Li, D. Schuurmans
- Computer Science · ArXiv
- 4 December 2019
On the Global Convergence Rates of Softmax Policy Gradient Methods
- Jincheng Mei, Chenjun Xiao, Csaba Szepesvari, D. Schuurmans
- Computer Science · ICML
- 13 May 2020
GenDICE: Generalized Offline Estimation of Stationary Values
- Ruiyi Zhang, Bo Dai, Lihong Li, D. Schuurmans
- Computer Science · ICLR
- 21 February 2020
Systolic Peak Detection in Acceleration Photoplethysmograms Measured from Emergency Responders in Tropical Conditions
- M. Elgendi, I. Norton, M. Brearley, D. Abbott, D. Schuurmans
- Computer Science · PLoS ONE
- 22 October 2013
Domain Aggregation Networks for Multi-Source Domain Adaptation
- Junfeng Wen, R. Greiner, D. Schuurmans
- Computer Science · ICML
- 11 September 2019
Off-Policy Evaluation via the Regularized Lagrangian
- Mengjiao Yang, Ofir Nachum, Bo Dai, Lihong Li, D. Schuurmans
- Mathematics · NeurIPS
- 7 July 2020
EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL
- Seyed Kamyar Seyed Ghasemipour, D. Schuurmans, S. Gu
- Computer Science · ICML
- 21 July 2020
CoinDICE: Off-Policy Confidence Interval Estimation
- Bo Dai, Ofir Nachum, Yinlam Chow, Lihong Li, Csaba Szepesvari, D. Schuurmans
- Computer Science, Mathematics · NeurIPS
- 22 October 2020
Variational Rejection Sampling
- Aditya Grover, R. Gummadi, M. Lázaro-Gredilla, D. Schuurmans, S. Ermon
- Computer Science · AISTATS
- 31 March 2018