Author pages are created from data sourced from our academic publisher partnerships and public sources.
- Publications
- Influence
Share This Author
XLNet: Generalized Autoregressive Pretraining for Language Understanding
- Zhilin Yang, Zihang Dai, Yiming Yang, J. Carbonell, R. Salakhutdinov, Quoc V. Le
- Computer ScienceNeurIPS
- 19 June 2019
TLDR
Transformer-XL: Attentive Language Models beyond a Fixed-Length Context
- Zihang Dai, Zhilin Yang, Yiming Yang, J. Carbonell, Quoc V. Le, R. Salakhutdinov
- Computer ScienceACL
- 9 January 2019
TLDR
Unsupervised Data Augmentation for Consistency Training
- Qizhe Xie, Zihang Dai, E. Hovy, Minh-Thang Luong, Quoc V. Le
- Computer ScienceNeurIPS
- 29 April 2019
TLDR
Breaking the Softmax Bottleneck: A High-Rank RNN Language Model
- Zhilin Yang, Zihang Dai, R. Salakhutdinov, William W. Cohen
- Computer ScienceICLR
- 10 November 2017
TLDR
Good Semi-supervised Learning That Requires a Bad GAN
- Zihang Dai, Zhilin Yang, Fan Yang, William W. Cohen, R. Salakhutdinov
- Computer ScienceNIPS
- 1 May 2017
TLDR
Unsupervised Data Augmentation
- Qizhe Xie, Zihang Dai, E. Hovy, Minh-Thang Luong, Quoc V. Le
- Computer ScienceArXiv
- 29 April 2019
TLDR
Controllable Invariance through Adversarial Feature Learning
- Qizhe Xie, Zihang Dai, Yulun Du, E. Hovy, Graham Neubig
- Computer ScienceNIPS
- 31 May 2017
TLDR
Meta Pseudo Labels
- Hieu Pham, Qizhe Xie, Zihang Dai, Quoc V. Le
- Computer ScienceIEEE/CVF Conference on Computer Vision and…
- 23 March 2020
We present Meta Pseudo Labels, a semi-supervised learning method that achieves a new state-of-the-art top-1 accuracy of 90.2% on ImageNet, which is 1.6% better than the existing state-of-the-art…
Pay Attention to MLPs
- Hanxiao Liu, Zihang Dai, David R. So, Quoc V. Le
- Computer ScienceNeurIPS
- 17 May 2021
TLDR
SimVLM: Simple Visual Language Model Pretraining with Weak Supervision
- Zirui Wang, Jiahui Yu, Adams Wei Yu, Zihang Dai, Yulia Tsvetkov, Yuan Cao
- Computer ScienceArXiv
- 24 August 2021
(b)). These results suggest zero-shot cross-modality transfer emerges with the scaling of weakly labeled data.
...
1
2
3
4
...