On the Weaknesses of Reinforcement Learning for Neural Machine Translation
@article{Choshen2020OnTW, title={On the Weaknesses of Reinforcement Learning for Neural Machine Translation}, author={Leshem Choshen and Lior Fox and Zohar Aizenbud and Omri Abend}, journal={ArXiv}, year={2020}, volume={abs/1907.01752} }
Reinforcement learning (RL) is frequently used to increase performance in text generation tasks, including machine translation (MT), notably through the use of Minimum Risk Training (MRT) and Generative Adversarial Networks (GAN). However, little is known about what and how these methods learn in the context of MT. We prove that one of the most common RL methods for MT does not optimize the expected reward, as well as show that other methods take an infeasibly long time to converge. In fact… CONTINUE READING
Figures, Tables, and Topics from this paper
14 Citations
BERT as a Teacher: Contextual Embeddings for Sequence-Level Reward
- Computer Science, Mathematics
- ArXiv
- 2020
- 3
- PDF
Energy-Based Reranking: Improving Neural Machine Translation Using Energy-Based Models
- Computer Science, Mathematics
- ArXiv
- 2020
- 2
- PDF
MLE-guided parameter search for task loss minimization in neural sequence modeling
- Computer Science, Mathematics
- ArXiv
- 2020
- 1
- PDF
Exploring Fluent Query Reformulations with Text-to-Text Transformers and Reinforcement Learning
- Computer Science
- ArXiv
- 2020
- PDF
Learning from Human Feedback: Challenges for Real-World Reinforcement Learning in NLP
- Computer Science
- ArXiv
- 2020
- PDF
References
SHOWING 1-10 OF 48 REFERENCES
A Study of Reinforcement Learning for Neural Machine Translation
- Computer Science, Mathematics
- EMNLP
- 2018
- 70
- PDF
Classical Structured Prediction Losses for Sequence to Sequence Learning
- Computer Science
- NAACL-HLT
- 2018
- 104
- Highly Influential
- PDF
Language Generation with Recurrent Generative Adversarial Networks without Pre-training
- Computer Science
- ArXiv
- 2017
- 79
- PDF
SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient
- Computer Science, Mathematics
- AAAI
- 2017
- 1,219
- PDF
Improving Neural Machine Translation with Conditional Sequence Generative Adversarial Nets
- Computer Science
- NAACL-HLT
- 2018
- 117
- Highly Influential
- PDF
Lexicons and Minimum Risk Training for Neural Machine Translation: NAIST-CMU at WAT2016
- Computer Science
- WAT@COLING
- 2016
- 21
- PDF