Later-stage Minimum Bayes-Risk Decoding for Neural Machine Translation


Sequence generation models have long relied on beam search as their decoding algorithm. However, the performance of beam search degrades when the model is over-confident about a suboptimal prediction. In this work, we enhance beam search by performing minimum Bayes-risk (MBR) decoding for some extra steps at a later stage. In our experiments…
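To make the contrast between picking the most probable hypothesis and MBR selection concrete, the following is a minimal sketch of generic MBR selection over a fixed candidate list. It is not the later-stage procedure proposed in the paper, whose details are not given in this abstract. The function names (`unigram_f1`, `mbr_select`), the toy candidates, and their probabilities are all illustrative assumptions, and unigram F1 stands in for whatever utility function a real system would use.

```python
from collections import Counter


def unigram_f1(hyp: str, ref: str) -> float:
    """Toy utility (an assumption for this sketch): unigram F1 between two strings."""
    hyp_counts = Counter(hyp.split())
    ref_counts = Counter(ref.split())
    overlap = sum((hyp_counts & ref_counts).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(hyp_counts.values())
    recall = overlap / sum(ref_counts.values())
    return 2 * precision * recall / (precision + recall)


def mbr_select(candidates, probs, utility=unigram_f1):
    """Return the candidate with the highest expected utility.

    Each candidate is scored against every candidate (treated as a
    pseudo-reference), weighted by the normalized model probability.
    Maximizing expected utility is equivalent to minimizing Bayes risk.
    """
    total = sum(probs)
    best, best_score = None, float("-inf")
    for hyp in candidates:
        expected = sum(
            (p / total) * utility(hyp, ref)
            for ref, p in zip(candidates, probs)
        )
        if expected > best_score:
            best, best_score = hyp, expected
    return best


# Hypothetical candidates and model probabilities, e.g. from a finished beam.
candidates = ["the cat sat", "a cat sat down", "the cat sat down"]
probs = [0.40, 0.35, 0.25]

# The most probable hypothesis is what plain beam search would return.
most_probable = max(zip(candidates, probs), key=lambda cp: cp[1])[0]
mbr_choice = mbr_select(candidates, probs)
```

In this toy case the most probable candidate is "the cat sat", while MBR selects "the cat sat down" because it agrees most with the other candidates on average. This illustrates how MBR can move away from a single hypothesis that the model over-weights.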

2 Figures and Tables


