Findings of the 2014 Workshop on Statistical Machine Translation
- Ondrej Bojar, C. Buck, A. Tamchyna
- Computer ScienceWMT@ACL
- 1 June 2014
This paper presents the results of the WMT14 shared tasks, which included a standard news translation task, a separate medical translation task, a task for run-time estimation of machine translation…
Findings of the 2012 Workshop on Statistical Machine Translation
- Chris Callison-Burch, Philipp Koehn, Christof Monz, Matt Post, Radu Soricut, Lucia Specia
- Computer Science, PsychologyWMT@NAACL-HLT
- 7 June 2012
A large-scale manual evaluation of 103 machine translation systems submitted by 34 teams was conducted, which used the ranking of these systems to measure how strongly automatic metrics correlate with human judgments of translation quality for 12 evaluation metrics.
(Meta-) Evaluation of Machine Translation
- Chris Callison-Burch, C. Fordyce, Philipp Koehn, Christof Monz, J. Schroeder
- Computer ScienceWMT@ACL
- 23 June 2007
An extensive human evaluation was carried out not only to rank the different MT systems, but also to perform higher-level analysis of the evaluation process, revealing surprising facts about the most commonly used methodologies.
Findings of the 2016 Conference on Machine Translation
- Ondrej Bojar, R. Chatterjee, Marcos Zampieri
- Computer ScienceConference on Machine Translation
- 12 August 2016
This paper presents the results of the WMT16 shared tasks, which included five machine translation (MT) tasks (standard news, IT-domain, biomedical, multimodal, pronoun), three evaluation tasks…
Findings of the 2018 Conference on Machine Translation (WMT18)
- Ondrej Bojar, C. Federmann, Christof Monz
- Computer Science, PsychologyConference on Machine Translation
- 31 October 2018
This paper presents the results of the premier shared task organized alongside the Conference on Machine Translation (WMT) 2018. Participants were asked to build machine translation systems for any…
Findings of the 2017 Conference on Machine Translation (WMT17)
- Ondrej Bojar, R. Chatterjee, M. Turchi
- Computer Science, PsychologyConference on Machine Translation
- 1 September 2017
This paper presents the results of the WMT17 shared tasks, which included
three machine translation (MT) tasks (news, biomedical, and multimodal), two evaluation tasks (metrics and run-time…
Findings of the 2013 Workshop on Statistical Machine Translation
- Ondrej Bojar, C. Buck, Lucia Specia
- Computer ScienceWMT@ACL
- 1 August 2013
We present the results of the WMT13 shared tasks, which included a translation task, a task for run-time estimation of machine translation quality, and an unofficial metrics task. This year, 143…
Findings of the 2011 Workshop on Statistical Machine Translation
- Chris Callison-Burch, Philipp Koehn, Christof Monz, Omar Zaidan
- Computer ScienceWMT@EMNLP
- 1 July 2011
The WMT11 shared tasks, which included a translation task, a system combination task, and a task for machine translation evaluation metrics, show how strongly automatic metrics correlate with human judgments of translation quality for 21 evaluation metrics.
Further Meta-Evaluation of Machine Translation
- Chris Callison-Burch, C. Fordyce, Philipp Koehn, Christof Monz, J. Schroeder
- Computer ScienceWMT@ACL
- 19 June 2008
This paper analyzes the translation quality of machine translation systems for 10 language pairs translating between Czech, English, French, German, Hungarian, and Spanish and uses the human judgments of the systems to analyze automatic evaluation metrics for translation quality.
Findings of the 2010 Joint Workshop on Statistical Machine Translation and Metrics for Machine Translation
- Chris Callison-Burch, Philipp Koehn, Christof Monz, Kay Peterson, Mark A. Przybocki, Omar Zaidan
- Computer ScienceWMT@ACL
- 15 July 2010
A large-scale manual evaluation of 104 machine translation systems and 41 system combination entries was conducted, which used the ranking of these systems to measure how strongly automatic metrics correlate with human judgments of translation quality for 26 metrics.
...
...