Moses: Open Source Toolkit for Statistical Machine Translation
- Philipp Koehn, Hieu T. Hoang, Evan Herbst
- Computer ScienceAnnual Meeting of the Association for…
- 1 June 2007
We describe an open-source toolkit for statistical machine translation whose novel contributions are (a) support for linguistically motivated factors, (b) confusion network decoding, and (c)…
Statistical Phrase-Based Translation
- Philipp Koehn, F. Och, D. Marcu
- Computer ScienceNorth American Chapter of the Association for…
- 27 May 2003
The empirical results suggest that the highest levels of performance can be obtained through relatively simple means: heuristic learning of phrase translations from word-based alignments and lexical weighting of phrase translation.
Europarl: A Parallel Corpus for Statistical Machine Translation
- Philipp Koehn
- Computer ScienceMachine Translation Summit
- 2005
A corpus of parallel text in 11 languages from the proceedings of the European Parliament is collected and its acquisition and application as training data for statistical machine translation (SMT) is focused on.
Abstract Meaning Representation for Sembanking
- Laura Banarescu, Claire Bonial, Nathan Schneider
- Computer ScienceLAW@ACL
- 1 August 2013
A sembank of simple, whole-sentence semantic structures will spur new work in statistical natural language understanding and generation, like the Penn Treebank encouraged work on statistical parsing.
Statistical Significance Tests for Machine Translation Evaluation
- Philipp Koehn
- Computer ScienceConference on Empirical Methods in Natural…
- 1 July 2004
If two translation systems differ differ in performance on a test set, can we trust that this indicates a difference in true system quality? To answer this question, we describe bootstrap resampling…
Pharaoh: A Beam Search Decoder for Phrase-Based Statistical Machine Translation Models
- Philipp Koehn
- Computer ScienceConference of the Association for Machine…
- 28 September 2004
We describe Pharaoh, a freely available decoder for phrase-based statistical machine translation models. The decoder is the implement at ion of an efficient dynamic programming search algorithm with…
Synthesis Lectures on Human Language Technologies
- Philip Williams, Rico Sennrich, Matt Post, Philipp Koehn
- Computer Science
- 2016
Clause Restructuring for Statistical Machine Translation
- M. Collins, Philipp Koehn, I. Kucerova
- Computer ScienceAnnual Meeting of the Association for…
- 25 June 2005
The reordering approach is applied as a pre-processing step in both the training and decoding phases of a phrase-based statistical MT system, showing an improvement from 25.2% Bleu score for a baseline system to 26.8% Blee score for the system with reordering.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
- Adithya Renduchintala, Rebecca Knowles, Philipp Koehn, Jason Eisner
- Engineering
- 1 August 2016
Factored Translation Models
- Philipp Koehn, Hieu Hoang
- Computer ScienceConference on Empirical Methods in Natural…
- 1 June 2007
In a number of experiments, it is shown that factored translation models lead to better translation performance, both in terms of automatic scores, as well as more grammatical coherence.
...
...