Translating with Non-contiguous Phrases
TLDR
This paper presents a phrase-based statistical machine translation method, based on non-contiguous phrases.
  • 84
  • 11
The RWTH Machine Translation System
TLDR
We present the statistical machine translation system used by RWTH in the second TC-STAR evaluation.
  • 36
  • 6
Automatic sentence segmentation and punctuation prediction for spoken language translation
TLDR
We present a novel sentence segmentation method which is specifically tailored to the requirements of machine translation algorithms and is competitive with state-of-the-art approaches for detecting sentence-like units.
  • 92
  • 5
Deciphering Foreign Language by Combining Language Models and Context Vectors
TLDR
In this paper we show how to train statistical machine translation systems on real-life tasks using only non-parallel monolingual data from two languages.
  • 41
  • 5
Extending Statistical Machine Translation with Discriminative and Trigger-Based Lexicon Models
TLDR
We propose two extensions of standard word lexicons in statistical machine translation: a discriminative word lexicon that uses sentence-level source information to predict the target words, and a trigger-based lexicon model that extends IBM Model 1 with a second trigger, allowing for a fine-grained lexical choice of target words.
  • 95
  • 4
Logistic Model Trees with AUC Split Criterion for the KDD Cup 2009 Small Challenge
TLDR
In this work, we describe our approach to the "Small Challenge" of the KDD Cup 2009, a classification task with incomplete data.
  • 28
  • 3
Training Phrase Translation Models with Leaving-One-Out
TLDR
We describe a novel leaving-one-out approach to prevent over-fitting that allows us to train phrase models that show improved translation performance on the WMT08 Europarl German-English task.
  • 97
  • 2
Modeling punctuation prediction as machine translation
TLDR
In this paper we analyze different methods for punctuation prediction and show improvements in the quality of the final translation output.
  • 63
  • 2
The RWTH statistical machine translation system for the IWSLT 2006 evaluation
TLDR
We give an overview of the RWTH phrase-based statistical machine translation system that was used in the evaluation campaign of the International Workshop on Spoken Language Translation (IWSLT) 2006.
  • 58
  • 2
A Source-side Decoding Sequence Model for Statistical Machine Translation
We propose a source-side decoding sequence language model for phrase-based statistical machine translation. This model is a reordering model in the sense that it helps the decoder find the correct…
  • 15
  • 2