Using Word-Dependent Transition Models in HMM-Based Word Alignment for Statistical Machine Translation

@inproceedings{He2007UsingWT,
  title={Using Word-Dependent Transition Models in HMM-Based Word Alignment for Statistical Machine Translation},
  author={Xiaodong He},
  booktitle={WMT@ACL},
  year={2007}
}
In this paper, we present a Bayesian Learning based method to train word dependent transition models for HMM based word alignment. We present word alignment results on the Canadian Hansards corpus as compared to the conventional HMM and IBM model 4. We show that this method gives consistent and significant alignment error rate (AER) reduction. We also conducted machine translation (MT) experiments on the Europarl corpus. MT results show that word alignment based on this method can be used in a… CONTINUE READING
Highly Influential
This paper has highly influenced a number of papers. REVIEW HIGHLY INFLUENTIAL CITATIONS
Highly Cited
This paper has 61 citations. REVIEW CITATIONS

9 Figures & Tables

Extracted Numerical Results

  • MT results show that word alignment based on this method can be used in a phrase-based machine translation system to yield up to 1% absolute improvement in BLEU score, compared to a conventional HMM, and 0.8% compared to a IBM model 4 based word alignment.
  • Compared to baseline HMM alignment model, WDHMM can improve the BLEU score nearly 1% on in-domain test sets, and the improvement reduces to about 0.5% on the out-of-domain test. When compared to IBM model 4, WDHMM still gives higher BLEU scores, and outperform model 4 by about 0.8% on the test set.
  • In table 5, it is shown that BLEU gains of WDHMM over HMM and IBM-4 on different test sets, except the gain over IBM model 4 on the devtest set, are statistically significant with a significance level > 95%.

Topics

Statistics

051020082009201020112012201320142015201620172018
Citations per Year

62 Citations

Semantic Scholar estimates that this publication has 62 citations based on the available data.

See our FAQ for additional information.