Processing of Swedish Compounds for Phrase-Based Statistical Machine Translation

@inproceedings{Stymne2008ProcessingOS,
  title={Processing of Swedish Compounds for Phrase-Based Statistical Machine Translation},
  author={Sara Stymne and Maria Holmqvist},
  year={2008}
}
We investigated the effects of processing Swedish compounds for phrase-based SMT between Swedish and English. Compounds were split in a pre-processing step using an unsupervised empirical method. After translation into Swedish, compounds were merged, using a novel merging algorithm. We investigated two ways of handling compound parts, by marking them as compound parts or by normalizing them to a canonical form. We found that compound splitting did improve translation into Swedish, according to… CONTINUE READING
Highly Cited
This paper has 26 citations. REVIEW CITATIONS