Noisy SMS Machine Translation in Low-Density Languages

Vladimir Eidelman, Kristy Hollingshead, Philip Resnik

This paper presents the system we developed for the 2011 WMT Haitian Creole–English SMS featured translation task. Applying standard statistical machine translation methods to noisy real-world SMS data in a low-density language setting such as Haitian Creole poses a unique set of challenges, which we attempt to address in this work. Along with techniques to better exploit the limited available training data, we explore the benefits of several methods for alleviating the additional noise…
