Uzbek-English and Turkish-English Morpheme Alignment Corpora

Abstract

Morphologically-rich languages pose problems for machine translation (MT) systems, including word-alignment errors, data sparsity and multiple affixes. Current alignment models at word-level do not distinguish words and morphemes, thus yielding low-quality alignment and subsequently affecting end translation quality. Models using morpheme-level alignment… (More)

Topics

5 Figures and Tables

Slides referencing similar topics