Log-linear Models for Uyghur Segmentation in Spoken Language Translation

Abstract

To alleviate data sparsity in spoken Uyghur machine translation, we proposed a log-linear based morphological segmentation approach. Instead of learning model only from monolingual annotated corpus, this approach optimizes Uyghur segmentation for spoken translation based on both bilingual and monolingual corpus. Our approach relies on several features such… (More)
DOI: 10.26615/978-954-452-049-6_065

Topics

9 Figures and Tables

Slides referencing similar topics