Detection of Non-Native Sentences Using Machine-Translated Training Data

Abstract

Training statistical models to detect nonnative sentences requires a large corpus of non-native writing samples, which is often not readily available. This paper examines the extent to which machinetranslated (MT) sentences can substitute as training data. Two tasks are examined. For the native vs non-native classification task, nonnative training data… (More)

Topics

4 Figures and Tables