Shengwei Tian

This paper presents a stemming algorithm that combines a generated Uyghur noun-suffix deterministic finite automaton (DFA) with Conditional Random Fields (CRF). Because Uyghur is an agglutinative language, stemming is an essential task for Uyghur language processing applications. We generate finite state machines (FSMs) for Uyghur noun inflectional suffixes by using the…
This paper proposes a hybrid algorithm, based on error-propagation suppression, for aligning Chinese-Uighur sentences. To address the error propagation that affects length-based alignment algorithms, the paper presents a new suppression strategy. The strategy skips Chinese word segmentation and part-of-speech (POS) tagging. By using…