Vinh Van Nguyen

Though phrase-based SMT has achieved high translation quality, it still lacks the generalization ability to capture word order differences between languages. In this paper we describe a general method for tree-to-string phrase-based SMT. We study how syntactic transformation is incorporated into phrase-based SMT and how effective it is. We design syntactic…
In this paper, we present a Conditional Random Fields (CRFs) framework for the Clause Splitting problem. We adapt the CRF model to this problem in order to use very large sets of arbitrary, overlapping and non-independent features. In addition, we propose the use of rich linguistic information along with a new bottom-up dynamic algorithm for decoding to…
Consider a sequence of n independent random variables with a common continuous distribution F, and consider the task of choosing an increasing subsequence where the observations are revealed sequentially and where an observation must be accepted or rejected when it is first revealed. There is a unique selection policy π*_n that is optimal in the sense…
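To make the sequential setting concrete, here is a minimal Python sketch of an online selection rule: each observation is seen once and is accepted only if it exceeds the last accepted value by no more than a fixed window. This is an illustrative heuristic, not the optimal policy the abstract refers to; the function name `select_increasing_online`, the `window` parameter, and the assumption that observations lie in [0, 1) are all hypothetical.

```python
def select_increasing_online(observations, window):
    """Select an increasing subsequence with irrevocable accept/reject decisions.

    Assumes observations lie in [0, 1). `window` is a hypothetical tuning
    parameter: a value is accepted only if it is above the last accepted
    value but not more than `window` above it.
    """
    selected = []
    last = 0.0  # nothing accepted yet, so the running maximum starts at 0
    for x in observations:
        # The decision uses only x and past choices, never future values.
        if last < x <= last + window:
            selected.append(x)
            last = x
    return selected


picks = select_increasing_online([0.5, 0.1, 0.15, 0.9, 0.2], window=0.2)
print(picks)  # [0.1, 0.15, 0.2]
```

A small window rejects large jumps such as 0.5 and 0.9, which would otherwise use up most of the remaining range and leave little room for later selections.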
Accentless Vietnamese texts exist in parallel with official Vietnamese documents and play an important role in instant messaging, mobile SMS, and online search. Correctly understanding these texts is not simple because of the lexical ambiguity that arises when diacritics can be added to a given accentless sequence in many different ways. There have been some methods for…
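The ambiguity can be illustrated with a short Python sketch that strips diacritics using the standard `unicodedata` module and shows several distinct Vietnamese words collapsing to the same accentless form. The helper name `strip_diacritics` and the word list are illustrative assumptions, and the sketch covers only the easy direction (removing accents), not the restoration methods the abstract goes on to discuss.

```python
import unicodedata


def strip_diacritics(text: str) -> str:
    """Return `text` with Vietnamese diacritics removed."""
    # "đ"/"Đ" have no canonical decomposition, so map them explicitly.
    text = text.replace("đ", "d").replace("Đ", "D")
    # NFD splits each accented letter into a base letter plus combining marks.
    decomposed = unicodedata.normalize("NFD", text)
    # Drop the combining marks and keep the base letters.
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


# Six different words that differ only in their tone marks.
words = ["ma", "má", "mà", "mả", "mã", "mạ"]
accentless_forms = {strip_diacritics(w) for w in words}
print(accentless_forms)  # {'ma'}
```

Because the mapping is many-to-one, restoring diacritics from an accentless sequence has to rely on context rather than on the sequence alone.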
Reordering is a major challenge in machine translation (MT) between two languages with significant differences in word order. In this paper, we present an approach that learns reordering rules as a pre-processing step, based on a dependency parser, for phrase-based statistical machine translation (SMT) from Vietnamese to English. Dependency parser and…
Dynamic instability in the form of chatter is a highly undesirable phenomenon that occurs during machining, resulting in poor surface finish and reduced tool life. Though analytical models exist for identifying conditions that avoid chatter, they do not account for process uncertainties or they require extensive cutting tests. Therefore, an on-line chatter…