Learn More
In this paper, we present a reordering model based on Maximum Entropy. This model is extended from a hierarchical reordering model with PBSMT (Galley and Manning, 2008), which integrates syntactic information directly in decoder as features of MaxEnt model. The advantages of this model are (1) maintaining the strength of phrase based approach with a(More)
Though phrase-based SMT has achieved high translation quality, it still lacks of generalization ability to capture word order differences between languages. In this paper we describe a general method for tree-to-string phrase-based SMT. We study how syntactic transformation is incorporated into phrase-based SMT and its effectiveness. We design syntactic(More)
Defining the processes of autoimmune attack of tissues is important for inhibiting continued tissue destruction. In type 1 diabetes, it is not known how cytotoxic effector T cell responses evolve over time in the pancreatic islets targeted for destruction. We used two-photon microscopy of live, intact, individual islets to investigate how progression of(More)
In this paper, we present a Conditional Random Fields (CRFs) framework for the Clause Splitting problem. We adapt the CRFs model to this problem in order to use a very large sets of arbitrary, overlapping and non-independent features. In addition , we propose the use of rich linguistic information along with a new bottom-up dynamic algorithm for decoding to(More)
Consider a sequence of n independent random variables with a common continuous distribution F , and consider the task of choosing an increasing subsequence where the observations are revealed sequentially and where an observation must be accepted or rejected when it is first revealed. There is a unique selection policy π * n that is optimal in the sense(More)
Vietnamese accentless texts exist on parallel with official vietnamese documents and play an important role in instant message, mobile SMS and online searching. Understanding correctly these texts is not simple because of the lexical ambiguity caused by the diversity in adding diacritics to a given accentless sequence. There have been some methods for(More)
Reordering is a major challenge in machine translation (MT) between two languages with significant differences in word order. In this paper, we present an approach to learn reordering rules as pre-processing step based on a dependency parser in phrase-based statistical machine translation (SMT) from Vietnamese to English. Dependency parser and(More)