Learn More
We considered a wide range of features for the DiSCo 2011 shared task about composi-tionality prediction for word pairs, including COALS-based endocentricity scores, compo-sitionality scores based on distributional clusters , statistics about wordnet-induced paraphrases , hyphenation, and the likelihood of long translation equivalents in other languages.(More)
Martins et al. (2008) presented what to the best of our knowledge still ranks as the best overall result on the CONLL-X Shared Task datasets. The paper shows how triads of stacked dependency parsers described in Martins et al. (2008) can label unlabeled data for each other in a way similar to co-training and produce end parsers that are significantly better(More)
A method for deriving an approximately labeled dependency treebank from the Thai Categorial Grammar Treebank has been implemented. The method involves a lexical dictionary for assigning dependency directions to the CG types associated with the grammatical entities in the CG bank, falling back on a generic mapping of CG types in case of unknown words.(More)
  • 1