Christian Rishøj

Learn More
We considered a wide range of features for the DiSCo 2011 shared task about compositionality prediction for word pairs, including COALS-based endocentricity scores, compositionality scores based on distributional clusters, statistics about wordnet-induced paraphrases, hyphenation, and the likelihood of long translation equivalents in other languages. Many(More)
Martins et al. (2008) presented what to the best of our knowledge still ranks as the best overall result on the CONLLX Shared Task datasets. The paper shows how triads of stacked dependency parsers described in Martins et al. (2008) can label unlabeled data for each other in a way similar to co-training and produce end parsers that are significantly better(More)
This brief article describes our contribution to the EVALITA 2009 Parsing Task, dependency track. The TUT and ISST treebanks are augmented with additional features. MIRA is used to find a weight matrix suited for the Covington algorithm, which is subsequently skewed by discriminatively learned hard constraints on dependency lengths. Our skewed algorithm is(More)
A method for deriving an approximately labeled dependency treebank from the Thai Categorial Grammar Treebank has been implemented. The method involves a lexical dictionary for assigning dependency directions to the CG types associated with the grammatical entities in the CG bank, falling back on a generic mapping of CG types in case of unknown words.(More)
  • 1