A Hybrid Extraction Model for Chinese Noun/Verb Synonymous bi-gram Collocations

@inproceedings{Li2011AHE,
  title={A Hybrid Extraction Model for Chinese Noun/Verb Synonymous bi-gram Collocations},
  author={Wanyin Li and Qin Lu},
  booktitle={PACLIC},
  year={2011}
}
Statistical-based collocation extraction approaches suffer from (1) low precision rate because high co-occurrence bi-grams may be syntactically unrelated and are thus not true collocations; (2) low recall rate because some true collocations with low occurrences cannot be identified successfully by statistical-based models. To integrate both syntactic rules as well as semantic knowledge into a statistical model for collocation extraction is one way to achieve a high precision while keeping a… CONTINUE READING