Two-Character Chinese Word Extraction Based on Hybrid of Internal and Contextual Measures

@inproceedings{Luo2003TwoCharacterCW,
  title={Two-Character Chinese Word Extraction Based on Hybrid of Internal and Contextual Measures},
  author={Shengfen Luo and Maosong Sun},
  booktitle={SIGHAN},
  year={2003}
}
Word extraction is one of the important tasks in text information processing. There are mainly two kinds of statisticbased measures for word extraction: the internal measure and the contextual measure. This paper discusses these two kinds of measures for Chinese word extraction. First, nine widely adopted internal measures are tested and compared on individual basis. Then various schemes of combining these measures are tried so as to improve the performance. Finally, the left/right entropy is… CONTINUE READING
Highly Cited
This paper has 45 citations. REVIEW CITATIONS

From This Paper

Figures, tables, and topics from this paper.
23 Citations
8 References
Similar Papers

Citations

Publications citing this paper.
Showing 1-10 of 23 extracted citations

References

Publications referenced by this paper.

Similar Papers

Loading similar papers…