Huiming Duan

The CIPS-SIGHAN CLP 2012 Chinese Word Segmentation on MicroBlog Corpora Bakeoff was held in the autumn of 2012. The bakeoff task focused on the performance of Chinese word segmentation algorithms on MicroBlog corpora. Seventeen groups submitted 20 results; the best system achieved precision (P), recall (R) and F values all near 95%, and the …
This paper presents a maximum entropy (ME)-based model for Chinese noun phrase metaphor recognition. Metaphor recognition is treated as a classification task between metaphorical and literal meaning. Our experiments show that the metaphor recognizer based on the ME method is significantly better than the example-based methods within the same …
The increase in three-character words is attracting more and more attention from researchers. In the present paper, the ratio of three-character words unrecorded in the Grammatical Knowledge-base of Contemporary Chinese (henceforth, three-character unknown words) is obtained by an analysis of the tagged corpus of People's Daily of 1998. The results show that the …