Su-qin Feng

Covering ambiguity is a vital issue in Chinese word segmentation. The challenge is that disambiguation depends on contextual information. This paper collects contextual statistics for covering-ambiguity words and derives a context calculation model using the log-likelihood ratio. A weighted calculation formula is designed for considering…
Combinatorial ambiguity has always been a vital issue in Chinese word segmentation. This paper presents a novel disambiguation method that uses a multi-maximal log-likelihood ratio over the cooperative statistical table, taking the cooperative examples provided by manually checked word segmentation as the initial cooperative knowledge of covering…
The vector space model is commonly used for the formal representation of text, but this approach does not highlight the features that play a key role in the text content. An improved feature selection method based on keywords is proposed, which uses text structural information and mutual information theory to extract keywords from the text content. Through using…