At present, there is a great deal of paroxysmal text on the web, so personal pronoun anaphora resolution can help with web page information processing. Based on the features of Chinese personal pronouns in the paroxysmal text of the Chinese web, we present a corpus-based approach to anaphora resolution that adopts the maximum entropy model.
This paper presents a novel approach for discovering and extracting sets of words sharing semantic meaning. We utilize meta-patterns of high frequency words and content words in order to discover pattern candidates. Symmetric patterns are then identified using graph-based measures, and word categories are created based on graph clique sets. Our method is...
Chinese is written without spaces or other word delimiters. Chinese word segmentation (CWS) is the first step in Chinese language processing. Generally, words and fixed phrases (idioms, named entities) can be tagged successfully. However, besides words and fixed phrases, segmentation units should be tagged too. In the segmentation...
Finding all occurrences of a twig pattern in an XML database is a core operation of XML query algorithms. Prior work has called the function getNext(q) [1], or other functions based on getNext, to find matching nodes. This involves some useless call and return operations, because returning an unmatched node to the calling procedure is useless and the...