Chao-jan Chen

Learn More
This paper describes the design criteria and annotation guidelines of the Sinica Treebank. The three design criteria are: Maximal Resource Sharing, Minimal Structural Complexity, and Optimal Semantic Information. One of the important design decisions guided by these criteria is the encoding of thematic role information. We discuss the representational and(More)
This paper aims to present the methodology and guidelines for annotation in CKIP Chinese Treebank. Under the framework of the Information-based Case grammar (ICG), a lexical feature-based grammar formalism, which stipulates each lexical item containing both syntactic and semantic information, the potential phrasal heads of input are located and the semantic(More)
The paper describes a similarity-based model to present the morphological rules for Chinese compound nouns. This representation model serves functions of 1) as the morphological rules of the compounds, 2) as a mean to evaluate the proper-ness of a compound construction, and 3) as a mean to disambiguate the semantic ambiguity of the morphological head of a(More)
This paper presents a character-based model of automatic sense determination for Chinese compounds. The model adopts a sense approximation approach using synonymous compounds retrieved by measuring similarity of semantic template in compounding. The similarity measure is derived from an association network among characters and senses, which is built from a(More)
  • 1