The Penn Chinese TreeBank: Phrase structure annotation of a large corpus

  title={The Penn Chinese TreeBank: Phrase structure annotation of a large corpus},
  author={Naiwen Xue and Fei Xia and Fu-Dong Chiou and Martha Palmer},
  journal={Natural Language Engineering},
With growing interest in Chinese Language Processing, numerous NLP tools (e.g., word segmenters, part-of-speech taggers, and parsers) for Chinese have been developed all over the world. However, since no large-scale bracketed corpora are available to the public, these tools are trained on corpora with di erent segmentation criteria, part-of-speech tagsets and bracketing guidelines, and therefore, comparisons are diÆcult. As a rst step towards addressing this issue, we have been preparing a… CONTINUE READING
Highly Influential
This paper has highly influenced 136 other papers. REVIEW HIGHLY INFLUENTIAL CITATIONS
Highly Cited
This paper has 520 citations. REVIEW CITATIONS
372 Citations
46 References
Similar Papers


Publications citing this paper.
Showing 1-10 of 372 extracted citations

521 Citations

Citations per Year
Semantic Scholar estimates that this publication has 521 citations based on the available data.

See our FAQ for additional information.


Publications referenced by this paper.
Showing 1-10 of 46 references

The Grammatical Knowledge-base of Contemporary Chinese | A Complete Speci cation (in Chinese)

  • Yu, Shiwen, Xuefeng Zhu, Hui Wang, Yunyun Zhang.
  • Tsinghua University Press. 39
  • 1998
Highly Influential
6 Excerpts

A Procedure for Quantitatively Comparing the Syntactic Coverage of English grammars

  • E. Black, S. Abney, D. Flickinger, C. Gdaniec, et. al
  • In Proc. of the DARPA Speech and Natural Language…
  • 1991
Highly Influential
4 Excerpts

Building and Using Syntactically Annotated Corpora

  • Abeill e, Anne.
  • Kluwer. B
  • 2003

On the De nition of Word

  • Faro
  • 2003

Similar Papers

Loading similar papers…