The Lancaster Corpus of Mandarin Chinese: A Corpus for Monolingual and Contrastive Language Study

@inproceedings{McEnery2004TheLC,
  title={The Lancaster Corpus of Mandarin Chinese: A Corpus for Monolingual and Contrastive Language Study},
  author={Anthony McEnery and Zhonghua Xiao},
  booktitle={LREC},
  year={2004}
}
This paper presents the newly released Lancaster Corpus of Mandarin Chinese (LCMC), a Chinese match for the FLOB and Frown corpora of British and American English. LCMC is a one-million-word balanced corpus of written Mandarin Chinese. The corpus contains five hundred 2,000-word samples of written Chinese texts sampled from fifteen text categories published in Mainland China around 1991, totalling one million words. LCMC is XML-compliant and conforms to CES, with each document containing a… CONTINUE READING
Highly Cited
This paper has 51 citations. REVIEW CITATIONS

From This Paper

Figures, tables, results, and topics from this paper.

Key Quantitative Results

  • The corpus is segmented and POS tagged with a tagging precision rate of over 98%.

Citations

Publications citing this paper.
Showing 1-10 of 36 extracted citations

52 Citations

0510'09'12'15'18
Citations per Year
Semantic Scholar estimates that this publication has 52 citations based on the available data.

See our FAQ for additional information.

References

Publications referenced by this paper.
Showing 1-10 of 10 references

Automatic recognition of Chinese unknown words based on role tagging

  • H. Zhang, Q. Liu, X. Cheng
  • In Proceedings of the 1 SIGHAN Workshop,
  • 2002

Towards an understanding of workplace discourse

  • C. Candlin, M. Hundt, A. Sand, R. Siemund
  • 2002

Towards an understanding of workplace discourse. In C. Candlin (ed) Research and Practice in Professional Discourse (pp. 137-86)

  • Y. Gu
  • 2002
1 Excerpt

Manual of information to accompany the Freiburg - Brown Corpus of American English (‘Frown’)

  • M. Hunt, A. Sand, P. Skandera
  • 1999

Manual of information to accompany the Freiburg - LOB Corpus of British English (‘FLOB’)

  • M. Hundt, A. Sand, R. Siemund
  • 1998

The BNC Handbook

  • G. Aston, L. Burnard
  • 1998
1 Excerpt

Variation in Spoken and Written Mandarin Narrative Discourse

  • M. Christensen
  • Ph.D. thesis,
  • 1994
1 Excerpt

Similar Papers

Loading similar papers…