Reducing Approximation and Estimation Errors for Chinese Lexical Processing with Heterogeneous Annotations

@inproceedings{Sun2012ReducingAA,
  title={Reducing Approximation and Estimation Errors for Chinese Lexical Processing with Heterogeneous Annotations},
  author={Weiwei Sun and Xiaojun Wan},
  booktitle={ACL},
  year={2012}
}
We address the issue of consuming heterogeneous annotation data for Chinese word segmentation and part-of-speech tagging. We empirically analyze the diversity between two representative corpora, i.e. Penn Chinese Treebank (CTB) and PKU’s People’s Daily (PPD), on manually mapped data, and show that their linguistic annotations are systematically different and highly compatible. The analysis is further exploited to improve processing accuracy by (1) integrating systems that are respectively…
This paper has 24 citations.
