Integrating Linguistic Resources: The American National Corpus Model

Nancy Ide and Keith Suderman
This paper describes the architecture of the American National Corpus and the design decisions we have made in order to make the corpus easy to use with a variety of existing tools with varying functionality, and to allow for layering multiple annotations over the data. The overall goal of the ANC project is to provide an “open linguistic infrastructure” for American English, consisting of as many self-generated or contributed annotations of the data as possible together with derived. The…
