The GUM corpus: creating multilayer resources in the classroom

@article{Zeldes2017TheGC,
  title={The GUM corpus: creating multilayer resources in the classroom},
  author={Amir Zeldes},
  journal={Language Resources and Evaluation},
  year={2017},
  volume={51},
  pages={581--612}
}
This paper presents the methodology, design principles and detailed evaluation of a new freely available multilayer corpus, collected and edited via classroom annotation using collaborative software. After briefly discussing corpus design for open, extensible corpora, five classroom annotation projects are presented, covering structural markup in TEI XML, multiple part of speech tagging, constituent and dependency parsing, information structural and coreference annotation, and Rhetorical…

Citations

Publications citing this paper.

Beyond The Wall Street Journal: Anchoring and Comparing Discourse Signals across Genres

  • ArXiv
  • 2019

graphANNIS: A Fast Query Engine for Deeply Annotated Linguistic Corpora


Text Labelling In Games With A Purpose

Chris Madge, Jon Chamberlain, Udo Kruschwitz
  • 2019

A Discourse Signal Annotation System for RST Trees

Luke Gessler, Yang Liu, Amir Zeldes
  • 2019
