Corpus ID: 9405068

Developments in the TIGER Annotation Scheme and their Realization in the Corpus

@inproceedings{Brants2002DevelopmentsIT,
  title={Developments in the TIGER Annotation Scheme and their Realization in the Corpus},
  author={Sabine Brants and Silvia Hansen},
  booktitle={LREC},
  year={2002}
}
This paper presents the annotation of the German TIGER Treebank. First, issues concerning the annotation, representation as well as querying of the treebank are discussed. Within this context, the annotation tool ANNOTATE, the export and XML formats of the TIGER Treebank and the TIGER search tool are briefly introduced. Secondly, the developments of the TIGER annotation scheme and their realization in the corpus are introduced focussing on the differences between the underlying NEGRA annotation…
TIGER: Linguistic Interpretation of a German Corpus
TLDR
The TIGER Treebank, a corpus of currently 40,000 syntactically annotated German newspaper sentences, is described, along with the query language designed to facilitate a simple formulation of complex queries and a graphical user interface for query input.
The TIGER Treebank
This paper reports on the TIGER Treebank, a corpus of currently 35,000 syntactically annotated German newspaper sentences. We describe what kind of information is encoded in the treebank and…
How to Compare Treebanks
TLDR
This paper addresses the question of how to compare syntactically annotated corpora and gain insights into the usefulness of specific design decisions, and presents TePaCoC, a new test suite for the evaluation of parsers on complex German grammatical constructions.
Disambiguation of the Semantics of German Prepositions: a Case Study
In this paper, we describe our experiments in preposition disambiguation based on a revised annotation scheme (compared to a previous study) and new features derived from a matrix factorization…
German Treebanks: TIGER and TüBa-D/Z
TLDR
This chapter presents two major treebanks of German, TIGER and TüBa-D/Z, and compares the two annotation schemes along with their advantages and disadvantages.
Annotation Scheme for Chinese Treebank
TLDR
A new annotation scheme for a Chinese treebank is proposed, and a 1,000,000-word Chinese treebank is built covering a balanced collection of journalistic, literary, academic, and other documents to show the availability and compatibility of this annotation scheme.
What kinds of trees grow in Swedish soil
TLDR
This paper will discuss and compare four different annotation schemes that have been proposed for Swedish in terms of their suitability for Swedish syntax as well as their relationship to linguistic theory and annotation schemes proposed for other languages.
GRAIN-S: Manually Annotated Syntax for German Interviews
TLDR
GRAIN-S is a set of manually created syntactic annotations for radio interviews in German that follows TIGER, one of the established syntactic treebanks of German, and can contribute to research into techniques for model adaptation and for building more corpus-independent tools.
The Norwegian Dependency Treebank
TLDR
The core principles behind the syntactic annotation and their application in certain specific cases are presented, along with the selection of texts and their distribution between genres, the annotation process, and an evaluation of the inter-annotator agreement.
Automation and Validation of Annotation for Hindi Anaphora Resolution
TLDR
An effort has been made to provide an algorithm for semi-automatic annotation of Hindi text, catering to anaphora resolution only, and to automate the process of tagging as well as the handling of semantic information through additional tags.

References

The TIGER Treebank
This paper reports on the TIGER Treebank, a corpus of currently 35,000 syntactically annotated German newspaper sentences. We describe what kind of information is encoded in the treebank and…
From LFG Structures to TIGER Treebank Annotations
The Tiger project aims at creating a large German treebank of newspaper text by exploiting two different annotation methods: (i) an interactive combination of a cascaded probabilistic parser and…
Building a Treebank for Italian: a Data-driven Annotation Schema
TLDR
This paper presents a data-driven annotation schema developed for an Italian treebank, ensuring data coverage and consistency in the annotation of linguistic phenomena, and describes the cyclical development of the annotation schema, highlighting the richness and flexibility of the format.
A Treebank of Spanish and its Application to Parsing
TLDR
The design of such a treebank for Spanish and its initial application to parser construction are described, and some automatic pre-tagging of the data is performed to speed up treebank creation.
The Penn Treebank: Annotating Predicate Argument Structure
TLDR
The implementation of crucial aspects of this new syntactic annotation scheme incorporates a more consistent treatment of a wide range of grammatical phenomena and provides a set of coindexed null elements in what can be thought of as "underlying" position for phenomena such as wh-movement, passive, and the subjects of infinitival constructions.
Tagging and parsing with cascaded Markov models: automation of corpus annotation
TLDR
New techniques for parsing natural language based on Markov models, which are commonly used in part-of-speech tagging for sequential processing on the word level, are applied to corpus annotation and partial parsing and are evaluated using corpora of different languages and domains.
English for the Computer: The SUSANNE Corpus and Analytic Scheme
  • G. Sampson
  • Computer Science
  • Computational Linguistics
  • 2002
TLDR
This book attempts to define a "Linnaean taxonomy" for the English language: an annotation scheme, the SUSANNE scheme, which yields a labelled constituency structure for any string of English, comprehensively identifying all of its surface and logical structural properties.
A Linguistically Interpreted Corpus of German Newspaper Text
TLDR
This paper reports on the development of an annotation scheme and annotation tools for unrestricted German text based on argument structure, but also permits the extraction of other kinds of representations.
Building a Treebank for French
TLDR
A treebank project for French has annotated a newspaper corpus of 1 million words with part of speech, inflection, compounds, lemmas, and constituency, and presents some uses of the corpus.
Comparing English worldwide : the International Corpus of English
The International Corpus of English is a unique linguistic and sociolinguistic project. When complete it will consist of fifteen or more parallel corpora of spoken English drawn from countries where…