Learn More
In this paper we present DIRNDL, an annotated corpus resource comprising syntactic annotations as well as information status labels and prosodic information. We introduce each annotation layer and then focus on the linking of the data in a standoff approach. The corpus is based on data from radio news broadcasts , i.e. two sets of primary data: spoken radio(More)
In the framework of the preparation of linguistic web services for corpus processing, the need for a representation format was felt, which supports interoperability between different web services in a corpus processing pipeline, but also provides a well-defined interface to both, legacy tools and their data formats and upcoming international standards. We(More)
Data models and encoding formats for syntactically annotated text corpora need to deal with syntactic ambiguity; underspecified representations are particularly well suited for the representation of ambiguous data because they allow for high informational efficiency. We discuss the issue of being informationally efficient, and the trade-off between(More)
Depending on the nature of a linguistic theory, empirical investigations of its soundness may focus on corpus studies related to lexical, syntactic, semantic or other phenomena. Especially work in research networks usually comprises analyses of different levels of description, where each one must be as reliable as possible when the same sentences and texts(More)
  • Jürgen Heit, Harvard Univ, Plödereder Erhard, Betreuer, Charles P Shelton, L Christopher +7 others
  • 2006
Quality attributes grouped under the term of dependability have been the subject of research for over fifty years. During this time, dependability and its quality attributes have been overloaded with many different definitions. This thesis will therefore take a detailed look at the most important quality attributes of dependability. Dependability plays a(More)