- Full text PDF available (28)
- This year (3)
- Last 5 years (14)
- Last 10 years (28)
Journals and Conferences
Data Set Used
We provide a robust and detailed annotation scheme for information status, which is easy to use, follows a semantic rather than cognitive motivation, and achieves reasonable inter-annotator scores. Our annotation scheme is based on two main assumptions: firstly, that information status strongly depends on (in)definiteness, and secondly, that it ought to be… (More)
We investigate the influence of information status (IS) on constituent order in German, and integrate our findings into a loglinear surface realisation ranking model. We show that the distribution of pairs of IS categories is strongly asymmetric. Moreover, each category is correlated with morphosyntactic features, which can be automatically detected. We… (More)
The main objective of the paper is to show that for an adequate analysis of an item’s information status in spoken language two levels of givenness have to be investigated: a referential and a lexical level. This separation is a crucial step towards our goal to arrive at the best possible classification of nominal expressions occurring in natural discourse… (More)
In this paper we present DIRNDL, an annotated corpus resource comprising syntactic annotations as well as information status labels and prosodic information. We introduce each annotation layer and then focus on the linking of the data in a standoff approach. The corpus is based on data from radio news broadcasts, i.e. two sets of primary data: spoken radio… (More)
In this article we discuss some empirical results concerning the impact of different levels of information status (i.e. referents and words, respectively) on the prosodic realization of referential expressions in annotated corpora of read and spontaneous speech. Both at the referential and at the lexical level not only given and new but also intermediate… (More)
This article presents a survey of and an investigation into the notion of information status. Based on insights from DRT and presupposition theory a new variant of IS taxonomis is developed, considering issues such as accommodation and underspecification of text with regard to hearer knowledge.
The article discusses several issues relevant for the annotation of written and spoken corpus data with information structure. We discuss ways to identify focus top-down (via Questions under Discussion) or bottom-up (starting from pitch accents). We introduce a two-dimensional labelling scheme for information status and propose a way to distinguish between… (More)
We present a model for automatically predicting information status labels for German referring expressions. We train a CRF on manually annotated phrases, and predict a fine-grained set of labels. We achieve an accuracy score of 69.56% on our most detailed label set, 76.62% when gold standard coreference is available.
This paper is the first to examine the effect of prosodic features on coreference resolution in spoken discourse. We test features from different prosodic levels and investigate which strategies can be applied. Our results on the basis of manual prosodic labelling show that the presence of an accent is a helpful feature in a machine-learning setting.… (More)