Learn More
In this article, we present a language-independent, unsupervised approach to sentence boundary detection. It is based on the assumption that a large number of ambiguities in the determination of sentence boundaries can be eliminated once abbreviations have been identified. Instead of relying on orthographic clues, the proposed system is able to detect(More)
We describe a language-independent, flexible , and accurate method for the detection of abbreviations in text corpora. It is based on the idea that an abbreviation can be viewed as a collocation, and can be identified by using methods for collocation detection such as the log likelihood ratio. Although the log likelihood ratio is known to show a good recall(More)
The detection of abbreviations is an important step in the process of sentence boundary detection. We describe a flexible, language-independent and accurate method based on the idea that an abbreviation can be viewed as a collocation. As such, it can be identified by using methods for collocation detection such as the log likelihood ratio. Although the log(More)
Traditional approaches to prepositions classify preposition lexemes into different semantic classes like spatial, temporal or modal ones. In this paper we reconsider the so-called spatial prepositions and their sub classification. We propose a sense based classification of prepositions using traditional criteria to determine different classes of(More)
We determine the productivity of determin-erless PPs in German quantitatively, restricting ourselves to the preposition unter. The study is based on two German newspaper corpora, comprising some 210 million words. The problematic construction, i.e. unter followed by a determinerless singular noun occurs some 16.000 times in the corpus. To clarify the(More)
Extraposed relative clauses pose certain problems for movement-based analyses. They seem to be insensitive to island constraints, and show intricate interactions with variable binding. Our main goal is to motivate a non-movement analysis of relative clause extraposition in which the extraposition site is syntactically superior to the phrase that it(More)