• Publications
  • Influence
Chinese Whispers - an Efficient Graph Clustering Algorithm and its Application to Natural Language Processing Problems
TLDR
We introduce Chinese Whispers, a randomized graph-clustering algorithm, which is time-linear in the number of edges. Expand
  • 332
  • 38
  • PDF
UKP: Computing Semantic Textual Similarity by Combining Multiple Content Similarity Measures
TLDR
We present the UKP system which performed best in the Semantic Textual Similarity (STS) task at SemEval-2012 in two out of three metrics. Expand
  • 209
  • 33
  • PDF
Do Supervised Distributional Methods Really Learn Lexical Inference Relations?
TLDR
Distributional representations of words have been recently used in supervised settings for recognizing lexical inference relations between word pairs, such as hypernymy and entailment, and show that they do not actually learn a relation between two words. Expand
  • 187
  • 31
  • PDF
WebAnno: A Flexible, Web-based and Visually Supported System for Distributed Annotations
TLDR
We present WebAnno, a general purpose web-based annotation tool that is immediately usable by any annotator with internet access. Expand
  • 160
  • 22
  • PDF
TopicTiling: A Text Segmentation Algorithm based on LDA
TLDR
This work presents a Text Segmentation algorithm called TopicTiling, which is based on the well-known TextTiling algorithm, and segments documents using the Latent Dirichlet Allocation (LDA) topic model. Expand
  • 88
  • 20
  • PDF
Corpus Portal for Search in Monolingual Corpora
TLDR
A simple and flexible schema for storing and presenting monolingual language resources is proposed. Expand
  • 200
  • 19
  • PDF
A Report on the Complex Word Identification Shared Task 2018
TLDR
We report the findings of the second Complex Word Identification (CWI) shared task organized as part of the BEA workshop co-located with NAACL-HLT'2018. Expand
  • 55
  • 12
  • PDF
Unsupervised Part-of-Speech Tagging Employing Efficient Graph Clustering
TLDR
An unsupervised part-of-speech (POS) tagging system that relies on graph clustering methods is described. Expand
  • 105
  • 11
  • PDF
NoSta-D Named Entity Annotation for German: Guidelines and Dataset
TLDR
We describe the annotation of a new dataset for German Named Entity Recognition (NER). Expand
  • 50
  • 11
  • PDF
Distributional Semantics and Compositionality 2011: Shared Task Description and Results
TLDR
This paper gives an overview of the shared task at the ACL-HLT 2011 DiSCo (Distributional Semantics and Compositionality) workshop. Expand
  • 50
  • 11
  • PDF