• Publications
  • Influence
Overview of the SPMRL 2013 Shared Task: A Cross-Framework Evaluation of Parsing Morphologically Rich Languages
TLDR
This paper reports on the first shared task on statistical parsing of morphologically rich languages (MRLs). Expand
  • 144
  • 14
  • PDF
Is it Really that Difficult to Parse German?
TLDR
This paper presents a comparative study of probabilistic treebank parsing of German, using the Negra and TuBa-D/Z tree-banks. Expand
  • 56
  • 10
  • PDF
Data-Driven Parsing using Probabilistic Linear Context-Free Rewriting Systems
TLDR
This paper presents the first efficient implementation of a weighted deductive CYK parser for Probabilistic Linear Context-Free Rewriting Systems (PLCFRSs). Expand
  • 39
  • 3
  • PDF
Direct Parsing of Discontinuous Constituents in German
TLDR
We use a parser for Probabilistic Linear Context-Free Rewriting Systems (PLCFRS), a formalism with high expressivity, to directly parse the German NeGra and TIGER treebanks. Expand
  • 22
  • 3
  • PDF
Discontinuous Incremental Shift-reduce Parsing
TLDR
We present an extension to incremental shift-reduce parsing that handles discontinuous constituents, using a linear classifier and beam search. Expand
  • 29
  • 2
  • PDF
Discontinuous parsing with continuous trees
TLDR
We introduce a new method for incremental shift-reduce parsing of discontinuous constituency trees, based on the fact that discontinuous trees can be transformed into continuous trees by changing the order of the terminal nodes. Expand
  • 20
  • 2
  • PDF
PLCFRS Parsing Revisited: Restricting the Fan-Out to Two
TLDR
A parser for binary PLCFRS of fan-out two, together with a novel monotonous estimate for A parsing, with which we conduct experiments on modified versions of the German NeGra treebank and the Discontinuous Penn Treebank in which all trees have block degree two. Expand
  • 22
  • 2
  • PDF
The IUCL+ System: Word-Level Language Identification via Extended Markov Models
TLDR
We describe the IUCL+ system for the shared task of the First Workshop on Computational Approaches to Code Switching (Solorio et al., 2014), in which participants were challenged to label each word in Twitter texts as a named entity or one of two candidate languages. Expand
  • 16
  • 2
  • PDF
Annotating Coordination in the Penn Treebank
TLDR
In this paper, we present an annotation scheme for the Penn Treebank which introduces a distinction between coordinating from non-coordinating punctuation. Expand
  • 14
  • 2
  • PDF
SAWT: Sequence Annotation Web Tool
TLDR
We present SAWT, a web-based tool for the annotation of token sequences with an arbitrary set of labels. Expand
  • 6
  • 2
  • PDF
...
1
2
3
4
5
...