• Publications
  • Influence
Introduction to the CoNLL-2003 Shared Task: Language-Independent Named Entity Recognition
TLDR
The CoNLL-2003 shared task: language-independent named entity recognition is described and a general overview of the systems that have taken part in the task and discuss their performance is presented.
Introduction to the CoNLL-2002 Shared Task: Language-Independent Named Entity Recognition
TLDR
The CoNLL-2002 shared task: language-independent named entity recognition is described and a general overview of the systems that have taken part in the task and discuss their performance is presented.
Introduction to the CoNLL-2000 Shared Task Chunking
We describe the CoNLL-2000 shared task: dividing text into syntactically related non-overlapping groups of words, so-called text chunking. We give background information on the data sets, present a
Representing Text Chunks
TLDR
It is shown that the the data representation choice has a minor influence on chunking performance, however, equipped with the most suitable data representation, the memory-based learning chunker was able to improve the best published chunking results for a standard data set.
Introduction to the CoNLL-2001 shared task: clause identification
TLDR
The CoNLL-2001 shared task: dividing text into clauses is described, with background information on the data sets and a general overview of the systems that have taken part in the shared task.
Memory-Based Shallow Parsing
We present memory-based learning approaches to shallow parsing and apply these to five tasks: base noun phrase identification, arbitrary base phrase recognition, clause detection, noun phrase parsing
Extracting Hypernym Pairs from the Web
TLDR
It is shown that the abundance of available data on the web enables obtaining good results with relatively unsophisticated techniques in hypernym extraction from morphological clues and from large text corpora.
Cornetto: A Combinatorial Lexical Semantic Database for Dutch
One of the goals of the STEVIN programme is the realisation of a digital infrastructure that will enforce the position of the Dutch language in the modern information and communication technology.A
Large Scale Syntactic Annotation of Written Dutch: Lassy
TLDR
This chapter presents the Lassy Small and Lassy Large treebanks, as well as related tools and applications, which have been developed and made available for syntactically annotated corpora.
...
1
2
3
4
5
...