Universal Dependencies v1: A Multilingual Treebank Collection
This paper describes v1 of the universal guidelines, the underlying design principles, and the currently available treebanks for 33 languages, and highlights the need for sound comparative evaluation and cross-lingual learning experiments.
Universal Dependencies 2.1
The annotation scheme is based on (universal) Stanford dependencies, Google universal part-of-speech tags, and the Interset interlingua for morphosyntactic tagsets.
Overview of the SPMRL 2013 Shared Task: A Cross-Framework Evaluation of Parsing Morphologically Rich Languages
- Djamé Seddah, Reut Tsarfaty, Eric Villemonte de la Clergerie
- Computer Science
- SPMRL@EMNLP
- 18 October 2013
This paper presents and analyzes parsing results obtained by the task participants, and provides an analysis and comparison of the parsers across languages and frameworks, reported for gold input as well as more realistic parsing scenarios.
Statistical Parsing of Morphologically Rich Languages (SPMRL) What, How and Whither
This paper synthesizes the contributions of researchers working on parsing Arabic, Basque, French, German, Hebrew, Hindi and Korean to point out shared solutions across languages and suggests itself as a source of directions for future investigations.
Introducing the SPMRL 2014 Shared Task on Parsing Morphologically-rich Languages
This paper provides a short overview of the 2014 SPMRL shared task goals, data sets, and evaluation setup; the description of participating systems and the analysis of their results are given in (Seddah et al., 2014).
A Unified Morpho-Syntactic Scheme of Stanford Dependencies
- Reut Tsarfaty
- Computer Science
- ACL
- 1 August 2013
A novel extension of SD is presented, called Unified-SD (U-SD), which unifies the annotation of structurally and morphologically marked relations via an inheritance hierarchy and creates a new resource composed of U-SD-annotated constituency and dependency treebanks for the MRL Modern Hebrew.
Universal Dependencies 2.0 – CoNLL 2017 Shared Task Development and Test Data
This release contains the test data used in the CoNLL 2017 shared task on parsing Universal Dependencies, and complements the UD 2.0 release with 18 new parallel test sets and 4 test sets in surprise languages.
Parsing Morphologically Rich Languages: Introduction to the Special Issue
This special issue reports on methods that successfully address the challenges involved in parsing a range of morphologically rich languages (MRLs); the introduction describes those challenges and outlines the contributions of the articles in the issue.
Cross-Framework Evaluation for Statistical Parsing
A principled protocol for evaluating parsing results across frameworks based on function trees, tree generalization and edit distance metrics is presented, which extends a previously proposed framework for cross-theory evaluation and allows us to compare a wider class of parsers.
Evaluating NLP Models via Contrast Sets
A new annotation paradigm for NLP is proposed that helps to close systematic gaps in the test data, and it is recommended that after a dataset is constructed, the dataset authors manually perturb the test instances in small but meaningful ways that change the gold label, creating contrast sets.