How Naked is the Naked Truth? A Multilingual Lexicon of Nominal Compound Compositionality
- Carlos Ramisch, S. Cordeiro, Leonardo Zilio, M. Idiart, Aline Villavicencio
- 1 August 2016
A new multilingual resource containing judgments about nominal compound compositionality in English, French and Portuguese is introduced; it can serve as a basis for evaluating tasks such as lexical substitution and compositionality prediction.
B2SG: a TOEFL-like Task for Portuguese
- Rodrigo Wilkens, Leonardo Zilio, Eduardo Ferreira, Aline Villavicencio
- Computer Science; LREC
- 1 May 2016
The BabelNet-Based Semantic Gold Standard (B2SG) was automatically constructed based on BabelNet and partly evaluated by human judges and can be used as the basis for evaluating the accuracy of the similarity relations on distributional thesauri.
Automatic extraction and evaluation of MWE
A method for automatically extracting and evaluating MWE in the Europarl corpus is presented and an XML parser is developed to evaluate MWE candidates against those rules and also against online dictionaries.
Using NLP for Enhancing Second Language Acquisition
This study presents SMILLE, a system that draws on the Noticing Hypothesis and on input enhancements, addressing the lack of salience of grammatical information in online documents chosen by a given…
PassPort: A Dependency Parsing Model for Portuguese
PassPort, a model for the dependency parsing of Portuguese trained with the Stanford Parser, is introduced; it achieved results very similar to those of PALAVRAS, with a LAS of 85.02 for PassPort against 84.36 for PALAVRAS.
Enhancing Grammatical Structures in Web-Based Texts.
The SMILLE system is presented; it uses Natural Language Processing to enhance grammatical information in texts chosen by a given user and is designed to draw users’ attention to specific grammatical structures, helping them notice their occurrence in authentic contexts.
Annotated corpora and tools of the PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions (edition 1.1)
This multilingual resource contains corpora in which verbal MWEs have been manually annotated, and contains training, development and test data, as well as the evaluation tools used in the PARSEME Shared Task 1.1.
An SLA Corpus Annotated with Pedagogically Relevant Grammatical Structures
This study automatically annotated a corpus of texts produced by language learners with pedagogically relevant grammatical structures and observed how these structures are being employed by learners from different proficiency levels.
Adaptive System for Language Learning
- Leonardo Zilio, Cedric Fairon
- Computer Science, Linguistics; IEEE 17th International Conference on Advanced…
- 1 July 2017
A system is presented that combines NLP and hand-written rules to enhance the text of authentic Web pages based on the needs of a specific language learner, and that applies hand-written rules to retrieve language information relevant to a given Common European Framework of Reference for Languages level.
SW4ALL: a CEFR Classified and Aligned Corpus for Language Learning
SW4ALL is presented, a corpus of documents classified by language proficiency level (based on the CEFR recommendations) that lets the learner observe how the same topic or content is described using strategies from different proficiency levels.