• Publications
  • Influence
On Improving the Accuracy of Readability Classification using Insights from Second Language Acquisition
TLDR
We investigate the problem of readability assessment using a range of lexical and syntactic measures originally designed to measure language development in the production of second language learners and study their impact on predicting the grade level of texts. Expand
  • 140
  • 18
  • PDF
Readability Classification for German using Lexical, Syntactic, and Morphological Features
TLDR
We investigate the problem of reading level assessment for German texts on a newly compiled corpus of freely available easy and difficult articles, targeted at adult and child readers respectively. Expand
  • 84
  • 9
  • PDF
Universal Dependencies 2.1
TLDR
Universal Dependencies is a project that seeks to develop cross-linguistically consistent treebank annotation for many languages, with the goal of facilitating multilingual parser development, cross-language learning, and parsing research from a language typology perspective. Expand
  • 53
  • 9
Automatic CEFR Level Prediction for Estonian Learner Text
TLDR
This paper reports on approaches for automatically predicting a learner’s language proficiency in Estonian according to the European CEFR scale. Expand
  • 28
  • 6
  • PDF
Experiments with Universal CEFR Classification
TLDR
In this paper, we explore universal CEFR classification using domain-specific and domain-agnostic, theory-guided as well as data-driven features. Expand
  • 14
  • 5
  • PDF
Automated Assessment of Non-Native Learner Essays: Investigating the Role of Linguistic Features
  • Sowmya Vajjala
  • Computer Science
  • International Journal of Artificial Intelligence…
  • 2 December 2016
TLDR
We explore the role of various linguistic features in automatic essay scoring using two publicly available datasets of non-native English essays written in test taking scenarios. Expand
  • 26
  • 3
  • PDF
Combining Shallow and Linguistically Motivated Features in Native Language Identification
TLDR
We explore a range of features and ensembles for the task of Native Language Identification as part of the NLI Shared Task (Tetreault et al., 2013). Expand
  • 18
  • 3
  • PDF
Readability assessment for text simplification: From analysing documents to identifying sentential simplifications
TLDR
Readability assessment can play a role in the evaluation of a simplification algorithm as well as in the identification of what to simplify. Expand
  • 45
  • 2
  • PDF
A Readable Read: Automatic Assessment of Language Learning Materials based on Linguistic Complexity
TLDR
We propose a supervised machine learning model, based on a range of linguistic features, that can reliably classify texts according to their difficulty level. Expand
  • 29
  • 2
  • PDF