On Improving the Accuracy of Readability Classification using Insights from Second Language Acquisition
TLDR
It is shown that developmental measures from Second Language Acquisition research, when combined with traditional readability features such as word length and sentence length, provide a good indication of text readability across different grades.
Readability Classification for German using Lexical, Syntactic, and Morphological Features
TLDR
It is shown that readability classification for German based on syntactic, lexical and language model features from previous research on English is highly successful, reaching 89.7% accuracy, with the new morphological features making an important contribution.
The MERLIN corpus: Learner language and the CEFR
TLDR
The MERLIN corpus is a written learner corpus for Czech, German, and Italian that has been designed to illustrate the Common European Framework of Reference for Languages (CEFR) with authentic learner data, supporting a broadening of the scope of research in areas such as automatic proficiency classification or native language identification.
Evaluating Answers to Reading Comprehension Questions in Context: Results for German and the Role of Information Structure
TLDR
The first results for content assessment of reading comprehension activities for German are discussed, and it is argued that the surface-based account of information given in the question should be replaced with a more sophisticated, linguistically informed analysis of the information structuring of the answer in the context of the question.
Towards interlanguage POS annotation for effective learner corpora in SLA and FLT
Learner corpora can serve as a teaching resource for Foreign Language Teaching (FLT) and contribute empirical insights for Second Language Acquisition (SLA) research. To support effective querying...
Detecting Errors in Part-of-Speech Annotation
TLDR
A new method is proposed for detecting errors in "gold-standard" part-of-speech annotation based on n-grams occurring in the corpus with multiple taggings based on closed-class analysis and finite-state tagging guide patterns.
On using intelligent computer-assisted language learning in real-life foreign language teaching and learning
TLDR
The relationship between activity design and the restrictions needed to make natural language processing tractable and reliable is explored, along with pedagogical considerations and the influence of activity design choices on the integration of ICALL systems into FLTL practice.
Enhancing Authentic Web Pages for Language Learners
TLDR
An NLP architecture and web-based implementation providing automatic visual input enhancement for web pages is presented: learners freely choose the web pages they want to read, and the system displays an enhanced version of those pages.
Creation and Analysis of a Reading Comprehension Exercise Corpus: Towards Evaluating Meaning in Context
TLDR
Results showing that for such task-based corpora, meaning assessment can be performed with reasonable agreement are presented, and several sources of disagreement are discussed.
Head-driven phrase structure grammar: linguistic approach, formal foundations, and computational realization
TLDR
A kind of aerial view of the linguistic approach is sought, the formal foundations of this paradigm are summarized, computational work developed based on this paradigm is characterized, and its applications are described.