Insights from Russian second language readability classification: complexity-dependent training requirements, and feature evaluation of multiple categories

@inproceedings{Reynolds2016InsightsFR,
  title={Insights from Russian second language readability classification: complexity-dependent training requirements, and feature evaluation of multiple categories},
  author={Robert Joshua Reynolds},
  booktitle={BEA@NAACL-HLT},
  year={2016}
}
  • Robert Joshua Reynolds
  • Published in BEA@NAACL-HLT 2016
  • Computer Science
  • I investigate Russian second language readability assessment using a machine-learning approach with a range of lexical, morphological, syntactic, and discourse features. Tested on a new collection of Russian L2 readability corpora, the model achieves an F-score of 0.671 and an adjacent accuracy of 0.919 on a 6-level classification task. Information gain and feature subset evaluation show that morphological features are collectively the most informative. Learning curves for binary classifiers…
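The abstract reports adjacent accuracy alongside the F-score. As a rough illustration only, the sketch below computes adjacent accuracy under its common definition: the share of documents whose predicted level is at most one level away from the gold level. The integer encoding of the six levels (0 to 5), the function name `adjacent_accuracy`, and the sample labels are assumptions made for this example; they are not taken from the paper and do not reproduce its results.

```python
def adjacent_accuracy(gold, predicted, tolerance=1):
    """Share of predictions within `tolerance` levels of the gold level.

    Assumes levels are encoded as consecutive integers (hypothetical encoding).
    """
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must have the same length")
    if not gold:
        raise ValueError("need at least one labelled document")
    # A prediction counts as a hit when it is off by at most `tolerance` levels.
    hits = sum(abs(g - p) <= tolerance for g, p in zip(gold, predicted))
    return hits / len(gold)


# Illustrative labels only: six readability levels encoded as 0..5.
gold = [0, 1, 2, 3, 4, 5, 2, 3]
predicted = [0, 2, 2, 5, 4, 4, 0, 3]

exact = adjacent_accuracy(gold, predicted, tolerance=0)  # plain accuracy
adjacent = adjacent_accuracy(gold, predicted)            # off-by-one allowed
print(exact, adjacent)
```

With `tolerance=0` the function reduces to plain accuracy, which makes it easy to see how much of a classifier's error consists of near misses between neighbouring levels.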
    10 Citations

    Automated Text Readability Assessment for Russian Second Language Learners
    • 2018
    • 1
    Automatic Analysis of Linguistic Complexity and Its Application in Language Learning Research
    An Empirical Analysis of Linguistic, Typographic, and Structural Features in Simplified German Texts
    • 1
    Advances in Computational Intelligence
