Real Time Web Text Classification and Analysis of Reading Difficulty

  title={Real Time Web Text Classification and Analysis of Reading Difficulty},
  author={Eleni Miltsakaki and Audrey Troutt},
The automatic analysis and categorization of web text has witnessed a booming interest due to the increased text availability of different formats, content, genre and authorship. We present a new tool that searches the web and performs in real-time a) html-free text extraction, b) classification for thematic content and c) evaluation of expected reading difficulty. This tool will be useful to adolescent and adult low-level reading students who face, among other challenges, a troubling lack of… CONTINUE READING
Highly Cited
This paper has 38 citations. REVIEW CITATIONS


Publications citing this paper.
Showing 1-10 of 23 extracted citations

Fast Text Classification Using Randomized Explicit Semantic Analysis

2015 IEEE International Conference on Information Reuse and Integration • 2015


Publications referenced by this paper.
Showing 1-10 of 14 references

The relationship of the component skills of reading to ials performance: Tipping points and five classes of adult literacy learners

John Strucker, Yamamoto Kentaro, Irwin Kirsch.
NCSALL Reports • 2007
View 2 Excerpts

Digest of education statistics 2005 (nces 2006-030)

T. D. Snyder, A. G. Tan, C. M. Hoffman.
U.S. Department of Education, National Center for Education Statistics. Washington, DC: U.S. Government Printing Office. • 2006
View 2 Excerpts

Similar Papers

Loading similar papers…