Automatic Text Categorization in Terms of Genre and Author

@article{Stamatatos2000AutomaticTC,
  title={Automatic Text Categorization in Terms of Genre and Author},
  author={E. Stamatatos and N. Fakotakis and G. Kokkinakis},
  journal={Computational Linguistics},
  year={2000},
  volume={26},
  pages={471--495}
}
  • E. Stamatatos, N. Fakotakis, G. Kokkinakis
  • Published 2000
  • Computer Science
  • Computational Linguistics
  • The two main factors that characterize a text are its content and its style, and both can be used as a means of categorization. In this paper we present an approach to text categorization in terms of genre and author for Modern Greek. In contrast to previous stylometric approaches, we attempt to take full advantage of existing natural language processing (NLP) tools. To this end, we propose a set of style markers including analysis-level measures that represent the way in which the input text…