Inductive learning algorithms and representations for text categorization

@inproceedings{Dumais1998InductiveLA,
  title={Inductive learning algorithms and representations for text categorization},
  author={S. Dumais and John C. Platt and David Hecherman and M. Sahami},
  booktitle={CIKM '98},
  year={1998}
}
  • S. Dumais, John C. Platt, +1 author M. Sahami
  • Published in CIKM '98 1998
  • Computer Science
  • 1. ABSTRACT Text categorization – the assignment of natural language texts to one or more predefined categories based on their content – is an important component in many information organization and management tasks. We compare the effectiveness of five different automatic learning algorithms for text categorization in terms of learning speed, realtime classification speed, and classification accuracy. We also examine training set size, and alternative document representations. Very accurate… CONTINUE READING
    An Extensive Empirical Study of Feature Selection Metrics for Text Classification
    • 2,534
    • Highly Influenced
    • PDF
    Machine learning in automated text categorization
    • 7,872
    • Highly Influenced
    • PDF
    Support Vector Machine Active Learning with Applications to Text Classification
    • 2,793
    • Highly Influenced
    • PDF
    Training linear SVMs in linear time
    • 1,955
    • PDF
    One-Class SVMs for Document Classification
    • 1,118
    • Highly Influenced
    • PDF
    SMOTE: Synthetic Minority Over-sampling Technique
    • 9,234
    • PDF
    Kernel Methods for Pattern Analysis
    • 3,686
    • PDF

    References

    Publications referenced by this paper.
    SHOWING 1-2 OF 2 REFERENCES
    Text Categorization with Support Vector Machines: Learning with Many Relevant Features
    • 7,701
    • Highly Influential
    • PDF
    A Bayesian Approach to Learning Bayesian Networks with Local Structure
    • 366
    • Highly Influential
    • PDF