Corpus ID: 15122710

Hybrid Approach Combining Machine Learning and a Rule-Based Expert System for Text Categorization

  title={Hybrid Approach Combining Machine Learning and a Rule-Based Expert System for Text Categorization},
  author={Julio Villena-Rom{\'a}n and Sonia Collada-P{\'e}rez and Sara Lana-Serrano and J. Gonz{\'a}lez},
  booktitle={FLAIRS Conference},
  • Julio Villena-Román, Sonia Collada-Pérez, +1 author J. González
  • Published in FLAIRS Conference 2011
  • Computer Science
  • This paper discusses a novel hybrid approach for text categorization that combines a machine learning algorithm, which provides a base model trained with a labeled corpus, with a rule-based expert system, which is used to improve the results provided by the previous classifier, by filtering false positives and dealing with false negatives. [...] Key Method We also describe an implementation based on k-Nearest Neighbor and a simple rule language to express lists of positive, negative and relevant (multiword…Expand Abstract
    34 Citations

    Figures, Tables, and Topics from this paper

    HAUSS: Incrementally building a summarizer combining multiple techniques
    • 13
    Relation Extraction of Medical Concepts Using Categorization and Sentiment Analysis
    • 8
    Supervised text classification of medical triage reports
    • 2


    Using a generalized instance set for automatic text categorization
    • 230
    Context-sensitive learning methods for text categorization
    • 570
    • PDF
    Machine learning in automated text categorization
    • 8,062
    • Highly Influential
    • PDF
    Text Categorization Using Hybrid Multiple Model Schemes
    • 4
    Inductive learning algorithms and representations for text categorization
    • 1,188
    • PDF
    A re-examination of text categorization methods
    • 2,851
    • PDF
    Text Categorization with Support Vector Machines: Learning with Many Relevant Features
    • 8,187
    • Highly Influential
    • PDF
    Text classification using ESC-based stochastic decision lists
    • 43
    • Highly Influential
    TCS: a shell for content-based text categorization
    • 136
    • Highly Influential