Improving text categorization methods for event tracking

  title={Improving text categorization methods for event tracking},
  author={Yiming Yang and Tom Ault and Thomas Pierce and Charles W. Lattimer},
Automated tracking of events from chronologically ordered document streams is a new challenge for statistical text classification. Existing learning techniques must be adapted or improved in order to effectively handle difficult situations where the number of positive training instances per event is extremely small, the majority of training documents are unlabelled, and most of the events have a short duration in time. We adapted several supervised text categorization methods, specifically… CONTINUE READING
Highly Cited
This paper has 171 citations. REVIEW CITATIONS

From This Paper

Figures, tables, results, and topics from this paper.

Key Quantitative Results

  • All of these methods showed significant improvement (up to 71% reduction in weighted error rates) over the performance of the original kNN algorithm on TDT benchmark collections, making kNN among the top-performing systems in the recent TDT3 official evaluation.


Publications citing this paper.
Showing 1-10 of 97 extracted citations

171 Citations

Citations per Year
Semantic Scholar estimates that this publication has 171 citations based on the available data.

See our FAQ for additional information.

Similar Papers

Loading similar papers…