Boosting and Rocchio Applied to Text Filtering

@inproceedings{Schapire1998BoostingAR,
  title={Boosting and Rocchio Applied to Text Filtering},
  author={Robert E. Schapire and Yoram Singer and Amit Singhal},
  booktitle={SIGIR},
  year={1998}
}
We discuss two learning algorithms for text filtering: modified Rocchio and a boosting algorithm called AdaBoost. We show how both algorithms can be adapted to maximize any general utility matrix that associates cost (or gain) for each pair of machine prediction and correct label. We first show that AdaBoost significantly outperforms another highly effective text filtering algorithm. We then compare AdaBoost and Rocchio over three large text filtering tasks. Overall both algorithms are… CONTINUE READING

Figures, Tables, and Topics from this paper.

Explore Further: Topics Discussed in This Paper

Citations

Publications citing this paper.
SHOWING 1-10 OF 242 CITATIONS, ESTIMATED 27% COVERAGE

FILTER CITATIONS BY YEAR

1997
2019

CITATION STATISTICS

  • 26 Highly Influenced Citations

  • Averaged 3 Citations per year over the last 3 years

References

Publications referenced by this paper.
SHOWING 1-4 OF 4 REFERENCES

An evaluation of phrasal and clustered representations on a text categorization

  • David Lewis
  • Proceedings of the Fifteenth Annual International…
  • 1992
Highly Influential
15 Excerpts

Overview of the third Text REtrieval Conference (TREC-3)

  • D. K. Harman
  • Proceedings of the Third Text REtrieval…
  • 1995
Highly Influential
3 Excerpts

Similar Papers

Loading similar papers…