Tackling the Poor Assumptions of Naive Bayes Text Classifiers

@inproceedings{Rennie2003TacklingTP,
  title={Tackling the Poor Assumptions of Naive Bayes Text Classifiers},
  author={Jason D. M. Rennie and Lawrence Shih and Jaime Teevan and David R. Karger},
  booktitle={ICML},
  year={2003}
}
Naive Bayes is often used as a baseline in text classification because it is fast and easy to implement. Its severe assumptions make such efficiency possible but also adversely affect the quality of its results. In this paper we propose simple, heuristic solutions to some of the problems with Naive Bayes classifiers, addressing both systemic issues as well as problems that arise because text is not actually generated according to a multinomial model. We find that our simple corrections result… CONTINUE READING

Citations

Publications citing this paper.
SHOWING 1-10 OF 607 CITATIONS

A New Instance-weighting Naive Bayes Text Classifiers

  • 2018 IEEE International Conference of Intelligent Robotic and Control Engineering (IRCE)
  • 2018
VIEW 9 EXCERPTS
CITES METHODS & BACKGROUND
HIGHLY INFLUENCED

Towards Release Strategy Optimization for Apps in Google Play

VIEW 6 EXCERPTS
CITES METHODS
HIGHLY INFLUENCED

Hadith data mining and classification: a comparative analysis

  • Artificial Intelligence Review
  • 2016
VIEW 5 EXCERPTS
HIGHLY INFLUENCED

The data mining applications of shoulder pain patients treatment: physical therapy equipment usage approaches

  • 2015 2nd International Symposium on Future Information and Communication Technologies for Ubiquitous HealthCare (Ubi-HealthTech)
  • 2015
VIEW 9 EXCERPTS
CITES METHODS
HIGHLY INFLUENCED

Detecting Life Events in Feeds from Twitter

  • 2013 IEEE Seventh International Conference on Semantic Computing
  • 2013
VIEW 6 EXCERPTS
CITES METHODS
HIGHLY INFLUENCED

FILTER CITATIONS BY YEAR

2001
2019

CITATION STATISTICS

  • 94 Highly Influenced Citations

  • Averaged 50 Citations per year from 2017 through 2019

References

Publications referenced by this paper.
SHOWING 1-10 OF 16 REFERENCES