Text clustering based on the improved TFIDF by the iterative algorithm

@article{Wang2012TextCB,
  title={Text clustering based on the improved TFIDF by the iterative algorithm},
  author={Xingheng Wang and Jun Cao and Yao Liu and Shi Gao and X. X. Deng},
  journal={2012 IEEE Symposium on Electrical & Electronics Engineering (EEESYM)},
  year={2012},
  pages={140-143}
}
Text clustering, an important part of the machine learning and pattern recognition, has extensive applications in the field of natural language processing. In this paper, a method is given to improve the classic TFIDF algorithm on its shortcomings. This paper classifies the text through Naive Bayesian classifier. And uses the iterative algorithm to optimize the selection of feature words, and then to optimize the classification ceaselessly. Experimental results show that the algorithm has… CONTINUE READING
6 Extracted Citations
4 Extracted References
Similar Papers

Referenced Papers

Publications referenced by this paper.
Showing 1-4 of 4 references

An empirical study of feature selection for text categorization based on term weightage[C]/

  • B CHOW, K. NARAYANAN
  • Proceedings of the 2004 IEEEIWIC/ACM…
  • 2004

On the construction effective vocabularies for information retrieval[C]/

  • G SALTON, T Y.CLEMENT
  • Proceedings of the 1973 Meeting on Programming…
  • 1973

Similar Papers

Loading similar papers…