A new weighting algorithm for linear classifier

@article{Chen2003ANW,
  title={A new weighting algorithm for linear classifier},
  author={Keli Chen and Chengqing Zong},
  journal={International Conference on Natural Language Processing and Knowledge Engineering, 2003. Proceedings. 2003},
  year={2003},
  pages={650-655}
}
In the domain of text categorization (TC), the TF (term frequency)* IDF (inverse document frequency) weighting algorithm and TF*IWF*IWF weighting algorithm are widely used. However, the two algorithms are too biased by the term frequency and neglect the imbalance between classes. In this paper, we propose a new weighting algorithm, which is named as TF (term frequency)*IWF (inverse word frequency)*IWF (inverse word frequency)*VE (variance and expectation). The new algorithm improves the TF*IWF… CONTINUE READING

Citations

Publications citing this paper.
SHOWING 1-7 OF 7 CITATIONS

References

Publications referenced by this paper.
SHOWING 1-9 OF 9 REFERENCES

A re-examination of text categorization methods

Yang Yiniing, Xin Liu
  • I n Proceedings of the 22nd Annual Inteniational ACM SIGIR Conference on Research and Development i n Infomiation Retrieval(SIGIR-99),
  • 1999

An Evaluation of Statistical Approaches to Text Categorization

Yang Yiming
  • Infomiation Retried,
  • 1999

A probabilistic analysis of the Rocchio algorithm with TFIDF for test catcgorization

Thorsten Joachims
  • In Proceedings of the 14th lntemational Conference on Machine Learning (ICML-97)
  • 1996

Similar Papers

Loading similar papers…