A scalability analysis of classifiers in text categorization

  title={A scalability analysis of classifiers in text categorization},
  author={Yiming Yang and Jian Zhang and Bryan Kisiel},
Real-world applications of text categorization often require a system to deal with tens of thousands of categories defined over a large taxonomy. This paper addresses the problem with respect to a set of popular algorithms in text categorization, including Support Vector Machines, k-nearest neighbor, ridge regression, linear least square fit and logistic regression. By providing a formal analysis of the computational complexity of each classification method, followed by an investigation on the… CONTINUE READING
Highly Cited
This paper has 204 citations. REVIEW CITATIONS
117 Citations
4 References
Similar Papers


Publications citing this paper.
Showing 1-10 of 117 extracted citations

204 Citations

Citations per Year
Semantic Scholar estimates that this publication has 204 citations based on the available data.

See our FAQ for additional information.


Publications referenced by this paper.
Showing 1-4 of 4 references

The reuters corpus volume i as a text categorization test collection

  • D. Lewis, F. Li, T. Rose, Y. Yang
  • In Journal of Machine Learning Research, page (to…
  • 2003
Highly Influential
18 Excerpts

An evaluation of statistical approaches to text categorization

  • Y. Yang
  • Journal of Information Retrieval,
  • 1999
Highly Influential
20 Excerpts

Similar Papers

Loading similar papers…