Neighbor-weighted K-nearest neighbor for unbalanced text corpus

Author: Songbo Tan
Journal: Expert Syst. Appl.
Text categorization, or classification, is the automated assignment of text documents to pre-defined classes based on their content. Many classification algorithms assume that training examples are evenly distributed among the classes. However, unbalanced data sets appear in many practical applications. To deal with uneven text sets, we propose the neighbor-weighted K-nearest neighbor algorithm (NWKNN). The experimental results indicate that our algorithm…
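
The abstract does not spell out how neighbors are weighted, so the following is only a minimal sketch of the general idea: a K-nearest-neighbor vote in which neighbors from small classes count more than neighbors from large ones. The function names, the `exponent` parameter, the cosine similarity, and the weight formula (inverse of relative class size raised to `1 / exponent`) are illustrative assumptions, not details confirmed by the text above.

```python
import math
from collections import Counter, defaultdict


def cosine(a, b):
    """Cosine similarity between two sparse term-weight dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    na = math.sqrt(sum(w * w for w in a.values()))
    nb = math.sqrt(sum(w * w for w in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def class_weights(labels, exponent=2.0):
    """Assumed scheme: weight(c) = 1 / (size(c) / min_size) ** (1 / exponent).

    The smallest class gets weight 1.0; larger classes get smaller weights.
    """
    sizes = Counter(labels)
    smallest = min(sizes.values())
    return {c: 1.0 / (n / smallest) ** (1.0 / exponent) for c, n in sizes.items()}


def nwknn_predict(query, docs, labels, k=3, exponent=2.0):
    """Classify `query` by class-weighted similarity voting over its k nearest neighbors."""
    weights = class_weights(labels, exponent)
    # Rank training documents by similarity to the query, most similar first.
    ranked = sorted(
        ((cosine(query, d), y) for d, y in zip(docs, labels)),
        key=lambda pair: pair[0],
        reverse=True,
    )
    scores = defaultdict(float)
    for sim, label in ranked[:k]:
        scores[label] += weights[label] * sim
    return max(scores, key=scores.get)


# Toy unbalanced corpus: four documents of class "a", one of class "b".
docs = [{"x": 1.0, "y": 1.0}] * 4 + [{"x": 1.0}]
labels = ["a"] * 4 + ["b"]
prediction = nwknn_predict({"x": 1.0}, docs, labels, k=3, exponent=2.0)
```

In this toy corpus the single "b" document is the closest match, but two weaker "a" neighbors would outvote it in a plain similarity-weighted vote. Down-weighting the majority class lets the minority class win. A very large `exponent` pushes all weights toward 1 and recovers the unweighted behavior.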

Publications citing this paper.

Research on Text Categorization of KNN Based on K-Means for Class Imbalanced Problem

2016 Sixth International Conference on Instrumentation & Measurement, Computer, Communication and Control (IMCCC) • 2016

Use relative weight to improve the kNN for unbalanced text category

Proceedings of the 6th International Conference on Natural Language Processing and Knowledge Engineering (NLPKE-2010) • 2010

People-flow counting in complex environments by combining depth and color information

Multimedia Tools and Applications • 2016

Lexical-semantic SLVM for XML Document Classification

JSW • 2014

Optimizing the k-NN metric weights using differential evolution

2010 International Conference on Multimedia Computing and Information Technology (MCIT) • 2010

Accurate Chinese Text Classification via Multiple Strategies

Fourth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2007) • 2007
