Corpus ID: 18045340

2 Materials and Methods 2 . 1 Related Work

@inproceedings{Erkan20112MA,
  title={2 Materials and Methods 2 . 1 Related Work},
  author={G{\"u}nes Erkan and A. Hassan and Q. Diao and Dragomir R. Radev},
  year={2011}
}
We present new nearest neighbor methods for text classification and an evaluation of these methods against the existing nearest neighbor methods as well as other well-known text classification algorithms. Inspired by the language modeling approach to information retrieval, we show improvements in k-nearest neighbor (kNN) classification by replacing the classical cosine similarity with a KL divergence based similarity measure. We also present an extension of kNN to the semi-supervised case which… Expand

Figures and Tables from this paper

References

SHOWING 1-10 OF 31 REFERENCES
Integrating Background Knowledge into Nearest-Neighbor Text Classification
An Evaluation of Statistical Approaches to Text Categorization
A re-examination of text categorization methods
RCV1: A New Benchmark Collection for Text Categorization Research
Distributional clustering of words for text classification
Improving multi-class text classification with Naive Bayes
Text Classification from Labeled and Unlabeled Documents using EM
Semi-Supervised Learning Using Gaussian Fields and Harmonic Functions
Transductive Inference for Text Classification using Support Vector Machines
...
1
2
3
4
...