Feature Selection for the Classification of Large Document Collections

Janez Brank, Dunja Mladenic, Marko Grobelnik, Natasa Milic-Frayling
J. UCS
Feature selection methods are often applied in the context of document classification. They are particularly important for processing large data sets that may contain millions of documents and are typically represented by a large number of features, possibly tens of thousands. Processing large data sets thus raises the issue of computational resources, and we often have to find the right trade-off between the size of the feature set and the amount of training data that we can take into account…