Using Web Searches on Important Words to Create Background Sets for LSI Classification.

@inproceedings{Zelikovitz2006UsingWS,
  title={Using Web Searches on Important Words to Create Background Sets for LSI Classification.},
  author={Sarah Zelikovitz and Marina Kogan},
  booktitle={FLAIRS Conference},
  year={2006}
}
The world wide web has a wealth of information that is related to almost any text classification task. This paper presents a method for mining the web to improve text classification, by creating a background text set. Our algorithm uses the information gain criterion to create lists of important words for each class of a text categorization problem. It then searches the web on various combinations of these words to produce a set of related data. We use this set of background text with Latent… CONTINUE READING

Citations

Publications citing this paper.
Showing 1-10 of 14 extracted citations

Similar Papers

Loading similar papers…