Corpus Based Unsupervised Labeling of Documents

@inproceedings{Rao2006CorpusBU,
  title={Corpus Based Unsupervised Labeling of Documents},
  author={Delip Rao and P Deepak and Deepak Khemani},
  booktitle={FLAIRS Conference},
  year={2006}
}
Text categorization involves mapping of documents to a fixed set of labels. A similar but equally important problem is that of assigning labels to large corpora. With a deluge of documents from sources like the World Wide Web, manual labeling by domain experts is prohibitively expensive. The problem of reducing effort in labeling of documents has warranted… CONTINUE READING