Text Modeling for Real-Time Document Categorization

@article{Byrnes2005TextMF,
  title={Text Modeling for Real-Time Document Categorization},
  author={John Byrnes and Richard Rohwer},
  journal={2005 IEEE Aerospace Conference},
  year={2005},
  pages={1-11}
}
We report on experiments in adapting document categorization techniques to provide for implementation in high-speed hardware. Because resources are scarce, it is important to have a small set of robust and maximally informative variables over which learning can occur. We generate variables using information-theoretic clustering. The resulting performance is… CONTINUE READING