Text Modeling for Real-Time Document Categorization

John Byrnes and Richard Rohwer
2005 IEEE Aerospace Conference

We report on experiments in adapting document categorization techniques for implementation in high-speed hardware. Because hardware resources are scarce, it is important that learning operate over a small set of robust, maximally informative variables. We generate these variables using information-theoretic clustering. The resulting performance is…
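
To make the idea of generating variables by information-theoretic clustering concrete, here is a minimal sketch in Python. It is not the authors' method, which the abstract does not specify. It assumes one common variant: greedy agglomerative merging of words into clusters, where each step picks the merge that keeps the mutual information between cluster and document class as high as possible. The function names (`mutual_information`, `cluster_words`) and the toy word counts are hypothetical and chosen for illustration only.

```python
import math
from itertools import combinations


def mutual_information(joint):
    """Return I(X; Y) in bits for a count table {x: [count per class]}."""
    total = sum(sum(row) for row in joint.values())
    n_classes = len(next(iter(joint.values())))
    # Column (class) marginals.
    col = [sum(row[j] for row in joint.values()) for j in range(n_classes)]
    mi = 0.0
    for row in joint.values():
        r = sum(row)  # Row (word or cluster) marginal.
        for j, c in enumerate(row):
            if c > 0:
                mi += (c / total) * math.log2(c * total / (r * col[j]))
    return mi


def cluster_words(counts, k):
    """Greedily merge word clusters until k remain.

    Assumption (not from the source): at each step, try every pair of
    clusters and keep the merge that preserves the most mutual
    information with the class label.
    """
    clusters = {(w,): list(row) for w, row in counts.items()}
    while len(clusters) > k:
        best = None
        for a, b in combinations(list(clusters), 2):
            merged = dict(clusters)
            row_a = merged.pop(a)
            row_b = merged.pop(b)
            merged[a + b] = [x + y for x, y in zip(row_a, row_b)]
            mi = mutual_information(merged)
            if best is None or mi > best[0]:
                best = (mi, merged)
        clusters = best[1]
    return clusters


# Hypothetical toy data: per-word counts in two document classes.
toy_counts = {
    "rocket": [9, 1],
    "orbit": [8, 2],
    "goal": [1, 9],
    "match": [2, 8],
}
toy_clusters = cluster_words(toy_counts, k=2)
```

With these toy counts, words whose class distributions are similar end up in the same cluster, so two cluster-level variables retain most of the information that the four word-level variables carried about the class. The exhaustive pair search is cubic in the vocabulary size per run and is meant only to show the principle, not to scale.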