Poisson mixtures

Kenneth Ward Church and William A. Gale
Natural Language Engineering
Shannon (1948) showed that a wide range of practical problems can be reduced to the problem of estimating probability distributions of words and ngrams in text. It has become standard practice in text compression, speech recognition, information retrieval and many other applications of Shannon’s theory to introduce a “bag-of-words” assumption. But obviously, word rates vary from genre to genre, author to author, topic to topic, document to document, section to section, and paragraph to…