The latent words language model

@article{Deschacht2012TheLW,
  title={The latent words language model},
  author={Koen Deschacht and Jan De Belder and Marie-Francine Moens},
  journal={Computer Speech & Language},
  year={2012},
  volume={26},
  pages={384-409}
}
Statistical language models have found many applications in information retrieval since their introduction almost three decades ago. Currently the most popular models are n-gram models, which are known to suffer from serious sparseness issues, which is a result of the large vocabulary size |V | of any given corpus and of the exponential nature of n-grams, where potentially |V | n-grams can occur in a corpus. Even when many n-grams in fact never occur due to grammatical and semantic restrictions… CONTINUE READING
Highly Cited
This paper has 42 citations. REVIEW CITATIONS
Recent Discussions
This paper has been referenced on Twitter 1 time over the past 90 days. VIEW TWEETS

Citations

Publications citing this paper.
Showing 1-10 of 31 extracted citations

References

Publications referenced by this paper.
Showing 1-2 of 2 references

Similar Papers

Loading similar papers…