• Topic models take a corpus of documents as input and jointly cluster words by the documents they occur in, and documents by the words they contain
• If the corpus is small and/or the documents are short, these clusters will be noisy
• Latent feature representations of words learnt from large external corpora (e.g., word2vec, GloVe) capture ...
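To make the joint clustering in the first bullet concrete, here is a minimal sketch, assuming latent Dirichlet allocation (LDA) fitted by collapsed Gibbs sampling as the topic model; the source does not say which model it has in mind. The function name `lda_gibbs`, the hyperparameters `alpha` and `beta`, and the toy corpus are all illustrative assumptions. With only four very short documents, the estimated clusters can change noticeably from one random seed to another, which is the noise the second bullet describes.

```python
import random
from collections import defaultdict


def lda_gibbs(docs, num_topics=2, alpha=0.1, beta=0.01, iters=200, seed=0):
    """Collapsed Gibbs sampling for LDA (illustrative sketch, not tuned).

    docs is a list of token lists. Returns per-document topic proportions,
    per-topic word counts, and the sorted vocabulary.
    """
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    vocab_size = len(vocab)

    doc_topic = [[0] * num_topics for _ in docs]                # tokens of doc d in topic k
    topic_word = [defaultdict(int) for _ in range(num_topics)]  # tokens of word w in topic k
    topic_total = [0] * num_topics                              # tokens in topic k
    assign = []                                                 # topic of every token

    # Start from a random topic assignment for every token.
    for d, doc in enumerate(docs):
        zs = []
        for w in doc:
            k = rng.randrange(num_topics)
            zs.append(k)
            doc_topic[d][k] += 1
            topic_word[k][w] += 1
            topic_total[k] += 1
        assign.append(zs)

    for _ in range(iters):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                # Remove the token's current assignment from the counts.
                k = assign[d][i]
                doc_topic[d][k] -= 1
                topic_word[k][w] -= 1
                topic_total[k] -= 1

                # A topic is likely if it is common in this document
                # (documents clustered by their words) and if this word is
                # common in the topic (words clustered by their documents).
                weights = [
                    (doc_topic[d][t] + alpha)
                    * (topic_word[t][w] + beta)
                    / (topic_total[t] + vocab_size * beta)
                    for t in range(num_topics)
                ]
                k = rng.choices(range(num_topics), weights=weights)[0]

                assign[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][w] += 1
                topic_total[k] += 1

    # Smoothed topic proportions for each document.
    theta = [
        [(c + alpha) / (len(doc) + num_topics * alpha) for c in counts]
        for counts, doc in zip(doc_topic, docs)
    ]
    return theta, topic_word, vocab


# Toy corpus (made up for illustration): four very short documents.
docs = [
    "topic model cluster words".split(),
    "cluster documents topic".split(),
    "embedding vector word2vec".split(),
    "vector embedding glove".split(),
]

theta, topic_word, vocab = lda_gibbs(docs, num_topics=2)
for d, row in enumerate(theta):
    print(d, [round(p, 2) for p in row])
```

Each printed row is one document's estimated mix over the two topics. Rerunning with a different `seed` is a quick way to see how unstable those mixes are when the corpus is this small.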