Class-based n-gram models of natural language
Peter


We address the problem of predicting a word from previous words in a sample of text. In particular, we discuss n-gram models based on classes of words. We also discuss several statistical algorithms for assigning words to classes based on the frequency of their co-occurrence with other words. We find that we are able to extract classes that have the flavor of…
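
To make the idea of an n-gram model over word classes concrete, the following is a minimal sketch of a class-based bigram model, assuming the common factorization P(w_i | w_{i-1}) = P(w_i | c_i) · P(c_i | c_{i-1}), where c_i is the class of word w_i. The abstract itself does not spell out this formula. The function name `train_class_bigram`, the toy corpus, and the hand-assigned `word_to_class` mapping are all hypothetical; in particular, the classes here are fixed by hand rather than learned by the statistical clustering algorithms the abstract refers to, and no smoothing is applied.

```python
from collections import Counter


def train_class_bigram(tokens, word_to_class):
    """Return a function prob(word, prev_word) using maximum-likelihood
    estimates of P(word | class) * P(class | previous class).

    Assumption: every word belongs to exactly one class, given up front.
    """
    classes = [word_to_class[w] for w in tokens]
    word_counts = Counter(tokens)
    class_counts = Counter(classes)
    # Counts of adjacent class pairs (c_prev, c).
    class_bigrams = Counter(zip(classes, classes[1:]))
    # How often each class occurs as a predecessor (every position but the last).
    prev_counts = Counter(classes[:-1])

    def prob(word, prev_word):
        c, c_prev = word_to_class[word], word_to_class[prev_word]
        if prev_counts[c_prev] == 0:
            return 0.0  # previous class never seen as a predecessor
        p_word_given_class = word_counts[word] / class_counts[c]
        p_class_given_prev = class_bigrams[(c_prev, c)] / prev_counts[c_prev]
        return p_word_given_class * p_class_given_prev

    return prob


# Toy corpus and hand-assigned classes (illustrative only).
tokens = "the dog runs the cat sleeps".split()
word_to_class = {
    "the": "DET",
    "dog": "NOUN", "cat": "NOUN",
    "runs": "VERB", "sleeps": "VERB",
}

prob = train_class_bigram(tokens, word_to_class)

# "dog sleeps" never occurs in the corpus, yet it receives nonzero
# probability because NOUN -> VERB transitions were observed.
print(prob("sleeps", "dog"))  # 0.5
```

The point of the sketch is the sharing of statistics: a word pair that was never observed can still be assigned probability if its class pair was observed, which is the motivation for grouping words into classes before estimating n-gram probabilities.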



