Word-Sense Disambiguation Using Statistical Models of Roget's Categories Trained on Large Corpora

@inproceedings{Yarowsky1992WordSenseDU,
  title={Word-Sense Disambiguation Using Statistical Models of Roget's Categories Trained on Large Corpora},
  author={David Yarowsky},
  booktitle={COLING},
  year={1992}
}
  • David Yarowsky
  • Published in COLING 1992
  • Computer Science
  • This paper describes a program that disambiguates English word senses in unrestricted text using statistical models of the major Roget's Thesaurus categories. [...] Key Method The selection of categories is accomplished by identifying and weighting words that are indicative of each category when seen in context, using a Bayesian theoretical framework.Other statistical approaches have required special corpora or hand-labeled training examples for much of the lexicon. Our use of class models overcomes this…Expand Abstract
    832 Citations
    Word Sense Disambiguation of Adjectives Using Probabilistic Networks
    • 12
    • PDF
    Word Sense Disambiguation based on Semantic Density
    • 42
    • PDF
    Distinguishing Word Senses in Untagged Text
    • 176
    • PDF
    Corpus-Based Statistical Sense Resolution
    • 186
    • PDF
    Principled Disambiguation: Discriminating Adjective Senses with Modified Nouns
    • 82
    • PDF

    References

    SHOWING 1-10 OF 79 REFERENCES
    A method for disambiguating word senses in a large corpus
    • 638
    • PDF
    Word-Sense Disambiguation Using Statistical Methods
    • 436
    • PDF
    Subject-Dependent Co-Occurence and Word Sense Disambiguation
    • 117
    • PDF
    Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone
    • M. Lesk
    • Engineering, Computer Science
    • SIGDOC '86
    • 1986
    • 1,998
    • PDF
    Learning to disambiguate
    • S. Weiss
    • Computer Science
    • Inf. Storage Retr.
    • 1973
    • 64
    An Experiment in Computational Discrimination of English Word Senses
    • E. Black
    • Computer Science
    • IBM J. Res. Dev.
    • 1988
    • 87
    • Highly Influential
    Two Languages Are More Informative Than One
    • 215
    • PDF
    Disambiguation by short contexts
    • 127