Word-Sense Disambiguation Using Statistical Models of Roget's Categories Trained on Large Corpora

  title={Word-Sense Disambiguation Using Statistical Models of Roget's Categories Trained on Large Corpora},
  author={David Yarowsky},
This paper describes a program that disambignates English word senses in unrestricted text using statistical models of the major Roget's Thesaurus categories. Roget's categories serve as approximations of conceptual classes. The categories listed for a word in Roger's index tend to correspond to sense distinctions; thus selecting the most likely category provides a useful level of sense disambiguatiou. The selection of categories is accomplished by identifying and weighting words that are… CONTINUE READING
Highly Influential
This paper has highly influenced 49 other papers. REVIEW HIGHLY INFLUENTIAL CITATIONS
Highly Cited
This paper has 813 citations. REVIEW CITATIONS


Publications citing this paper.
Showing 1-10 of 508 extracted citations

814 Citations

Citations per Year
Semantic Scholar estimates that this publication has 814 citations based on the available data.

See our FAQ for additional information.


Publications referenced by this paper.
Showing 1-10 of 14 references

Ditcriminatlon Decisions for 100,000-Dimensional Spaces

Gale, William, Kenneth hutch, David Yarowsky
Computatiosal Linguistics • 1992
View 1 Excerpt

*Noun Hctnograph Disambiguation

Collins English Dictionary, Collins, +3 authors Matti

'An Experiment in Computational Discrimination of English Word Sea~s." IBM Journal of Research and Dev¢lopmenl, v 32

Black, Ez

Collin~ Cobuild English Language Dictionary, Collins, London and Glasgow

J. Sinclair, P. Ilanks, +3 authors P. et el

Ka]owledge Resource Tools for Aeo~'ssing Large Text Files," in Machine Translation: Theoretical and Methodological Issues, Serges Nirenberg, ed

Walker, Donald

'Parring and Contprehending with Word Experts (A Theory and its Realization),'* in StrategiesfgrNalural

S. Small, C. Rieger
Language Processing, • 1982
View 1 Excerpt

Similar Papers

Loading similar papers…