Automatic Thesaurus Generation from Raw Text using Knowledge-Poor Techniques

  title={Automatic Thesaurus Generation from Raw Text using Knowledge-Poor Techniques},
  author={Gregory Grefenstette},
In addition to showing how lexical units are related within a eld, domain-speciic thesauri give an idea of what subjects are important to that eld and are thus useful at many points in an information system. The major impediment to creation of thesauri has been the cost of their manual creation. We present here a number of automatic techniques that jointly produce a rst draft of a thesaurus from any domain-deening collection of text. The techniques are knowledge-poor in that no domain knowledge… CONTINUE READING
Highly Cited
This paper has 39 citations. REVIEW CITATIONS

From This Paper

Topics from this paper.
27 Citations
4 References
Similar Papers


Publications referenced by this paper.
Showing 1-4 of 4 references

Thesaurus software

  • Jessica Milstead
  • 1990
Highly Influential
5 Excerpts

Aspects of Text

  • Martin Phillips
  • 1985
1 Excerpt

Similar Papers

Loading similar papers…