Extracting Collocations from Text Corpora

  title={Extracting Collocations from Text Corpora},
  author={Dekang Lin},
A collocation is a habitual word combination. Collocational knowledge is essential for many tasks in natural language processing. We present a method for extracting collocations from text corpora. By comparison with the SUSANNE corpus, we show that both high precision and broad coverage can be achieved with our method. Finally, we describe an application of the automatically extracted collocations for computing word similarities. 
Highly Influential
This paper has highly influenced 19 other papers. REVIEW HIGHLY INFLUENTIAL CITATIONS
Highly Cited
This paper has 204 citations. REVIEW CITATIONS

From This Paper

Figures, tables, and topics from this paper.


Publications citing this paper.
Showing 1-10 of 128 extracted citations

Creative discovery in the lexical "validation gap"

Computer Speech & Language • 2005
View 5 Excerpts
Highly Influenced

205 Citations

Citations per Year
Semantic Scholar estimates that this publication has 205 citations based on the available data.

See our FAQ for additional information.


Publications referenced by this paper.
Showing 1-10 of 14 references

Training and Scaling Preference Functions for Disambiguation

Computational Linguistics • 1994
View 5 Excerpts
Highly Influenced

Retrieving Collocations from Text: Xtract

Computational Linguistics • 1993
View 4 Excerpts
Highly Influenced

WordNet: An on-line lexical database

George A. Miller.
International Journal of Lexicography, 3(4):235–312. • 1990
View 3 Excerpts
Highly Influenced

Determining Similarity and Inferring Relations in a Lexical Knowledge Base

Stephen D. Richardson.
Ph.D. thesis, The City University of New York. • 1997
View 1 Excerpt

A method for refining automaticallydiscovered lexical relations

Marti A. Hearst, Gregory Grefenstette.
Carl Weir, editor, Statistically-Based Natural Language Programming Techniques, number W-92-01 in Technical • 1992
View 1 Excerpt

Similar Papers

Loading similar papers…