Data Mining Meets Collocations Discovery

Helena Ahonen-Myka and Antoine Doucet
In this paper we discuss the problem of discovering interesting word sequences in light of two traditions: sequential pattern mining (from data mining) and collocations discovery (from computational linguistics). Smadja (1993) defines a collocation as “a recurrent combination of words that cooccur more often than chance and that correspond to arbitrary word usages.” The notion of arbitrariness underlines the fact that if one word of a collocation is substituted by a synonym, the resulting…