Multiword Expressions in the wild? The mwetoolkit comes in handy

  title={Multiword Expressions in the wild? The mwetoolkit comes in handy},
  author={Carlos Ramisch and Aline Villavicencio and Christian Boitet},
The mwetoolkit is a tool for automatic extraction of Multiword Expressions (MWEs) from monolingual corpora. It both generates and validates MWE candidates. The generation is based on surface forms, while for the validation, a series of criteria for removing noise are provided, such as some (language independent) association measures.1 In this paper, we present the use of the mwetoolkit in a standard configuration, for extracting MWEs from a corpus of general-purpose English. The functionalities… CONTINUE READING
Highly Cited
This paper has 57 citations. REVIEW CITATIONS
38 Citations
9 References
Similar Papers


Publications citing this paper.
Showing 1-10 of 38 extracted citations

57 Citations

Citations per Year
Semantic Scholar estimates that this publication has 57 citations based on the available data.

See our FAQ for additional information.


Publications referenced by this paper.
Showing 1-9 of 9 references

A hybrid approach for multiword expression identification

  • Ramisch, Carlos, Helena de Medeiros Caseli, Aline Villavicencio, André Machado, Maria José Finatto.
  • Proc. of the 9th PROPOR (PROPOR 2010), volume…
  • 2010
5 Excerpts

Collocation extraction based on syntactic parsing

  • Seretan, Violeta.
  • Ph.D. thesis, University of Geneva, Geneva…
  • 2008

Twistin’ the night away

  • Jackendoff, Ray.
  • Language, 73:534–559.
  • 1997

Similar Papers

Loading similar papers…