Unsupervised Type and Token Identification of Idiomatic Expressions
TLDR
We develop statistical measures that each model a specific property of idiomatic expressions by looking at their actual usage patterns in text.
  • 162
  • 30
A Probabilistic Computational Model of Cross-Situational Word Learning
TLDR
We present a novel computational model of early word learning to shed light on the mechanisms that might be at work in this process.
  • 151
  • 19
Pulling their Weight: Exploiting Syntactic Forms for the Automatic Identification of Idiomatic Expressions in Context
TLDR
We explore the use of informative prior knowledge about the overall syntactic behaviour of a potentially-idiomatic expression (type-based knowledge) to determine whether an instance of the expression is used idiomatically or literally (token-based knowledge).
  • 92
  • 16
The Elongator Complex Interacts with PCNA and Modulates Transcriptional Silencing and Sensitivity to DNA Damage Agents
Histone chaperones CAF-1 and Asf1 function to deposit newly synthesized histones onto replicating DNA to promote nucleosome formation in a proliferating cell nuclear antigen (PCNA)-dependent process.
  • 99
  • 13
Automatically Constructing a Lexicon of Verb Phrase Idiomatic Combinations
TLDR
We investigate the lexical and syntactic flexibility of a class of idiomatic expressions, i.e., those that involve the combination of a verb plus a noun in its direct object position.
  • 99
  • 11
Statistical Measures of the Semi-Productivity of Light Verb Constructions
We propose a statistical measure for the degree of acceptability of light verb constructions, such as "take a walk", based on their linguistic properties. Our measure shows good correlations with human…
  • 76
  • 9
The Use of Syntax in Word Completion Utilities
  • 28
  • 9
Distinguishing Subtypes of Multiword Expressions Using Linguistically-Motivated Statistical Measures
We identify several classes of multiword expressions that each require a different encoding in a (computational) lexicon, as well as a different treatment within a computational system. We examine…
  • 79
  • 8
Testing the Efficacy of Part-of-Speech Information in Word Completion
TLDR
We introduce two new algorithms that combine part-of-speech tag trigrams with word bigrams, and evaluate them with a test-bench constructed for the purpose.
  • 52
  • 7
The VNC-Tokens Dataset
Idiomatic expressions formed from a verb and a noun in its direct object position are a productive cross-lingual class of multiword expressions, which can be used both idiomatically and as a liter…
  • 60
  • 6