Eleanor Olds Batchelder

Learn More
Prelinguistic infants must find a way to isolate meaningful chunks from the continuous streams of speech that they hear. BootLex, a new model which uses distributional cues to build a lexicon, demonstrates how much can be accomplished using this single source of information. This conceptually simple probabilistic algorithm achieves significant segmentation(More)
We present an algorithm for segmenting unrestricted Japanese text that is able to detect up to 98% of the words in a corpus. The segmentation technique, which is simple and extremely fast, does not depend on a lexicon or any formal notion of what a word is in Japanese, and the training procedure does not require annotated text of any kind. Relying almost(More)
This paper offers a detailed analysis of the English suffix -ee (employee, escapee, refugee, etc.) based on 1500 naturally-occurring tokens of some 500 word types. The data suggest that formation of nouns in -ee is moderately but genuinely productive, and that analyses based on the syntactic argument structure of the stem verb are unsatisfactory. Instead,(More)
Prelinguistic infants must find a way to isolate meaningful chunks from the continuous streams of speech that they hear. This bootstrapping problem has recently been the focus of several attempts to model the cognitive problem computationally. How can we evaluate whether this kind of simulation is relevant to the cognitive situation, and how can we compare(More)
  • 1