In Data-Oriented Parsing (DOP), an annotated language corpus is used as a stochastic grammar. The most probable analysis of a new input sentence is constructed by combining sub-analyses from the corpus in the most probable way. This approach has been successfully used for syntactic analysis, using corpora with syntactic annotations such as the Penn Treebank.
Many practical information extraction systems use simple taxonomies for mapping extracted strings to client-specific concept codes. In such taxonomies, concepts are defined as groups of semantically similar words and phrases. For the mapping to be accurate, a new client-specific taxonomy – usually nothing more than a set of concept codes, each with a single …
This paper describes a method for automatically learning effective dialogue strategies, generated from a library of dialogue content, using reinforcement learning from user feedback. This library includes greetings, social dialogue, chitchat, jokes and relationship building, as well as the more usual clarification and verification components of dialogue.
The NWO Priority Programme Language and Speech Technology is a 5-year research programme aiming at the development of spoken language information systems. In the Programme, two alternative natural language processing (NLP) modules are developed in parallel: a grammar-based (conventional, rule-based) module and a data-oriented (memory-based, stochastic, …