David Milward

This paper motivates the use of Information Extraction (IE) for gathering data on protein interactions, describes the customization of an existing IE system, SRI's Highlight, for this task, and presents the results of an experiment on unseen Medline abstracts which show that customization to a new domain can be fast, reliable and cost-effective.
An account of grammatical acquisition is developed within the parameter-setting framework applied to a generalized categorial grammar (GCG). The GCG is embedded in a default inheritance network yielding a natural partial ordering (reflecting generality) of parameters which determines a partial order for parameter setting. Computational simulation shows that ...
The CALBC initiative aims to provide a large-scale biomedical text corpus that contains semantic annotations for named entities of different kinds. The generation of this corpus requires that the annotations from different automatic annotation systems be harmonized. In the first phase, the annotation systems from five participants (EMBL-EBI, EMC Rotterdam, ...
Requests for reprints should be addressed to Patrick Sturt, Centre for Cognitive Science, University of Edinburgh, 2 Buccleuch Place, Edinburgh EH8 9LW, UK. E-mail: sturt@cogsci.ed.ac.uk. Some of the material reported in this paper has been presented in talks given to the 1995 CUNY Sentence Processing Conference in Tucson, AZ, and the 1995 EACL Conference ...
We describe a categorial system (PPTS) based on partial proof trees (PPTs) as the building blocks of the system. The PPTs are obtained by unfolding the arguments of the type that would be associated with a lexical item in a simple categorial grammar. The PPTs are the basic types in the system and a derivation proceeds by combining PPTs together. We describe ...
The production of gold standard corpora is time-consuming and costly. We propose an alternative: the 'silver standard corpus' (SSC), a corpus that has been generated by the harmonisation of the annotations that have been delivered from a selection of annotation systems. The systems have to share the type system for the annotations and the harmonisation ...
Previous work has demonstrated the success of statistical language models when enough training data is available [1], but despite that, grammar-based systems are proving the preferred choice in successful commercial systems such as HeyAnita [2], BeVocal [3] and Tellme [4], largely due to the difficulty involved in obtaining a corpus of training data. Here ...
This paper provides an account of definite and pronominal anaphora which draws together insights from dynamic semantics and situation semantics. The result is a dynamic semantics in which contexts consist of the parts of a scenario which have been brought to a hearer's attention. The contexts are structured by meta-level disjunction, corresponding to a set ...