Philip Hanna

Learn More
Theoretically, an improvement in a language model occurs as the size of the n-grams increases from 3 to 5 or higher. As the n-gram size increases, the number of parameters and calculations, and the storage requirement increase very rapidly if we attempt to store all possible combinations of n-grams. To avoid these problems, the reduced n-grams' approach(More)
A number of different approaches have been applied to the treatment of errors in spoken dialogue systems, including careful design to prevent potential errors, methods for on-line error detection, and error recovery when errors have occurred and have been detected. The approach to error handling presented here is premised on the theory of grounding , in(More)
In this article we describe how Java can be used to implement an object-based, cross-domain, mixed initiative spoken dialogue manager (DM). We describe how dialogue that crosses between several business domains can be modelled as an inheriting and collaborating suite of objects suitable for implementation in Java. We describe the main features of the Java(More)
Advanced spoken dialogue systems incorporate functionalities such as mixed-initiative and cross-domain dialogues. In this paper an object-based approach to cross domain dialogue modelling is described in which service agents representing primary transaction types and support agents representing tasks such as eliciting payment details are selected as(More)
This paper presents some of the main features of a prototype spoken dialogue manager (DM) that has been incorporated into the DARPA Communicator architecture. Developed in Java, the object components that constitute the DM separate generic from domain-specific dialogue behaviour in the interests of maintainability and extensibility. Confirmation strategies(More)
—This paper investigates the effect of modeling sub-band correlation for noisy speech recognition. Subband feature streams are assumed to be independent in many subband-based speech recognition systems. However, speech recognition experimental results suggest this assumption is unrealistic. In this paper, a method is proposed to incorporate correlation into(More)
Most current speech recognition systems are built upon a single type of model, e.g. an HMM or certain type of segment based model, and furthermore typically employs only one type of acoustic feature e.g. MFCCs and their variants. This entails that the system may not be robust should the modeling assumptions be violated. Recent research efforts have(More)
Statistical language models should improve as the size of the n-grams increases from 3 to 5 or higher. However, the number of parameters and calculations, and the storage requirement increase very rapidly if we attempt to store all possible combinations of n-grams. To avoid these problems, the reduced n-grams' approach previously developed by O'Boyle (1993)(More)
A number of automatic lexicon construction methods have been proposed in recent years. Such approaches employ a dynamic programming (DP) match to collect statistics concerning differences between the observed phone sequence and that which was predicted by a standard lexicon. A more expressive lexicon is then constructed based upon the collected statistics,(More)