Phoneme recognition using time-delay neural networks
- A. Waibel, Toshiyuki Hanazawa, Geoffrey E. Hinton, K. Shikano, Kevin J. Lang
- Computer ScienceIEEE Transactions on Acoustics Speech and Signal…
- 1 March 1989
The authors present a time-delay neural network (TDNN) approach to phoneme recognition which is characterized by two important properties: (1) using a three-layer arrangement of simple computing…
Toward Multilingual Neural Machine Translation with Universal Encoder and Decoder
- Thanh-Le Ha, J. Niehues, A. Waibel
- Computer ScienceInternational Workshop on Spoken Language…
- 15 November 2016
This paper presents the first attempts in building a multilingual Neural Machine Translation framework under a unified approach in which the information shared among languages can be helpful in the translation of individual language pairs and points out a novel way to make use of monolingual data with Neural Machine translation.
Readings in speech recognition
Online handwriting recognition: the NPen++ recognizer
- Stefan Jäger, S. Manke, Jürgen Reichert, A. Waibel
- Computer ScienceInternational Journal on Document Analysis and…
- 1 March 2001
Initial recognition rates for whole sentences are promising and show that the MS-TDNN architecture is suited to recognizing handwritten data ranging from single characters to whole sentences.
A real-time face tracker
The authors present a real-time face tracker that can track a person's face while the person moves freely in a room and can be applied to teleconferencing and many HCI applications including lip reading and gaze tracking.
Language-independent and language-adaptive acoustic modeling for speech recognition
Interpreting BLEU/NIST Scores: How Much Improvement do We Need to Have a Better System?
- Y. Zhang, S. Vogel, A. Waibel
- Computer ScienceInternational Conference on Language Resources…
- 1 May 2004
A novel method of calculating the confidence intervals for BLEU/NIST scores using bootstrapping is reported, which can determine whether two MT systems are significantly different from each other.
Recognizing emotion in speech
- F. Dellaert, T. Polzin, A. Waibel
- Computer ScienceProceeding of Fourth International Conference on…
- 3 October 1996
A new method of extracting prosodic features from speech, based on a smoothing spline approximation of the pitch contour, is presented, which obtains classification performance that is close to human performance on the task.
A time-delay neural network architecture for isolated word recognition
Modular Construction of Time-Delay Neural Networks for Speech Recognition
- A. Waibel
- Computer ScienceNeural Computation
- 1 March 1989
It is shown that small networks trained to perform limited tasks develop time invariant, hidden abstractions that can be exploited to train larger, more complex nets efficiently, and phoneme recognition networks of increasing complexity can be constructed that all achieve superior recognition performance.