RASTA processing of speech
TLDR
The theoretical and experimental foundations of the RASTA method are reviewed, the relationship with human auditory perception is discussed, the original method is extended to combinations of additive noise and convolutional noise, and an application is shown to speech enhancement.
Connectionist Speech Recognition: A Hybrid Approach
From the Publisher: Connectionist Speech Recognition: A Hybrid Approach describes the theory and implementation of a method to incorporate neural network approaches into state-of-the-art continuous
The ICSI Meeting Corpus
TLDR
A corpus of data from natural meetings held at the International Computer Science Institute in Berkeley, California over the last three years is presented; it supports work in automatic speech recognition, noise robustness, dialog modeling, prosody, rich transcription, information retrieval, and more.
A view of the parallel computing landscape
Writing programs that scale with increasing numbers of cores should be as easy as writing programs for sequential computers.
RASTA-PLP speech analysis technique
TLDR
The authors have developed a speech analysis technique that is more robust to steady-state spectral factors in speech and that is conceptually simple and computationally efficient.
Speech and Audio Signal Processing - Processing and Perception of Speech and Music, Second Edition
TLDR
This Second Edition of Speech and Audio Signal Processing will update and revise the original book to augment it with new material describing both the enabling technologies of digital music distribution and a range of exciting new research areas in automatic music content processing that have emerged in the past five years, driven by the digital music revolution.
On using MLP features in LVCSR
TLDR
Recognition results show that MLP features can significantly improve recognition performance in large vocabulary continuous speech recognition (LVCSR) tasks for the NIST 2001 Hub-5 evaluation set with models trained on the Switchboard Corpus, even when discriminative training and system combination are used.
Deep and Wide: Multiple Layers in Automatic Speech Recognition
  • N. Morgan
  • Computer Science
    IEEE Transactions on Audio, Speech, and Language…
  • 2012
TLDR
It is concluded that while the deep processing structures can provide improvements for this genre, choice of features and the structure with which they are incorporated, including layer width, can also be significant factors.