Julius - an open source real-time large vocabulary recognition engine
EUROSPEECH2001: the 7th European Conference on Speech Communication and Technology, September 3-7, 2001, Aalborg, Denmark.
Recent Development of Open-Source Speech Recognition Engine Julius
This paper gives an overview of Julius, describes its major features and specifications, and summarizes the developments of recent years.
Overview of the IR for Spoken Documents Task in NTCIR-9 Workshop
This paper explains the data used in the subtasks, how to make transcriptions by speech recognition and the details of each subtask of the IR for Spoken Documents Task in NTCIR-9 Workshop.
Statistical Speech Enhancement Based on Probabilistic Integration of Variational Autoencoder and Non-Negative Matrix Factorization
- Yoshiaki Bando, M. Mimura, Katsutoshi Itoyama, Kazuyoshi Yoshii, Tatsuya Kawahara
- Computer Science
- IEEE International Conference on Acoustics…
- 31 October 2017
This paper presents a statistical method of single-channel speech enhancement that uses a variational autoencoder (VAE) as a prior distribution on clean speech. The method outperformed the conventional DNN-based method in unseen noisy environments.
Benchmark test for speech recognition using the Corpus of Spontaneous Japanese
- Tatsuya Kawahara
- Computer Science
We present benchmark results of automatic speech recognition using the Corpus of Spontaneous Japanese (CSJ), which has been developed in the five-year national project and will be the largest…
Designing Precise and Robust Dialogue Response Evaluators
This work proposes a reference-free evaluator that exploits semi-supervised training and pretrained (masked) language models. The evaluator achieves a strong correlation with human judgement and generalizes robustly to diverse responses and corpora.
An Unsupervised Model for Joint Phrase Alignment and Extraction
- Graham Neubig, Taro Watanabe, E. Sumita, Shinsuke Mori, Tatsuya Kawahara
- Computer Science
- ACL
- 19 June 2011
An unsupervised model for joint phrase alignment and extraction using non-parametric Bayesian methods and inversion transduction grammars (ITGs) is presented. It matches the accuracy of the traditional two-step word alignment/phrase extraction approach while reducing the phrase table to a fraction of the original size.
Bayesian Learning of a Language Model from Continuous Speech
- Graham Neubig, M. Mimura, Shinsuke Mori, Tatsuya Kawahara
- Computer Science
- IEICE Trans. Inf. Syst.
- 1 February 2012
Experimental results on natural, adult-directed speech demonstrate that LMs built using only continuous speech are able to significantly reduce ASR phoneme error rates, and the proposed technique of joint Bayesian learning of lexical units and an LM over lattices is shown to significantly contribute to this improvement.
ERICA: The ERATO Intelligent Conversational Android
- Dylan F. Glas, T. Minato, C. Ishi, Tatsuya Kawahara, H. Ishiguro
- Computer Science
- 25th IEEE International Symposium on Robot and…
- 1 August 2016
This paper presents an overview of the requirements and design of the platform, describes the development process of an interactive application, reports on ERICA's first autonomous public demonstration, and discusses the main technical challenges that remain to be addressed in order to create humanlike, autonomous androids.