Foreword
  • K. Hirose
  • Computer Science
  • Journal of Physiology-Paris
  • 30 June 2011
  • 342
  • 31
Analysis of voice fundamental frequency contours for declarative sentences of Japanese
TLDR
A model for the generation of fundamental frequency contours (F0 contours) of spoken sentences is presented for the purpose of elucidating the relationship between the sentence F0 contour and the linguistic and non-linguistic information.
  • 481
  • 23
WFST-Based Grapheme-to-Phoneme Conversion: Open Source tools for Alignment, Model-Building and Decoding
TLDR
This paper introduces a new open-source, WFST-based toolkit for Grapheme-to-Phoneme conversion.
  • 86
  • 12
Improving WFST-based G2P Conversion with Alignment Constraints and RNNLM N-best Rescoring
TLDR
This work introduces a modified WFST-based multiple-to-multiple EM-driven alignment algorithm for Grapheme-to-Phoneme (G2P) conversion, and preliminary experimental results applying a Recurrent Neural Network Language Model (RNNLM) as an N-best rescoring mechanism for G2P conversion.
  • 40
  • 6
Failure transitions for joint n-gram models and G2P conversion
TLDR
This work investigates two related issues in the area of WFST-based G2P conversion.
  • 21
  • 6
Single-Mixture Audio Source Separation by Subspace Decomposition of Hilbert Spectrum
  • M. K. I. Molla, K. Hirose
  • Mathematics, Computer Science
  • IEEE Transactions on Audio, Speech, and Language…
  • 1 March 2007
TLDR
A novel technique is developed to separate the audio sources from a single mixture.
  • 88
  • 5
Tone nucleus modeling for Chinese lexical tone recognition
TLDR
This paper presents a new scheme to deal with variations in fundamental frequency (F0) contours for lexical tone recognition in continuous Chinese speech.
  • 64
  • 5
Filled pauses as cues to the complexity of upcoming phrases for native and non-native listeners
TLDR
We examined whether filled pauses (FPs) affect listeners' predictions about the complexity of upcoming phrases in Japanese.
  • 83
  • 5
Robust speech recognition based on a Bayesian prediction approach
TLDR
We study a category of robust speech recognition problems in which mismatches exist between training and testing conditions, and no accurate knowledge of the mismatch mechanism is available.
  • 72
  • 4
Phonetisaurus: Exploring grapheme-to-phoneme conversion with joint n-gram models in the WFST framework
TLDR
This paper provides an analysis of several practical issues related to the theory and implementation of Grapheme-to-Phoneme (G2P) conversion systems utilizing the Weighted Finite-State Transducer paradigm.
  • 50
  • 4