Author pages are created from data sourced from our academic publisher partnerships and public sources.
Share This Author
An investigation of dependencies between frequency components and speaker characteristics for text-independent speaker identification
This paper proposed a new physiological feature which emphasizes individual information for text-independent speaker identification by using a non-uniform subband processing strategy to emphasize the physiological information involved in speech production. Expand
Voice Activity Detection Based on an Unsupervised Learning Framework
- D. Ying, Yonghong Yan, J. Dang, F. Soong
- Computer Science
- IEEE Transactions on Audio, Speech, and Language…
- 1 November 2011
An unsupervised learning framework to construct statistical models for VAD is proposed by a sequential Gaussian mixture model that does not rely on an assumption that the first several frames of an utterance are nonspeech, which is widely used in most VADs. Expand
Construction and control of a physiological articulatory model.
- J. Dang, K. Honda
- Mathematics, Medicine
- The Journal of the Acoustical Society of America
- 30 January 2004
A control method is proposed for the articulatory model based on the EP maps, in which co-contractions of agonist and antagonist muscles are taken into account and, by utilizing the co-contraction, the tongue tip and tongue dorsum can be controlled to reach their targets independently. Expand
Acoustic characteristics of the piriform fossa in models and humans.
MRI-based mechanical modeling, in-vivo experiments and numerical computations showed that the influence of the piriform fossa extends to the lower vowel formants in addition to the local troughs, and this global effect can be explained by the location of the fossa near the glottal end of the vocal tract. Expand
Morphological and acoustical analysis of the nasal and the paranasal cavities.
The results indicate that asymmetry between the two nasal passages can cause extra pole-zero pairs, and suggest that the paranasal cavities play an important role in shaping the spectral characteristics of human nasal sounds. Expand
Integration of articulatory and spectrum features based on the hybrid HMM/BN modeling framework
The approach taken in this study is to integrate features extracted from actual articulatory data with acoustic MFCC features in a way that allows recognition using MFCC only, and uses the probabilistic dependency between them. Expand
A Computational Tongue Model and its Clinical Application
A full 3D physiological model of the tongue was constructed based on MRI data from a normal subject, and the basic functions of the model based on anatomic and physiological knowledge were verified and suggested that the model is a useful tool for pre-operative planning of glossectomy. Expand
Acoustic characteristics of the human paranasal sinuses derived from transmission characteristic measurement and morphological observation.
The simulation with the four-zero model showed that the paranasal sinuses not only introduce antiresonances in the transfer function, but also change the spectral shape of the nasal formants. Expand
Visualisation of hypopharyngeal cavities and vocal-tract acoustic modelling
- K. Honda, T. Kitamura, +10 authors J. Dang
- Materials Science, Medicine
- Computer methods in biomechanics and biomedical…
- 15 July 2010
An acoustic model of vowel production was proposed with three components: voice source, hypopharyngeal cavities and vocal tract proper, which provides effective means in controlling voice quality and expressing individual vocal characteristics. Expand
A model-based investigation of activations of the tongue muscles in vowel production
The results suggest that the muscles Transversus and Verticalis play important roles in manipulating the length of tongue for vowel production; and, it is better to separate the Styloglossus into two control units, the anterior portion and posterior portion, in vowel production. Expand