Learn More
In most cases authors are permitted to post their version of the article (e.g. in Word or Tex form) to their personal website or institutional repository. Authors requiring further information regarding Elsevier's archiving and manuscript policies are encouraged to visit: Abstract Dysarthria is a speech motor disorder usually resulting in a substantive(More)
Dysarthria is a motor speech disorder that is often associated with irregular phonation (e.g. vocal fry) and amplitude, incoordination of articulators, and restricted movement of articulators, among other problems. The present study is part of a project on voice transformation systems for dysarthria, with the goal of producing intelligibility-enhanced(More)
Linguistic control factors such as word stress and phrase-finality affect not only duration and pitch, but also spectral structure and spectral dynamics. We propose methods for predicting and synthesizing prosodic effects on spectral balance: (i) A new multi-dimensional measure of spectral balance, in which energy is measured in five " formant frequency(More)
This paper focuses on generation of expressive speech, specifically speech displaying vocal affect. Generating speech with vocal affect is important for diagnosis, research, and remedia-tion for children with autism and developmental language disorders. However, because vocal affect involves many acoustic factors working together in complex ways, it is(More)
In natural speech, prosodic factors such as accent, stress, phrasal position and speaking style play important roles in controlling several acoustic features, including segmental duration, pitch, and spectral balance, i.e., the amplitude pattern across different frequency ranges of the power spectrum. To synthesize speech that sounds natural, these effects(More)
In this paper, a method of pitch contour modelling based on the hidden Markov model (HMM) states of an acoustic unit is presented. A pair of vectors is computed from the alignment of the speech data with the acoustic unit's HMM states. The pitch contour feature of the acoustic unit is represented by the vector pair so that the variants of the acoustic(More)
Accurate estimation of velar movements is useful for automatic speech recognition, speech enhancement, and diagnosis of certain speech disorders. This paper reports on initial results of a project on estimation of velar movements, for two-microphone setups where the microphones are differentially positioned to pick up nasal and oral speech output. Toward(More)
Automatic detection of different oral-nasal configurations during speech is useful for understanding normal nasalization and assessing certain speech disorders. We propose an algorithm to extract nasalization features from dual-channel acoustic signals that are acquired by a simple two-microphone setup. The feature is based on a dual-channel acoustic model(More)