Alexander Hewer

Learn More
We describe a minimally-supervised method for computing a statistical shape space model of the palate surface. The model is created from a corpus of vol-umetric magnetic resonance imaging (MRI) scans collected from 12 speakers. We extract a 3D mesh of the palate from each speaker, then train the model using principal component analysis (PCA). The palate(More)
Vocal tract magnetic resonance imaging (MRI) has become one of the preferred imaging modalities for the analysis of human speech production. However, the raw image data must be segmented before further analysis can take place. This paper describes a hybrid approach to extract a 3D tongue model from 3D or 2D MRI scans of the vocal tract during speech, which(More)
1 Motivation In specific application areas, obtaining higher order motion information is of great interest. An example of such information is the Lagrangian strain tensor [3] that plays a vital role in mechanical engineering. Since this ten-sor is computed by means of first-order motion derivatives, it is tempting to estimate the optical flow field with a(More)
We present a multilinear statistical model of the human tongue that captures anatomical and tongue pose related shape variations separately. The model was derived from 3D magnetic resonance imaging data of 11 speakers sustaining speech related vocal tract configurations. The extraction was performed by using a minimally supervised method that uses as basis(More)
We present an end-to-end text-to-speech (TTS) synthesis system that generates audio and synchronized tongue motion directly from text. This is achieved by adapting a 3D model of the tongue surface to an articula-tory dataset and training a statistical parametric speech synthesis system directly on the tongue model parameter weights. We evaluate the model at(More)
  • 1