• Corpus ID: 15341108

Speech coding using mixture of gaussians polynomial model

Parham Zolfaghari, Tony Robinson
Affiliations: CREST/ATR Human Information Processing Research Labs, Kyoto 619-02, Japan (email: zparham@hip.atr.co.jp); Cambridge University Engineering Department, Cambridge CB2 1PZ, UK (email: ajr@eng.cam.ac.uk)

Abstract: We have investigated a novel method of spectral estimation based on mixture of Gaussians in a sinusoidal analysis and synthesis framework. After quantisation of this parametric scheme a fixed frame-rate coder operating at a bit…
Speech Compression by Polynomial Approximation
A method for compressing speech based on polynomial approximations of the trajectories in time of various speech features (i.e., spectrum, gain, and pitch), which can be integrated into frame-based speech coders, and can also be applied to features that can be represented as temporal series greater in duration than the frame interval.
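As a rough illustration of the general idea of approximating a feature trajectory with a polynomial (a minimal sketch, not the method of the paper summarised above), the snippet below fits a low-order polynomial to a synthetic pitch track and rebuilds the per-frame values from the coefficients alone. The function names, the 20-frame segment length, and the pitch values are all hypothetical and chosen only for the example.

```python
import numpy as np

def compress_trajectory(values, order):
    """Least-squares fit of a polynomial to one feature trajectory.

    Time is normalised to [-1, 1] so the fit does not depend on the
    absolute frame indices. Returns order + 1 coefficients.
    """
    t = np.linspace(-1.0, 1.0, len(values))
    return np.polyfit(t, values, order)

def reconstruct_trajectory(coeffs, num_frames):
    """Evaluate the fitted polynomial at each frame position."""
    t = np.linspace(-1.0, 1.0, num_frames)
    return np.polyval(coeffs, t)

# Hypothetical pitch track in Hz over 20 frames (synthetic data).
frames = 20
t = np.linspace(-1.0, 1.0, frames)
pitch = 120.0 + 15.0 * t - 10.0 * t**2

# Three coefficients stand in for 20 per-frame values.
coeffs = compress_trajectory(pitch, order=2)
approx = reconstruct_trajectory(coeffs, frames)
max_error = float(np.max(np.abs(approx - pitch)))
```

Here the synthetic track is itself quadratic, so the reconstruction error is at the level of floating-point rounding; real feature trajectories would leave a residual that depends on the polynomial order and the segment length.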


600-Bit-Per-Second Voice Digitizer (Linear Predictive Formant Vocoder).
A new approach to formant frequency determination, combined with a digital implementation, promises to eliminate these effects and to yield a useful formant vocoder and to provide speech synthesis at 600 bps.
A segment vocoder at 150 b/s
It is demonstrated in this paper that this random quantizer used in the original vocoder is near-optimal by comparing it with quantizers that use clustering algorithms for quantizing speech segments.
Formant analysis using mixtures of Gaussians
  • P. Zolfaghari, T. Robinson
  • Mathematics, Computer Science
    Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96
  • 1996
The paper describes a new formant analysis technique whereby the formant parameters are represented in the form of Gaussian mixture distributions. These are estimated from the discrete Fourier
The spectral envelope estimation vocoder
A low bit-rate vocoder designed for improved speech reproduction quality and robustness is described. It is built around a new algorithm, the spectral envelope estimator, which forms the nucleus of the spectral analyzer.
A segmental formant vocoder based on linearly varying mixture of Gaussians
A formant vocoder based on mixtures of Gaussians
  • P. Zolfaghari, T. Robinson
  • Mathematics, Computer Science
    1997 IEEE International Conference on Acoustics, Speech, and Signal Processing
  • 1997
A voiced/unvoiced classification mechanism has been developed based on the harmonic nature of each formant in the DFT spectrum modulated by the Gaussian mixture distribution, and intelligible synthetic speech has been obtained.
Magnitude-only reconstruction using a sinusoidal speech model
A sinusoidal model for the speech waveform is used to develop a new synthesis technique that requires specification of only the amplitudes and frequencies of the component sine waves, and preserves the short-time spectral magnitude during rapid movements of spectral energy.
Model parameter estimation for mixture density polynomial segment models
The experimental results showed that modelling both the mean and variance trajectories is consistently superior to modelling only the mean trajectory and results in significant improvements over the conventional HMM.
WSJCAM0: a British English speech corpus for large vocabulary continuous speech recognition
The motivation for the corpus, the processes undertaken in its construction and the utilities needed as support tools are described, and comparative results on these tasks for British and American English are reported.
Neural Networks for Pattern Recognition
The chapter discusses two important directions of research to improve learning algorithms: the dynamic node generation, which is used by the cascade correlation algorithm; and designing learning algorithms where the choice of parameters is not an issue.